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CLOSED MANIFOLDS WITH HOMOGENEOUS COMPLEX 
STRUCTURE.* 


By Hsten-CHunG WANG. 


1. Introduction. Nearly all the information about closed complex 
manifolds comes either from the theory of kahlerian manifolds or from that 
of almost complex manifolds. On one hand, we do not know any topological 
property of complex manifolds which is not a property of almost complex 
manifolds; on the other hand, very few examples—those given by Calabi- 
Eckmann [6]—of simply-connected, closed non-kihlerian complex manifolds 
are known. On account of this situation, it seems desirable firstly to discuss 
some particular classes of complex manifolds which may impart some ideas 
for the general case. If we consider the complex structure as a geometrical 
object, a very natural assumption will be homogeneity. In fact, a complex 
manifold W is called homogeneous if, given any two points of W, there exists 
an analytic homeomorphism of W carrying one to the other. It is the aim 
of the present paper to study the simply-connected, closed homogeneous 
complex manifolds which, for simiplicity, will be called a C-space. We find 
that there are many more C-spaces than we expect, and the underlying space 
of most of them has infinitely many inequivalent non-kahlerian complex 
structures. 


Concerning the general properties, we have 


THEOREM A. If a C-space has vanishing second Betti number, then its 
Euler characteristic must vanish. 


THEOREM B. Let M be a simply-connected closed manifold. If the 
Kuler characteristic x of M differs from zero, then M has at most a finite 
number of inequivalent homogeneous complex structures. If xy=0, then M 
has either no homogeneous complex structure or non-countably many inequi- 
valent ones. 


THEOREM C. Let K be a compact semi-simple Lie group, and X a 
closed connected subgroup whose semi-simple part coincides with the semi- 
simple part of the centraliser of a toral subgroup of K. Then the coset 


* Received March 18, 1953. 


OHIO UNIVERSI 
LIBRARY. 


il 


| 


2 HSIEN-CHUNG WANG. 


space K/X, if even-dimensional, has a homogeneous complex structure. 
Conversely, each C-space is homeomorphic with such a coset space. 


Using S*® as an example, we see that Theorem A does not hold for 
homogeneous almost complex manifolds. It is unknown to the author 
whether or not Theorem A is valid if we drop the homogeneity restriction. 
Theorem C tells us that the class of C-spaces is rather large. 

For the actual determination of the C-spaces, we introduce a class of 
manifolds, called M-manifolds. In fact, a coset space K/U of a compact 
simple Lie group K is called an M-manifold if U is the semi-simple part of 
the centraliser of a toral subgroup of K. All M-manifolds are eventually 
determined. In this class, we have, in particular, the space of compact 
simple Lie groups, the complex Stiefel manifolds, the quarternion Stiefel mani- 
folds and the ordinary Stiefel manifolds of the form V2. The M-manifolds 
and C-spaces are related as follows: 


THEOREM D. Lach even-dimensional M-manifold has infinitely many 
inequivalent homogeneous complex structures and so does the product of two 
odd-dimensional ones. They have vanishing second Betti number, and hence 
they are non-kahlerian. Moreover, each C-space W is homeomorphic with a 
fibre decomposition space of a product of M-manifolds with torus as the 
fibre (i.e., the product is a torus bundle over the underlying space of W). 


It is to be noted that all these theorems still hold if, instead of simply- 
connectedness, we asume the spaces to have finite fundamental groups. The 
proof is essentially the same. 


2. Complex coset spaces. Let @ be a complex Lie group. A subgroup 
L of G is called a complex subgroup if L is closed and the underlying point 
set of L is a complex submanifold of G (by a complex submanifold J of a 
complex manifold 1/, we mean a topological submanifold with the property 
that, for each point q of M, there exist analytic functions fj, f.,---, fs 
defined in some neighborhood U of gq such that 


Jn U = {p: fi(p) = 0,1 = 1, 2,- S}). 


The identity component of a complex subgroup of a complex Lie group G is 
generated by a subalgebra (over complex field) of the complex Lie algebra 
associated with G. Conversely, the subgroup of G generated by such a sub- 
algebra, if closed, is a complex subgroup. 

Let L be a complex subgroup of a complex Lie group G. A complex 
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structure can be defined over the space G/L of left cosets in the usual manner. 
In fact, let ge G, g—=7(g) where 7: G—+G/L denotes the projection, and 
let %e(g) be the class of analytic local functions on G at g. We set Wax(q) 
to be the class of all complex-valued functions ¢ defined in some neighborhood 
of g such that dz e %ce(g). Using the same arguments as in the real analytic 
case, we can verify that the classes M%¢/z(q), ge G/L, of local functions 
defines a complex analytic structure over G/L. The resulting complex mani- 
fold will be called the complex coset space G/L. With respect to this complex 
structure, the mapping g: G/L > G/L given by is complex 
analytic where g denotes an element of G. Thus the complex coset space 
G/L is a homogeneous complex manifold. 

In general, a homogeneous complex manifold cannot be written as a 
complex coset space. Nevertheless, if compactness is assumed, we have the 
following: 


(2.1) Hach closed homogeneous complex manifold is analytically homeo- 
morphic with a complex coset space. 


Proof. Let W be a closed homogeneous complex manifold, and B the 
identity component of the group of all analytic homeomorphisms of W. 
Bochner and Montgomery [3] have shown that a complex structure can be 
given to the space of B such that (i) B becomes a complex Lie group with 
respect to this structure, and (ii) the mapping T: BX W—W given by 
T(g,v) =g(x), ge B, xe W, is analytic. From the homogeneity of W, 
we see that B is transitive over W. Let LZ be the isotropic subgroup of G 
at a point g of W. L is evidently closed. Moreover, from (ii), Z is, locally, 
defined by analytic equations in B. Thus L is a complex subgroup of B, 
and thus we can form the complex coset space B/Z. Since W is a manifold, 
the mapping 0: B/L W given by 6(gL) B, is an onto homeo- 
morphism. In fact, @ is two-sided analytic. To see this, let T,: B—> W be 
defined by T,(9) = 9(q) =T(g,q), and let y be an analytic local function 
on W. It is easy to see that yT, = W6r where 7: B—> B/L denotes the pro- 
jection. Because of (ii), 7, is an analytic mapping so that ~T, or Yr is an 
analytic local function on B. From the definition of complex coset spaces, 
it follows that y@ is an analytic local function on B/LZ. This tells us that 
the mapping @ is analytic. The analicity of its inverse 6 follows as a 
consequence [1], p. 179. (2.1) is thus proved. 

If the fundamental group of W is assumed to be finite, stronger assertion 
than (2.1) can be obtained. For this purpose, we shall first show a trivial 
property of transformation groups. 
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(2.2) Let W=K/L be a coset space of a compact Lie group K. If 
the fundamental group ,(W) is finite, then the maximal semi-simple sub- 


group J of K is transitive on W. 


Proof. Since K is compact, its maximal semi-simple subgroup J is a 
normal subgroup, and moreover, K/J is abelian. It follows that JL is a 
closed normal subgroup and that K/JL is a toral group. Since all the groups 
involved are Lie groups, we know [11], p. 30, that K/L is a fibre bundle 
over K/JL with JL/J as the fibre. JL/L is connected as J is connected. 
Thus the fundamental group 7,(K/JL) of the base space is a factor group 
of the fundamental group 7,(K/L) of the bundle space. By hypothesis, 
a,(K/L) is finite, and thus 7,(K/JLZ) is also finite. On the other hand, 
we know that K/JZ is a toral group. Therefore, K/JZ can only contain the 
identity, or in other words, K =JL. This tells us that J is transitive on 
W K/L. 


(2.3) Let W=B/L be a compact complex coset space. If the funda- 
mental group of W is finite, then each maximal connected semi-simple 
subgroup G of B is transitive over W. Moreover, W is analytically homeo- 
morphic with the complex coset space G/(GN L). 


Proof. It is well-known that such a G@ is generated by the semi-simple 
part of a Levi-decomposition of the complex Lie algebra of B. Thus G is a 
complex subgroup. Malcev [8] has shown that all the G’s are conjugate 
in B. Therefore, the choice of @ is immaterial. Let K bea maximal compact 
subgroup of B. We know that K is transitive over W [9]. It follows from 
(2.2) that the maximal connected semi-simple subgroup J of K is transitive 
on W. This J is contained in a maximal connected semi-simple subgroup @ 
of B. Thus G is transitive over W. This proves the first part of (2.3). 

To show the second part, let us put L’=GQO JL. Since both G and L 
are complex subgroups, L’ is also a complex subgroup, and thus the complex 
coset space G/L’ is well-defined. Because of the transitiveness of G on B/L, 
the mapping f: G/L’ B/L defined by f(gL’) =gL is an onto homeo- 
morphism. We can easily verify that f is analytic. (2.3) is proved. 


3. Solvable subgroups. From the above discussions, the study of com- 
pact homogeneous compact spaces is reduced to the study of complex coset 
spaces G/L. In this section, we shall establish some necessary conditions on 
the maximal solvable subgroups of Z in order that G/L is compact. 


It is desirable to say here a few words about our notations. Let G be 
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a Lie group. K(() always denotes a maximal connected compact subgroup 
of G. By dimension of G, we shall always mean its topological dimension. 
Thus for a complex Lie group G, dim. G@ is twice of the dimension of its 
complex Lie algebra. 


(3.1) Let W=G/L be a coset space of a Lie group G. If 
(3. 2) dim. G— dim. K(G@) = dim. L — dim. K(L), 


then W is compact. Conversely, if W is compact and has a finite fundamental 
group, then equality (3.2) holds. 


Proof. Suppose that (3. 2) holds. Then dim. W = dim. K(G) — dim. K(Z). 
Let R be an orbit in W under K(G). Then R is homeomorphic with 
K(G)/(K(@) 9 L), and then dim. R = dim. K(G) — dim. (K(G) N L). Since 
K(G@) OL is a compact subgroup of Z, its dimension cannot be greater than 
that K(L). It follows then 


dim. k = dim. K(G) — dim. K(L) = dim. W, 


and then dim. R =dim. W. But F# is a closed manifold so that R = W. 
This tells us that W is compact. 

Conversely, suppose that W is compact and has finite fundamental group. 
Choose a K(G) such that K(G@) contains a K(L). From a theorem of 
Montgomery [9], A(G@) is transitive on W. Thus 


dim. G — dim. L = dim. W = dim. K(G@) — dim. (K(G) NL). 


From our choice of K(G@), we known that K(Z) must be the identity com- 
ponent of K(G) OL. Thus dim. K(L) = dim. (K(G) NL). This equality 
together with the preceding one gives (3.2). (3.1) is proved. 

Now we shall*state some known results in a form suitable for our 
purposes. Let G be a complex semi-simple Lie algebra, and H a Cartan 
subalgebra of G. An element h of H is called rational if all the roots of h 
(i.e., the characteristic roots of a(h) where « denotes the adjoint repre- 
sentation of G) are rational numbers. Evidently, h is rational if and only 
if there exists an integer m such that exp maia(h) is the identity. We shall 


call a base hi, ho,- - -,h, of H a rational base if each h; is rational. The 
existence of rational base is well-known. Let A, A2,* * *, Az be the coordinates 
in H with respect to a rational base h,,h2,---,hy Let ¢ be a linear 


representation of the Lie algebra G. Then each weight A of ¢ can be written 
as a linear combination of the A’s: + +: Since the 
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base is rational, the coefficients r; must be rational numbers. Thus we can 
order the weights in the classical manner. In fact, let 


+° °° A’ = 8A, 


be two weights. We say that A< A’ if, for some 1, we have r;=%, 
= S2,° < Sinn. If, in particular, @ is the adjoint repre- 
sentation of G. Then the weights become roots of G. Thus the roots have 
an ordering. Let ep, e, be two root vectors corresponding to the roots p, 
respectively. Then we say ép < e, if and only if p<y. It should be noted 
that this ordering depends on the choice of the rational base of H. 

Let N be the subspace spanned by all the root vectors ép,, @p,,° * * 5 p, 
which are greater than zero, and S=N-+H. A direct verification tells us 
that S is a maximal solvable subalgebra of G. Since 0 < ep if and only if 
ep <0, we know that the number ¢ of positive root vectors is equal to one | 
half of the total number of root vectors. Thus ; 


dim. S (dim.G + 1)/2 


where J denotes the rank of G. Morosoff [10] has shown that each solvable 
subalgebra of G is conjugate to a subalgebra of S. Thus all maximal solvable | 
subalgebra of G are conjugate. Moreover, each maximal solvable subalgebra | 
of G can be constructed in the above manner by choosing a suitable H and a 
suitable rational base of H. , 

Now let G a complex Lie group corresponding to the Lie algebra G. 
We consider the subgroup S generated by S in G. Since S contains a Cartan 
subalgebra H of G, S must be closed [7], and hence a complex subgroup of G. 
Obviously, S is a maximal connected solvable subgroup, and 


(3. 3) dim. S = 2 dim. S = dim. G + 1 =1-+ (dim. @) /2. 


Since S is solvable, each K(S) is a toral group. From the fact that all 
maximal connected solvable subgroups of G are conjugate to S, it follows that ~ 
K(S) is a maximal toral subgroup of G. Thus dim. K(S) =1. This equality | 
together with (3.3) gives 


(3.4) dim. G—dim. K(G@) = (dim. G@)/2 = dim. 8S — dim. K(8). 


Now we can show easily the following: 


(3.5) Let B be a complex Lie group. All its maximal connected 
solvable subgroups M are complex subgroups and are conjugate to one 
another. Moreover, 7,(B/M) is finite and 


dim. B — dim. K(B) = dim. M — dim. K(M). 
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Proof. Let BGR be a Levi-decomposition of B where G denotes a 
maximal semi-simple subgroup of B and RF the radical. Evidently, each 
maximal connected solvable subgroup M of B takes the form M = SR where 
S is a maximal connected solvable subgroup of G. Thus M is a complex sub- 
group of B. From the results of Morosoff, we know that all such S’s are con- 
jugate in G. It follows that all such M’s are conjugate in B. The first 
part of (3.5) is proved. For the second part, we consider the coset space 
W+—B/M. Since FR is a common normal subgroup of B and M, we have 
W = (B/R)/(M/R) =G/S. (3.4) together with (3.1) tells us that 
W=B/M is compact. G is a complex semi-simple Lie group so that its 
fundamental group is finite. From this fact and the connectedness of 8, it 
follows that the fundamental group of W is finite. Now applying the second 
part of (3.1) to W, we get the equality 


dim. B — dim. K(B) = dim. M — dim. K(M/). 
This proves (3.5). 


(3.6) Let W = G/L be a complex coset space of a complex semi-simple 
Iie group G, and M a maximal connected solvable subgroup of L. If W 1s 
compact and has finite fundamental group, then dim. M = (dim. @)/2 + v 
where v denotes the dimension of a maximal compact subgroup of M. 


Proof. From (3.1), we know that 
dim. L — dim. K(L) = dim. G — dim. K(G@). 
Proposition (3.5) tells us that . 
dim. L — dim. K(L) = dim. M — dim. K(M). 


Thus dim. Jf — dim. K(J/) = dim. G—dim. K(@). Here G@ is a complex 
semi-simple Lie group so that dim. K(G@) = (dim. @)/2. It follows then 
dim. JJ = (dim. G)/2-+ dim. K(M). (3.6) is proved. 


4, J-subalgebras. In this section, we shall introduce and study a class 
of subalgebras which facilitate to determine the structure of the isotropic 
subgroup L of the compact complex coset spaces G/L. Let H be a Cartan 
subalgebra of a complex semi-simple Lie algebra G. A linear subspace P of 
dimension p in H is called a rational plane or a rational subalgebra if P 
contains p linearly independent rational elements. If p—1J—41 where 
1= rank (G) = dim. H, then we call it a rational hyperplane. Now let P 
be a p-dimensional rational plane. We choose p linearly independent rational 
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elements /,,---,k, in P. Let « denote the adjoint representation of G. 
With respect to a suitable base of G, each a(/;) takes the form of a diagonal 
matrix. From the definition of rational elements in H, we know that the 
elements of these diagonal matrices a(k;) must be rational numbers. It 
follows that, in the adjoint group G* of G, P generates a closed subgroup 
isomorphie with the direct product JT?  R? of a p-dimensional toral group 
and a p-dimensional vector group. Since each group form of the complex 
semi-simple Lie algebra G is a covering group of G* with finite leaves, the 
above property of P not only holds for G* but also for any group form of G. 


Der. A subalgebra J of H is called a J-subalgebra if it does not lie on 
any rational hyperplane of H. 


(4.1) Let H be a Cartan subalgebra of a complex semi-simple Lie 
algebra G of rank l. Let J be a subalgebra of H, and J the subgroup generated 
by J in a group form G of G. If J contains an l-dimensional vector subgroup, 


then J is a J-subalgebra. 


Proof. If J lies on a rational hyperplane P, then J would be a sub- 
group of the group P generated by P in G. We know that P is isomorphic 
with the direct product T'* & R* of an (J —1)-dimensional group and an 
(J —1)-dimensional vector group. P has no /-dimensional vector subgroup. 
A contradiction is obtained. Hence J is a J-subalgebra. 

Before establishing further properties of J-subalgebras, we have to digress 
a little and define some terminiologies used in representation theory. Let ¢ 
be a representation of the Lie algebra G, and V the complex vector space on 
which #(G) takes place. We shall fix a Cartan subalgebra H of G and a 


rational base h,,- - -,h, of H. The coordinates with respect to this base of 
H will be denoted by A,,- - +, A; Associated with ¢, there are a finite number 
of linear forms A,, As,- - +, As, over H, called weights. To each weight A 


corresponds a linear subspace Vy, called weight space, such that, for each 
vector 7 of V, and each element h of H, we have ¢(h)n = A(h)y. Moreover, 
the space V is decomposed into the direct sum V—=V ,+V;,+---4+7, 
of the weight spaces. Since we have a fixed coordinate system \y,° - -,A; in 
H, we can write the weights as linear combinations of the )’s: 


k=1 
The base h;,- --,h; of H is rational so that the coefficients mj, are all 


rational numbers. 
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(4.2) Let J CH be a J-subalgebra and V* be linear subspace of V 
invariant under $(J). Then 


Here all the notations have the same meaning as in the preceding paragraph. 


Proof. Let A; and A; be two distinct weights. We shall first see that 
Ai(h) #Aj(h), he J. Tf they were identical for all h in J, then J would 
lie in the hyperplane P which is defined in terms of the coordinates 


D (Miz — Ay = 0. 
k=1 


Since h,,- + -, 4; form a rational base of H and the m’s are rational numbers, 
this P is a rational hyperplane. This contradicts our assumption that J is a 
J-subalgebra. Thus A;(h) #Aj(h), he J. We have only a finite number 
of weights so that there exists an element hy of J such that Ai(ho) 4 Aj(ho) 
whenever 1 > 7. 


Now let é be a vector of V*. It can be written in the form: 


&eVs, 
Thus 


(4 (to) (As (lho) k—=0,1,2,- °°. 


Since A;(ho) # Aj(ho) for 1~j, the vectors can be written as 
linear combinations of 


Eb (ho) E, (ho) (PC ho) 


But on account of the invariance of V* under $(J), the latter vectors belong 
to V*. It follows then é;e¢ V*, and then §eV*M V,,. Proposition (4. 2) 
is therefore proved. 


(4.3) Let L be a subalgebra of G such that L contains a J-subalgebra 
J of H. Then L is spanned by LQ H and some root vectors of G. 


Proof. In (4.2), let us take @ to be the adjoint representation of G. 
Then V =G, and the weights of ¢ are 0 and the various roots of G. The 
weight space V, corresponding to zero is H, and the weight space Vp corre- 
sponding to a root p of @ is the one-dimensional space containing the root 
vector é». Taking V* = L, we get (4.3) immediately from (4. 2). 
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5. The structure of the isotropic subgroup. Let W = G/L be a com- 
plex coset space of a complex semi-simple Lie group G. We shall always 
assume that W is compact and has finite fundamental group. Let M be a 
maximal connected solvable subgroup of Z. Choose a maximal connected 
solvable subgroup S of G such that MC 8S. Let G, L, S, M be the complex 
Lie algebras of G, L, S, M respectively. From the facts that MCS and 
that M is a maximal solvable subalgebra of L, we see immediately that 
M=LOS. From § 3, there exists a Cartan subalgebra H of G@ and a 
rational base h,,- --,h, of H such that S is spanned by H and all the 
positive root vectors ép,, €p..° °°, ép, of G (the ordering of the root vectors 
is defined with respect to the base hy,---,hi). Here ]—rank(@) and 
dim. G = 2¢+1. From (3.6), we know 


dim. M = (dim. G)/2 + v= 2t+/+ 2, v=dim. K(M). 
Thus 
(5. 1) dim. M=i+ (1+ v)/2. 
Let JHMM. Since both M and H are linear subspaces of S, 
dim. J = dim. M + dim. H — dim. S = (1+ v)/2. 


Let J and H be the subgroups generated by J and H in G respectively. H is 
evidently closed. The subgroup J, being the identity component of the inter- 
section MM H of two closed subgroups, is also closed. Moreover, 


dim. J = 2dim. J=/1+ 


We can always write the connected abelian Lie group J in the product form: 
J =T* X R® where T* denotes an n-dimensional toral group and R? a 
b-dimensional vector group. Evidently, a+b—dim.J=/1+ v. Since 
JCM, aSdim. K(M) =v. It follows then On the other 
hand, the subgroup H, being generated by a Cartan subalgebra, is isomorphic 
with the direct product of an J-dimensional toral group and an 
l-dimensional vector group. Hence H, and hence J, cannot have any vector 
subgroup of dimension higher than /. Thus =1,a—v and X 
From (4.1), we know that J must be a J-subalgebra. (4.3) then tells us 
that L is spanned by LN H and some root vectors of G. Now we shall see 
that L contains all the positive root vectors. For this purpose, let us consider 
M. Since J CM, M is spanned by MOH and some root vectors, say 
Then dim. M = dim.(MN HB) +s=—dim. J+s=—s+ (1+ 1)/2. 
Comparing this equality with (5.1), we get st. Since M CSS, all these 
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root vectors éq,,° °°, @a, must be positive. But we have only ¢ positive root 
vectors. Thus M, and hence LE, contains all the positive root vectors. 
Summing up the above results, we have 


(5.2) Let W= G/L bea complex coset space of a complex semi-simple 
Lie group G. Suppose that W is compact and has finite fundamental group. 
Let G, L be the complex Lie algebras of G, L respectively. Then there exists 
a Cartan subalgebra H of G and a rational base h,,- - -,h, of H with the 
following properties: (i) LO H is a J-subalgebra of H, (ii) L is spanned by 
LOH and some root vectors of G, and (iii) all the root vectors which are 
positive with respect to the base hi,- - -,h; le on L. 


To study further properties of L, let us denote by © the totality of roots 
B of G such that both eg and eg belong to L. The roots in ® occur in 
pairs B, — B. 


(5.3) The linear subspace Q spanned by 


{ep, ha = [eg, e-g]: 
is a semi-simple subalgebra of L. 


Proof. Since [hg,Q@] C Q, to show that Q is a subalgebra, it suffices 
to show that [eg, ¢,]e@Q for B,ye®. If 8+ y is not a root of G, then this 
evidently holds. Now assume 8 + y =r to be actually a root of G. Since 
eg and e, belong to L, the root vector e, belongs to L as well. By definition 
of ©, we know that e_g, e-yeL. It follows then [e_g, e_,] ¢ L, and then e_,eL. 
This tells us that + © ®, or in other words, e,e Q. But e, and [eg, e,] only 
differ by a non-zero scalar. Thus [eg, ey]e¢@. Thus Q forms a subalgebra. 
Its semi-simplicity is evident. 


In fact, Q is the essential part of L. The following property of Q plays 
a basic réle in our study of compact homogeneous complex manifolds. 


(5.4) There exists in H a rational subalgebra H, such that the cen- 
traliser C of H, is decomposed into the direct sum H, @ Q of H, and Q. 


Proof. Let H, be the totality of elements h of H such that [h, Q@] = 0. 
It can be easily verified that 


H, = {h: he H, B(h) for all in 


Evidently H, is a subalgebra of H. To see that it is rational, let us put, for 
each Be @, 


Ps = {h: he H, B(h) =0}. 
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This Pg is a rational hyperplane. Since H, is the intersection of all the Pg, 
Bed, it follows that H, is rational. 

Let 7* be the rank of Q. Then among the roots of Q, or what is the 
same, among the roots @ in @, there are exactly /* independent ones. Thus 
among the rational hyperplanes Pg, there are 1* independent ones, and thus 
dim. H, = dim. (1 Pg) =1—I1*. Now let H. be the space spanned by 
{he = [eg, e-g]: Be @}. This is evidently a Cartan subalgebra of Q, and 
thus dim. H, = /*. Moreover, since is semi-simple and [H,,Q]| 0, we 


have 
H, Q=0. 


A counting of dimensions of H, H, and H, tells us at once that H = H, + H:. 

Let C be the centraliser of H, in G. Obviously, H CC and Q CC. 
Thus to prove (5.4), it is sufficient to prove C CH,-+@Q. We shall prove 
this by using method of contradiction. Suppose CGC H,+@Q. Since C 
contains a Cartan subalgebra H, C is spanned by H and certain root vectors 
of G. But we already know that H=H,+H.,CH,+@Q. Thus there 
must be a root vector ep of G such that epeC, ep9#Q. From the definition 
of C and Q, we see at once that e_p also has this property, i.e., ep eC, ep#¢Q. 
Thus there exists always a positive root vector of G which belongs to C but 
not to Q. Let D be the linear subspace spanned by all such positive root 
vectors. Then DA0,DCC,DAQ=0. Moreover, D C L on account of 
(5.2). Firstly, we want to show [Q,D] C D. Since [H, D] C D, it suffices 
to show that, for each ege and each epe D, [eg, ep] If B+ is not a 
root of G, then [eg, ep |] = 0 evidently belongs to D. Now suppose 8 + p=—r 
to be a root of G. Since eg,ep e L, e- e L. The vector e_, cannot belong to L, 
for if so, we would have ®, e,e Q, Q, and then epe Q which is 
impossible. Thus e_,¢L. From this fact, it follows that e, is positive and 
er£Q. We know that eg,epeC so that e, which is a multiple of [eg, ep| 
must belong to C. Thus e,e D, and hence [eg,ep]¢D. This proves that 
[Q,D] CD. 

Let ¢ be the representation of the complex semi-simple Lie algebra Q 
over the linear space D defined by 


$(x)(y) = [2, y],reQ,yeD. 


On account of the fact [Q,D] C D, this representation is well-defined. Let 
Cp, Cp’ * » &p, be the positive root vectors which span D. For each h” of 
H., we have 


$(h”) (€p,) = ep,. 


| 


et 
of 
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Since H, is a Cartan subalgebra of Q, ep, are weight vectors of the repre- 
sentation ¢. Their weights are the linear forms p; confined to H,. It is 
well-known that the sum of all the weights of a representation of a complex 
semi-simple Lie algebra is equal to zero. Thus, for each h” of H., we have 


> pi(h’”) =0. Let h be a general element of H. It can be written in the 
4=1 
form h=h’ + h” where h’c H,, cH. Thus 


pi(h) = pi(h’) pi(h”) = pilh’). 
But each ep, belongs to the centraliser C of H,. Thus p;(h’) 0, and thus 
pi(h) = 0, heli. 
4=1 
From the definition of D, all the root vectors ép,,- - *, ép,, are positive in the 


sense of § 3 so that § p;(h) cannot vanish for all h in H. <A contradiction is 
i=1 


therefore obtained. This proves (5. 4). 


6. The underlying space of homogeneous complex manifolds. Now we 
are in position to give some definite results about homogeneous complex 
manifolds. To simplify the statements, we find it desirable to lay down 
here two definitions. 


Der. <A simply-connected, compact homogeneous complex manifold will 
be called a C-space. 


Der. Let K be a compact semi-simple Lie group. By a C-subgroup of 
K, we mean a closed and connected subgroup whose semi-simple part coin- 
cides with the semi-simple part of the centraliser of a toral subgroup of K. 
Obviously, the identity is a C-subgroup, each toral subgroup is a C-subgroup 
and the centraliser of a toral subgroup is a C-subgroup. 


THEOREM I. Lach C-space is homeomorphic with a coset space K/X 
where K denotes a compact semi-simple Lie group, and X a C-subgroup of K. 


Proof. Let W be a C-space. From (2.3), we can regard W as a com- 
plex coset space @/L of a complex semi-simple Lie group G. Suppose that 
G, L, H, H,, C, , h:,: - -,h; have the same meaning as in § 5. From now on, 
both real Lie algebra and complex Lie algebra will occur in our discussions. 
To distinguish them, we shall always use an ‘0’ as upper index in each real 
Lie algebra. It is known that the root vectors of G can be so normalised that 
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i[er, ep] are rational elements and [ég, eg] is a rational multiple of ea.g 
whenever «+ 8 is a root [13]. From now on, all the root vectors ep are 
assumed to be normalised. Let K° be the Lie algebra consisting of all the 
linear combinations, with real coefficients, of the following vectors: 


+, thi, i(ep + ep), — p= roots of G}. 


The subgroup K generated by this real Lie algebra K° is a maximal compact 
subgroup of G. By Montgomery theorem [9], K is transitive on W. Thus 
W is homeomorphic with the coset space K/(K NL). Let Y¥=KOL. 
To prove our theorem, it suffices to show that X is a C-subgroup of K. 
Firstly, we find that the Lie algebra X° of X is equal to K°N L, and thus 
is spanned, in the sense of vector space over real field, by 


(6. 1) {H° L, i(eg + eg), es —e-p: Be O} 


where H® denotes H. 

Now consider the subalgebra H, of H discussed in (5.4). Since it is a 
rational subalgebra of H, it has a rational base k,,- --,ka, a—=dim. My. 
Thus the real Lie algebra H°, = H, M K° is spanned, over the field of real 
numbers, by tk,,- - +, tka. It follows then that (a) H°, generates a toral sub- 
group 7, of K, (b) the centraliser of H°, in K° coincides with the intersection 
K° 1 C of K° with the centraliser C of H, in G. From § 5, we can easily see 
that K° MC is spanned, over the real field, by 


{H°, i(eg + ep), — eg: Be 


This together with (6.1) tells us that K° MC and X° have the same semi- 
simple part. Thus the centraliser of H°, in K° and X° have the same 
semi-simple part. Passing this relation to their respective groups, we obtain 
the fact that XY and the centraliser of 7, in K have the same semi-simple 
part. Moreover, because the simply-connectedness of W, X is connected. 
Thus X is a C-subgroup of K. This proves our theorem. 


CoroLLtary. Let W be a C-space containing more than one point. If 
W has vanishing second Bettt number, then its Euler characteristic also 


vanishes. 


Proof. Since W is simply-connected, the second homotopy group 72(W) 
of W is isomorphic with the 2-dimensional homology group of W with integer 
coefficients. Thus z2.(W) is finite. From Theorem I, W is homeomorphic 
with the coset space K/X where K is a compact semi-simple Lie group and 
X a C-subgroup of K. Thus z,(K) is finite. From the finiteness of both 
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n,(W) and z,(K), it follows that the fundamental group 7,(X) of X is finite. 
Hence X must be semi-simple. X is a C-subgroup so that XY is the semi- 
simple part of the centraliser U of a certain toral subgroup T, of K. Suppose 
that the Euler characteristic of W does not vanish. Then X and K have the 
same rank. Such a subgroup X has discrete centraliser in K. Thus 7, con- 
tains only the identity. It follows then that K —U—X. In other words, 
W =K/X consists of a single point. The corollary is therefore proved. 


7. Converse of Theorem I. It is the purpose of this section and the 
next to prove the following theorem which can be regarded as the converse 
of Theorem I. 


THEOREM II. Let X be a C-subgroup of a simply-connected compact 
semi-simple Lie group K. If K/X is even-dimensional, then K/X has a 
homogeneous complex structure, or in other words, K/X is homeomorphic 
with a C-space. 


We shall first prove a few propositions as lemmas. 


(7.1) Let K be a compact semi-simple Lie group, and X a C-subgroup 
of K with semi-simple part U. Then there exists a toral subgroup T, of K 
having the following properties: 
(i) TZ, contains the identity component T, of the center of X, 
(ii) U is the semi-simple part of the centraliser of T,, 
(iii) dim. 7, + rank (U) = rank (XK). 


Proof. let N be the centraliser of U. Then 7, C N, and then 7’, is 
contained in a maximal toral subgroup, say 7,, of N. We shall see that 
this 7, has all the required properties. By hypothesis, X is a C-subgroup so 
that U is the semi-simple part of the centraliser B of a toral subgroup, say 7’, 


| of K. Evidently, B and K have the same rank. Let 7” be the identity 


component of the center of B. We have then 


dim. 7” +- rank (U) = rank (B) = rank (XK). 


| Since 7” C N, there exists an element p of N such that pT’p* C T, because 
| T, is a maximal toral subgroup of N. It follows then that dim. 7’ < dim. 7, 


and then that dim. 7, + rank (U) =rank (K). On the other hand, 7,U 
forms a subgroup of K and, because of the semi-simplicity of U, the inter- 
section 7; U is discrete. Thus 


rank (K) = rank (T,U) = dim. T, + rank (UV). 
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This inequality together with the preceding one tells us that 
rank (K) = dim. 7, + rank (U). 


Therefore, the property (iii) is established. 

Since property (i) is evident from our construction, we shall only con- 
sider the property (ii). We first observe that T CT’. Thus p7p* C T;. 
From the definition of 7, it follows that pUp-' is the semi-simple part of the 
centraliser of p7Tp-*. On account of the fact pTp* C T,, we know that 
pUp™ contains the semi-simple part U’ of the centraliser of T,. But pe N 
so that pUpt—U. This tells us that U D U’. On the other hand, we 
already know that U is semi-simple and that U belongs to the centraliser of 
T,. Hence U C U’, and hence U =U’. (7.1) is hereby proved. 


(7.2) Let G be a complex semi-simple Lie group, G the complex Lie 


subgroup of G such that its Lie algebra (a Lie algebra over real field) H°, ts 
contained in H. Then there exists m linearly independent rational elements 
hi, ho, + hm of H such that thy, thm span, with real coefficients, 


the algebra H°,. 


generates the circle group 77;. The elements hy, ho, - -,4m obviously have 
all the required properties. Thus to prove (7.2), it suffices to prove it for | 
m = 1. 


Let G* be the linear adjoint group of G, and z: G— G* be the natural 
homomorphism. The image z(7") is evidently also a circle group. Let x be 


algebra of G, H a Cartan subalgebra of G. Let T™ be an m-dimensional toral 


Proof. Suppose that (7.2) holds for m1. We decompose 7” into | 
the direct product 71; of m cirele groups. By our 
supposition, for each j, there exists a rational element h; of H such that ih; | 


an element of G which generates T* in G. Then the same element a will | 
generate 7(7") in G*. Thus 7(7") consists of all the linear transformations 
{exp ta(x): teal numbers} where a denotes the adjoint representation [ 
of G. Since z(7") is a circle group, there exists a real number fy different f 
from zero such that exp {,¢(x) is the identity. It follows that the charac- f 
teristic root of t)%(x) must be of the form: : 


where the n’s are integers. Taking h = t)x/zi, we see immediately that the 
characteristic roots of a(h) are all integers. Thus h is a rational element f 
of H. Moreover, ih evidently generates the group T' in G. (%.2) is hereby 


proved. 
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(7.3) Let hi, ho,- + +,hi be a rational base of a Cartan subalgebra H 
of a complex semi-simple Lie algebra G. Let H®° denote the set of all linear 
combinations with real coefficients of +, th; Suppose that L is a sub- 
algebra of G whose normaliser M in G contains H. Then if H° QO L generates 
a closed subgroup in a group form G of G, so does L. 


Proof. Wet L, M and T? be the subgroups generated by L, M and H° 
in G respectively. Then 7’ is a toral group of dimension /, and Z is a normal 
subgroup of M. Since M contains a Cartan subalgebra, M is closed in G. 
For each element x of G, let us denote by g(x) the one-parameter subgroup 
generated by zx in G. On account of a theorem of Malcev [7], to show that D 
is closed in G, it suftices to show that, for each ae L, the closure g(x) of 
g(x) lies on L. If g(x) is not compact, then g(a) is isomorphic with the 


' additive group of reals, and hence g(x) =g(x) CL. Now assume g(x) to 


be compact. Since xe LC M and M is closed, g(x) C M. From the fact 
that H° C H C M, we know that T' is a maximal toral subgroup of M. There 
exists then an element q of M such that gg(x)q C T'. But L is a normal 
subgroup of M and g(x) C L. Therefore, we have 


Let T* be the identity component of T'M LZ. From the connectedness of g(x), 

qg(x)q* C T*, and hence qg(a)q? C T*. The subgroup T*, being generated 

by H° OL, is closed by hypothesis. Therefore, , 
C T#=TFCTINLCL 


whence g(2) C g*Lqg—L. Thus L is closed. 


(7.4) Let H be a Cartan subalgebra of G, and H, a rational subalgebra 
of H. Suppose that hy, +, ha, haw,’ form a rational base of H 
such that hy, span Hy. Then the linear subspace L spanned by 


{LO H, eg, ep: Be 9, pe 


forms a subalgebra of G where @ denotes the totality of roots B of G such 
that [eg,H,] 0, and ©’ the totality of positive roots with respect to the 
rational base hy, hy. 


Proof. Firstly, we see at once that 
[LOH,L] CL, CL, [ep erJeL 
whenever £, ye and Thus to prove (7%. 4), it is sufficient to show 
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that [eg, ep] eL for Be @ and pe’, p¢®. For this purpose, let us denote 
by A1,°**, Az the coordinates in H with respect to the rational base h,,--- , hi. 
We write the roots 8, p as linear combinations of the 2’s: 


B T1A, + + + NasiAas + + riAr 


+ + SaAa + + + SiA1- 
Since = 0, B(h’) =0 for all h’ in H,. It follows then r; =r, 
=1,=0. Let j be the least integer such that s; 40. The root p is positive 
so that s; > 0. If >a, then p(hi) =p(hz) =: -=p(ha) =0, and then 
[ep, H,] =0 which contradicts the fact p#@. Therefore, 7 Sa, and we have 


B+ p=Sjj + + (Sasi + + + 11) AL 


Since s; > 0, 8 + p is always positive in our sense. The commutator [eg, ep], ° 


being either zero or a multiple of the positive root vector eg,p, must belong 
to L. This proves (7. 4). 


8. Proof of Theorem II. By hypothesis, X is a C-subgroup of K. We 
consider the subgroups 7, 7; and U as given in (7.1). Then 


rank (K) = dim. T, + rank (U), rank = dim. + rank (U). 


Taking account of the known facts that the difference between the dimension | 


and the rank of a compact Lie group is always even, we obtain immediately 


dim. 7’, — dim. 7, = rank (K) — rank (Y) =dim. K —dim. X (mod. 2). 
Since dim. K — dim. Y = dim. (K/X) is even, it follows that 
(8.1) dim. T, —dim. 7, =0 (mod. 2). 


Let K°, X°, T°,, T°, be the Lie algebras (real Lie algebras) of K, X, 7, 


T, respectively. We consider the complex form G of K°, and choose a Cartan | 
subalgebra H of G. Denoting by ep the normalised root vectors of G [13], | 
we can imbed K° in G as the totality of linear combinations with real | 


coefficients of the following vectors: 


{i(€p — iLep, e_p | p= roots of G}. 


Let G be the simply-connected complex Lie group with G as its Lie algebra. | 
Then K is identified with the subgroup generated by K° in G. Let H® be 4 
the space spanned, with real coefficients, by {i[ep, ep]: p= roots of G}. 
This H°® is a maximal abelian subalgebra of K°. Since maximal abelian & 
subalgebras of K° are conjugate, we can assume T°, C H°. Since T,, 7, are . 


| 
| 
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ote toral subgroups and T°, C T°, C H°® CH, we know from (7.2) that there 
hi. | exist linearly independent rational elements h,, h2,- hy, *, such 
that (i) tha: ih», +, the span T°,, and (ii) thos, the 
span T°, where a= dim. T,, a—b=dim.T,. From (8.1), it follows 


(8.2) b=0 (mod.2). 


From (7.1), U is the semi-simple part of the centraliser of 7, in K. 


ive Thus its Lie algebra U° is the semi-simple part of the centraliser of T°, in K® 
1en A direct calculation tells us that U° is spanned, with real coefficients, by 
ave 


{t(eg + ep), —e-p, tLe, Be 9} 
where ® denotes the totality of roots B of G such that 


Let H, be the complex subspace spanned by hy, ho,- - -,ha. This is a rational 
' subalgebra of H. We can also regard ® as the totality of roots B of G such 
We that [eg,H,] 0. Let /’ be the rank of U. Then from (7.1) 


(8.3) tank —~ dim: T, —a. 


Among the vectors [eg, e_g], Be, there are exactly I’ linearly independent 


ones because {i[eg, es]: 8 @} span a maximal abelian subalgebra of U°. 


ion 
ly On account of (8.3), we can denote these J’ linearly independent elements by 
hast, hare,’ * ht Since eg are normalized root vectors, these h’s, being of 
_ the form [eg, eg], are rational elements of H. Moreover, since T°, 1 U® = 0, 
the elements 
r are linearly independent. Thus they form a rational base of H. 
ri We know from (7.1) that X is, locally, the direct product. 7, x U. 
n & 
" _ Thus X°— T°, + U®, and thus X° is spanned, with real coefficients, by 
‘eal {thos1; tha, thi, i(eg + eg B @}. 


Let ¢ be a non-zero real number. On account of (8.2), we can construct 


u = 6/2 vectors of the following form: 


ra. ky = th, Cho, ko = ths Cha, ky thy-1 chy. 
be | Now let us consider the complex linear subspace LE, spanned by 

are 


where ©’ denotes the totality of positive roots of G with respect to the rational 
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base hy, h2,- of H. From (7.4), L, forms a subalgebra of G. More- 
over, it can be easily seen that 


(8. 4) L, K° =X°. 


Firstly, we want to show that L, generates a closed subgroup in G. For this 
purpose, let us denote by H®° the totality of linear combinations with real 
coefficients of ih,,ih2,- Taking account of the fact c+40, we find 
immediately 


=H 1 X°. 


Since both H and X° generate a closed subgroup in G, so does their inter- 
section HM X°. Thus LE, H® generates a closed subgroup in G. The 
normaliser of L, in G is evidently H + L, which contains H. (7.3) then 
tells us that L, generates a closed subgroup L, in G. 

Let 2¢ be the number of roots of G, 2¢’ the number of roots in ©. Then 
® contains exactly ¢ roots, and 


dim. G = 2dim. G = 2(2¢ + 1), dim. K = 2¢ + 1. 


Moreover, © and ©’ have t—?’ roots in common. By a counting of linearly 
independent vectors, we find that 


dim. L, = 2dim. L, 


It follows then that 
(8. 5) dim. G — dim. LZ, = dim. K — dim. X. 


Now consider G as a transformation group of G/Z in the natural manner. 
Let O be an orbit of K on G/LZ,. Then O is homeomorphic with the coset 
space K/(KL,). From (8.4), X is the identity component of DL, K, 
and thus (8.5) tells us that 


dim. O = dim. K — dim. (IL, N K) 
= dim. K — dim. X = dim. G — dim. L, = dim. (G/L,). 


The orbit O, being a closed manifold, cannot be imbedded as a proper subset 

of a manifold of the same dimension. Thus 0 = G/L, and thus G/L, is | 
compact. Since G is simply-connected, and ZL, is connected, G/L, must be ! 
simply-connected. From the fact that K/(K L.) =O = G/L, is simply- 
connected, we know that KM L, is connected whence = KO L,. There- 
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fore, K/X =O is homeomorphic with the complex coset space G/L¢. 
Theorem II is thus proved. 

In the above proof of Theorem II, we have eventually shown that there 
are infinitely many Z,’s serving our purposes. This fact will be used later on. 
We shall state it precisely as follows: 


(8.6) Let K be a simply-connected, compact semi-simple Lie group, 
and X a C-subgroup of K. Suppose that K/X 1s of even dimension and has 
vanishing Euler characteristic. Let G be the complex semi-simple Lie group 
in which K is imbedded as a maximal compact subgroup. Then, to each real 
number c different from zero, there corresponds a connected complex subgroup 
L, of G such that (i) G/L, is homeomorphic with K/X and X =L, G, 
(ii) Le A Le whenever cAc’, (iii) all the L, have the same normaliser N 
whose Lie algebra N contains a Cartan subalgebra of the complex Lie algebra 


of G. 


Proof. Here, all the notations used in the proof of Theorem II will be 
adopted. We recall that L, is spanned by the vectors: 


{thi + cho,* +, cho, +, hy ep, ep: pe 


and Z, is the subgroup generated by LZ, in G. Connectedness of ZL, and 
property (i) have already been proved in Theorem II. Now by hypothesis, 
the Euler characteristic of K/X vanishes. Thus rank (X) < rank (K), and 
then 

b = dim. 7, — dim. T,) = rank (K) — rank (X) > 0. 


Hence L. ~L, whenever c~c’. The groups L, and L, they generate must 
be different. This proves property (ii). As for property (iii), we consider 
the normaliser of L, in G. It can be easily seen that N is the linear sub- 
spaces spanned by 

{H, eg, ep: Be O, pe 


which is independent of the parameter c. Moreover, this N evidently con- 
tains the Cartan subalgebra H of G. Thus the subgroup N generated by N 
in G is the common normaliser of all the Z,. This completes the proof. 


In what follows, we need that G@ is effective on G/L,. Then K is, in 
general, not simply-connected. Proposition (8.6) has to be modified in order 
to suit our purpose. 


(8.7) Let K be a compact semi-simple Lie group, and X a C-subgroup 
of K. Suppose that K/X is simply-connected and even-dimensional and has 


. 
; 
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vanishing Euler characteristic. Let G be the complex semi-simple Lie group 
in which K is imbedded as a maximal compact subgroup. If K acts effectively 
on K/X, then, in G, there is a collection {L-: ¢ =real number, c~0} of 
complex subgroups having the properties (i), (ii), (iil) in (8.6) as well as 
the property (iv) G acts effectively on G/L. 


Proof. Let K be the universal covering group of K, and X the subgroup 
of K which covers ¥. Since K/X — K/X is simply-connected, X is connected. 
We can easily see that Y is a C-subgroup of k. Let G@ be the complex semi- 
simple Lie group in which K is imbedded as a maximal compact subgroup. 
By (8.6), there is a collection {L,} of complex subgroups of G having the 
properties (i). (ii), (iii) with respect to K, X, G. Let +: GG be the 
projection. Up to an automorphism of G, we can assume 1(K) = K, 
(X) =X. The subgroups L.=-(L-) evidently have the properties (i), 
(ii), (iii) with respect to K, VY, G. It remains only to show that G@ acts 
effectively on G/L, or what is the same, that /, contains no normal subgroup 
of G except the identity. Let Q be the maximal normal subgroup of G 
contained in L,.. Then KNQCL.AK =X. From the effectiveness of K 
on kK/X, we know that KM Q contains only the identity. A simple argu- 
ment on Lie algebras tells us that dim. Q = 2dim. (K 1 Q), and hence Q 
must be discrete. Thus Q belongs to the center of G which belongs to K. 
It follows that Q contains only the identity. The effectiveness of G on G/L, 


is hereby established. 


9. Two propositions. In order to study the group of all analytic homeo- 
morphisms of a C-space, we shall first prove two rather simple propositions: 


(9.1) Let W=B/L, be a left coset space of a Lie group B. Suppose 
G to be a closed subgroup of B transitive over W. If B is effective on W, 
then the Lie algebra L, of L, cannot contain any non-zero subspace Q with 
[G. Q@] C Q, where G denotes the Lie algebra of G. 


Proof. Let V be the least subalgebra which contains Q. Evidently, 
VCL,, [6,V] CV. Let V be the subgroup generated by V in B. Then 
V CL,, and G is contained in the normaliser of V. Since @ is transitive 
on W, each element of W takes the form gL, where ge G. Let ve V. We 
have vg = g(g"vg) egV C gly, and thus vgl, = gL, for each g of G. This 
tells us that v induces the identity transformations on W. By hypothesis, 
B is effective so that V contains only the identity. Therefore, the subalgebra 


V as well as the linear subspace Q@ must be zero. (9.1) is proved. 
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(9.2) Let be a non-trivial irreducible representation of a complex 
semi-simple Lie algebra G over a vector space V, and V* a subspace of V 
invariant under $(ép) for all the positive root vectors ep of G. If, moreover, 
V* contains all the weight vectors with non-vanishing weight, then V = V*. 


Proof. We note that we have a fixed Cartan subalgebra H based on 
which the weight vectors, weight spaces and weights are defined. Moreover, 
we have a fixed rational base of H by which the ordering of weights and the 
ordering of roots of G are determined. Since ¢ is non-trivial, the minimum 
weight A of ¢ must be negative. Take a vector v with weight A. Then each 
vector v’ of the form: 

== By Be: Ee,(v) 


is a weight vector with weight equal to A+ a,+a,+---+-+4a,, where 
E.= (ea), € denoting the root vector corresponding to the root a. Since ¢ 
is irreducible, all the vectors v’ of the above form span the entire space V. 
Thus to prove our proposition, it suffices to show that each v’ belongs to V*. 


If Atata+---+a,40, then v’e V* by hypothesis. Now 
assume A +-a,-+-- +--+ «4,0. We shall call s the length of the vector v’, 
and use induction on s. If the length s is zero, then v’ =veV*. Suppose 
v’e V* whenever the length is less than s. We consider a v’ of length s. 
Since A+ a,+---+a¢,—0 and A is negative, one of the a’s must be 
positive. Let 7 be the least integer such that a; is positive. We can easily 
verify that 


j-1 
‘ 4=1 
where 
Since A+ a,+:: = — a; 0, 
From the invariance of V* under Lg, = $(éa,), we know that Ha,(v”) e V*. 
As for F;, there are three possibilities. If «+ «; is not a root, then 7; 0; 
if a+ «%;—=0, then F;(v) is a vector of length s—2; and if a+ a; is a 


root of G, then F;(v) is a multiple of a vector of length s—1. Thus, by 
induction hypothesis, F;(v) always belongs to V*. It follows then that 


j-1 

v = E£,,(v’) + SFi(v) belongs to V*. The induction is completed and 
4=1 

(9.2) is proved. 
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10. Group of all analytic homomorphisms. Let B be the group of all 
analytic homeomorphisms of a C-space. From a theorem of Bochner-Mont- 
gomery, it follows that B is a complex Lie group. In the present section, 
we shall show that the radical of B coincides with the center of B. 


TuroreM III. Let B be the identity component of the group of all 
analytic homeomorphisms of a C-space W. Then B is, locally, the direct 
product of a complex semi-simple Lie group and a complex vector group. 


Proof. From § 3, we know that B is a complex Lie group and that the 


isotropic subgroup L, of B is a complex subgroup of B. Since W=B/L, is | 
simply-connected, L, must be connected. Let B = GP be a Levi-decomposition | 


of B where G denotes a maximal semi-simple subgroup of B and # the 
radical. Choose a maximal connected solvable subgroup V of the intersection 


[L=L1,9G. This M is contained in a certain maximal connected solvable | 
subgroup, say M,, of Z, Evidently, is the identity component of MW, G. | 


We shall first show 
(10. 1) dim. MW, — dim. J/ = dim. R. 


For this purpose, we observe that B/M, is a fibre bundle over B/L, with L,/M, | 


as the fibre [11], p. 30. The base space B/L,, being homeomorphic with VW, 


is compact and simply-connected. By using first (3.5) and then (3.1), we | 
see that L,/M, is compact. Thus the bundle space B/M, is compact. More- | 
over, (3. 5) tells us that 7,(Z,/4,) is finite. From this fact and the simply- | 


connectedness of B/L,, it follows that the fundamental ‘group 7,(B/J/,) 


is finite. Proposition (2.3) then tells us that the maximal semi-simple sub- | 
group G is transitive over B/M,, and hence B/M, =G/(@NM,). We have, 


therefore, 
dim. G + dim. R — dim. MW, = dim. B — dim. M, 

= dim. G — dim. (/, N G) = dim. G — dim. MV 
which gives (10.1). 


Let B, G, R, L,, L, M,, M be the Lie algebras of B, G, R, L,, 1, M,, M 


respectively. All of them are complex Lie algebras. Passing the relations 


among these groups to their respective algebras, we obtain (a) B=G-+R > 
is a Levi-decomposition of the complex Lie algebra B, (b) M is a maximal : 
solvable subalgebra of LE, (c) M, is a maximal solvable subalgebra of Li, [ 
(d) L=L,1G, M=—M,G. Now choose a maximal solvable subalgebra 
S, of B such that M, C S,. Obviously, S, can be written in the form: q 


S, =S-+-R where S denotes a maximal solvable subalgebra of G. Thus 


a 

4 
M —M,nG= (M,N S,) M,N (S,NG) =M, 0S. 
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From the theory of vector spaces, 
dim. (S + M,) + dim. (M, 1 S) = dim. S + dim. M,, 
or what is the same, 
dim. (S + M,) = dim. S + dim. M, — dim. M. 


On the other hand, we have from (10.1) that dim. M, — dim. M = dim. R. 
Therefore, dim.(S+-M,) = dim. S+dim. R. But dim. S, = dim. S+dim.R 
and S+ M,C S,. It follows then 


(10. 2) RCS+R=S,—S+M,. 


Now let us consider S. Since it is a maximal solvable subalgebra of G, 
there exists a Cartan subalgebra H of G and a rational base hy, ho,- + +, hi 
of H such that S is spanned by H together with all the positive root vectors 
of G. Since W = B/L, is a C-space, (2.3) tells us that @ is transitive over 
W, and W=G/(L1,0 G) =G/L. Taking account of the fact M C S, we 
know from the first two paragraphs of § 5 that (i) LO H is a J-subalgebra 
of H, (ii) L is spanned by LM H and some root vectors, and (iii) all the 
positive root vectors belong to L. From (iii), it follows immediately 


(10. 3) [S,S] CLC 
Let ¢ be the representation of @ over the radical R defined by 
$(9) (w) — Ly, w], ge G, weR. 


This is a well-defined representation as R is the radical. We consider a non- 
trivial irreducible component ¢, of ¢, and denote by R’ the subspace of R 
corresponding to ¢:. Then R’ is decomposed into weight spaces: 


Ry + Ry 


where A; denote the weights. Let w be a vector of Ry, with Aj 0. From 
(10. 2), we know that w can be written in the form: w=2+ y, eS, ye My. 
Since {h: he H, A;(h) = 0} is a rational hyperplane and LQ Hi is a J-sub- 
algebra, there exists hoe HL such that Aj(ho) 40. Thus 


Aj(ho)w = (w) = [ho, + [ho 


and thus 


w = ([ho, + 


Both hy and y belong to L, so that [ho,y]eL,. As for [ho, xz], we have 
from (10. 3) 


[ho, [H, S] C [S, $8] C Ly. 
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Therefore, we L,, and then we, R’. Since this holds for all weight 
vectors with non-vanishing weight, we know that the subspace LM R’ con- 
tains all the weight vectors with non-vanishing weight. Moreover, for each 


positive root vector ep of G, ep e L, so that 
(ep) (L, R’) — [ M R’| R’. 


In other words, the subspace LE, R’ of R’ is invariant under ¢(ep). 
Proposition (9.2) then tells us that R’ = L, R’, or what is the same, 
R’ CL,. Taking account of the fact [G, R’] C R’, we see from (9.1) that 
2?’ 0 because of the effectiveness of B on W. Therefore, each irreducible 
component of ¢ must be trivial. In other words, ¢ is trivial. We have then 
[G,R]=0. This shows that the Levi-decomposition of B is a direct sum. 

It remains for us only to show that the radical R is abelian. Let w, w’ 
be two vectors in R. On account of (10.2), we can write them in the form: 


w=2x2+y, w=at+y’, CG, y,y CL. 
Thus 
[y, = [w—a, w’ —2’| = [w, w’] + [2, 2’], 
and thus 
[w, = [y, — 


The right hand side belongs to L, on account of (10.3) while the left hand 
side belongs to R. Hence [w,w’JeL,1R. From (9.1) and the fact 
[G, R] =0, we know that L, nN R = 0, whence [w, w’] =0. R is therefore 
abelian and Theorem III is proved. 


11. Homogeneous complex structures over a given manifold. Let M 
be a manifold over which two complex structures 9 and ’ are defined. 
These two structures are called equivalent if the two complex manifolds 
(M,%) and (4M, %’) are analytically homeomorphic. It is the aim of the 
present section to study the number of inequivalent homogeneous complex 
structures over a given manifold. 


THEOREM IV. Let M be a simply-connected closed manifold with non- 
vanishing Euler characteristic. Then M has at most a finite number of 
inequivalent homogeneous complea structures. 


Proof. On account of (2.3), it is sufficient to show that there are only 
at most a finite number of analytically non-homeomorphic coset spaces G/L 
whose underlying spaces are homeomorphic with M where G denotes a complex 
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semi-simple Lie group. Let dim. JJ. Since the maximal compact sub- 
group K(G) of G is transitive over M, we have dim. K(G) Sn(n + 1)/2. 
Thus dim.G = n(n-+ 1) and thus up to isomorphisms, there are only a 
finite number of possible G’s. Now fix the complex semi-simple Lie group G. 
Since the Euler characteristic of M is different from zero, we can see without 
difficulty that the Lie algebra E of Z contains a Cartan subalgebra H of the 
complex Lie algebra G of G. Thus L is spanned by H and some root vectors 
of G. It follows then that, up to inner automorphisms of G, such sub- 
algebras LE are finite in number. Thus, up to automorphisms of G, we have 
only a finite number of such subgroups ZL. Hence, the number of different 


coset spaces G/L homeomorphic with M is finite. This proves our theorem. 


When the Euler characteristic of Jf vanishes, the situation is entirely 
different. We shall see, in this case, that J/ has either no homogeneous 


complex structure or non-countably many inequivalent ones. 


(11.1) Let W=G/L, W’ =G’/L’ be two C-spaces where G, G’ denote 
respectively the maximal connected semi-simple subgroups of the groups B, 
B’ of all analytic homeomorphisms of W, W’. If W and W’ are analytic 
homeomorphic, then there is an isomorphism 6 of G onto G’ such that 
6(L) = 


Proof. From Theorem III, we know that B as well as B’ has only one 
maximal connected semi-simple subgroups. From this fact, Proposition 
(11.1) follows immediately. 


(11.2) Let N be a closed subgroup of a complex semi-simple Lie group 
G such that N contains a maximal toral subgroup T of G. Let A be the 
group of all automorphisms of G which leave N invariant, and A, the group 
of all those inner automorphisms induced by elements in N. Then A/Ao 
ts finite. 


Proof. We first observe that if an automorphism @ € A leaves invariant 
the subgroup 7’, then its differential dB leaves invariant a Cartan subalgebra 
H of the complex Lie algebra G of G. Therefore, such automorphisms are 
finite in number. Let us denote them by fi, Bs. We shall show 
that A = U Suppose ge A. Then a(T) CN. 
Since T and «(7’) are two maximal toral subgroups in N, there exists an 
inner automorphism y of V carrying 7 onto a(7’). Since N C G, we can 
regard this y as an inner automorphism of G, and thus ye Ao. The auto- 
morphism y~*« leaves invariant T. It must be one of the £’s, say B;. We 
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have then a+; A,B;. This tells us that A/A,» cannot have more than 
s elements. (11.2) is proved. 


(11.3) Let G bea complex semi-simple Lie group, and {L-} a collection 
of closed subgroups depending on a non-zero real parameter c. Suppose that 
(i) Le ALe whenever (ii) all the have the same normaliser N 
in G, (iii) the Lie algebra N of N contains a Cartan subalgebra H of the 
complex Lie algebra G of G. Then, in the collection {L,} of subgroups, 
there are non-countably many inequivalent ones under automorphisms of G. 


Proof. We divide the collection {Z,} of subgroups into equivalent classes 
by automorphisms of G. To prove (11.3), it is sufficient to show that each 


equivalent class contains only a finite number of L,’s. Suppose that L, and [ 


DL. are equivalent. Then there exists an automorphism @ of @ carrying L, 
onto Ly. Evidently, « will carry the normaliser of L, to that of L, or what 
is the same, «(V) = V. On account of property (iii), N contains a maximal 
toral subgroup of G so that (11.2) can be applied. Using the same notations 


A, Ao as in (11.2), we know that there exist elements 01, 02,° + *,om of A 


with the property 
A = a, Ao U U U OmAo. 


Since ae A, « = ayy for a certain j and a certain y of Ao. The group WN is F 


the normaliser of Z,, and y is an inner automorphism of VN. Thus L, = «(L,) 


= ojy(L.) =0;(L,). This tells us that each equivalent class contains at 


most m different Z,’s. Our Proposition (11.3) is hereby proved. 


TurorEM V. Let M be a simply-connected closed manifold. If tts | 


Euler characteristic vanishes, then it has either no homogeneous complex 
structure or non-countably many inequivalent ones. 


Proof. Suppose that M has a homogeneous complex structure. Then [ 
M is homeomorphic with a coset space K/X where K is a compact semi- [ 
simple Lie group and X a C-subgroup of K. As can be easily justified, we 


can assume that K acts effectively on K/X. Of course, there may be various 


pairs (K,X) with these properties. Now we choose (K, X) so that K is of | 
highest possible dimension. Let G be the complex semi-simple Lie group in 


which K is imbedded as a maximal compact subgroup. From (8.6), there 


exist a collection {Z,} of closed complex subgroups of G depending on a non- § 


zero real parameter c with the following properties: (i) G/Z, is homeomorphic 
with K/X for each c, (ii) Lp AL. whenever cc’, (iii) all the LZ, have 
the same normaliser NV whose Lie algebra contains a Cartan subalgebra of the 


oh 
| F 
| 
or 
| 
sh 
C- 
Fr 
q sl 
Be 
| 
ov 
of 
ple 


cr & 


e 


HOMOGENEOUS COMPLEX STRUCTURE. 29 


Lie algebra of G, (iv) @ acts effectively on G/Z,. Evidently, each G/L, is a 
C-space homeomorphic with VM. Thus, to prove our theorem, it suffices to 
prove that, in the collection {Z,: ¢40, c= real number} of complex coset 
spaces, there are non-countably many spaces which are analytically non-homeo- 
morphic with one another. 


Let W, = G/L,, and B be the group of all analytic homeomorphisms of 
W,.. In the natural manner, each element of @ can be regarded as an analytic 
homeomorphism of W,. On account of the effectiveness of G on W,, we can 
regard G as a subgroup of B. From Theorem III, B has a unique maximal 
connected semi-simple subgroup G*. Thus GC G*. We know from the 
proof of Theorem IJ that W, is homeomorphie with K(G*)/X* where K(G*) 
denotes a maximal compact subgroup of G*, and X* a C-subgroup of K(G*). 
From our choice of our K and X, it follows that dim. K(G*) S dim. K. 
Hence dim. G* = 2dim. K(G*) S 2dim. K = dim. G, and hence G= G*. 
In other words, G is the maximal connected semi-simple subgroup of the 
group of all analytic homeomorphisms of W,. Taking account of this fact, 
we know from (11.1) that W, and W-, are analytically homeomorphic if and 
only if Z, and LZ, are equivalent under automorphisms of G. (11.3) tells us 
that there are non-countably many inequivalent L,’s. Therefore, there are 
non-countably many analytically non-homeomorphic W,’s. The theorem is 
hereby proved. 


12. Determination of C-spaces. To determine all the C-spaces, we 
shall introduce a class of closed manifolds which are closely related with 
C-spaces. Let K be a simply-connected compact simple Lie group, and U a 
semi-simple C-subgroup of K. The coset space K/U is called an M-manifold. 
From the homotopy sequence, it follows immediately that an M-manifold is 
simply-connected and-has a finite second homotopy group. Thus its.second 
Betti number vanishes. 


THEOREM VI. Lach C-space W is a fibre decomposition space of a product 
M of M-manifolds with torus as the fibre (i.e., the product M is a torus bundle 
over W). Conversely, suppose M=M, X Mz Ms, to be a product 
of M-manifolds K;. If M is even-dimensioned, then M has infinitely many 
inequivalent homogeneous complex structures. Moreover, the resulting com- 
plex manifolds are non-kiihlerian. 


Proof. Let W be a C-space. By Theorem I, it is homeomorphic with a 
coset space K/X where K is a compact semi-simple Lie group and X a C- 
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subgroup of K. Let U be the maximal connected semi-simple subgroup of Y. 
Then U is a normal subgroup of XY, and X/U is a toral group. From the 
theory of fibre bundles [11], we know that K/U is a fibre bundle over K/X 
with Y/U as the fibre. Thus W is a fibre decomposition space of K/U with 
the torus ¥/U as the fibre. Hence, to prove the first part of Theorem VI, 
it suffices to prove that K/U is a product of M-manifolds. Firstly, we observe 
that K can be assumed simply connected, for otherwise, we can take the 
universal covering group of K in place of K. Thus K is decomposed into a 
direct product of simply-connected, compact simple Lie groups: 


By definition of C-subgroups, U is the semi-simple part of the centraliser Q 
of a toral subgroup of K. Evidently, Q has the same rank as K. It follows 
then [5] 

Q= (K,N Q) 


Thus U=U,®@U.®@-:--@U, where U; denotes the semi-simple part of 
K;N Q, and thus 


(12. 1) K/U = X K2/U2 XK K/Us. 


We can verify without difficulty that U; is a C-subgroup of K;. Moreover, 
each K; is a simply-connected, compact simple Lie group. Therefore, Ki/U; 
are M-manifolds. (12.1) then proves the first part of our theorem. 

As for the second part, we write M; = K;/U; where K; denotes a simply- 
connected, compact simple Lie group and U; a semi-simple C-subgroup of Ki. 
Then the product space M = M, X Mz is homeomorphic with 
the coset space 


(K,® ‘@K,)/(U: 80.8: -@U;,). 


The product K, ® K.®---®K; is obviously a simply-connected and compact 
semi-simple Lie group. Moreover, it is easy to see that U,®U.®:---@U; 
is a C-subgroup of K,®@K,@---@K% The existence of homogeneous 
complex structure over M then follows from Theorem II. By hypothesis, 
each M; is an M-manifold so it has vanishing second Betti number. Thus M 
also has vanishing second Betti number. It follows that M can never be 
rendered a kihlerian manifold. Besides, we know from the Corollary in § 6 
that the Euler characteristic of 1 must vanish. Theorem V then tells us 
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that M has infinitely many inequivalent homogeneous complex structures. 
Theorem VI is hereby proved. 

From the above theorem, the study of the topological properties of C- 
spaces is reduced to that of M-manifolds. By definition, an M-manifold is a 
coset space K/U where K is a compact simple Lie group and U a semi-simple 
C-subgroup of K. Thus U is the centraliser Q of a toral subgroup of K. 
Taking account of the fact that Q has the same rank as K, we can determine 
all the U’s by using known results [5], [12]. Here we shall simply state the 
results without elaborating. 


THeEorEM VII. The M-manifolds M = K/U corresponding to a classical 
group K are given as follows: 


BY/(Ai, X An XK X Bo), 
C/(Ai, X Aw XC), 
Di/(Ai, X An X X Ae XK Da), OSt 
where A;, B;, C;, Dj denote, respectively, the compact simple Lie groups of 
Cartan classes A, B, C, D with rank j. 


As for the exceptional groups, we will not enumerate all their semi- 
simple C-subgroups. Only the maximal ones will be given. In fact, this is 
sufficient as we can easily prove the following: 


Let U, U* be two semi-simple C-subgroups of K. If U C U* then U ts 
a C-subgroup of U*. 


TreorEM VIIF. All the semi-simple maximal C-subgroups of the excep- 
tional groups are given by the following table: 


Group maximal semi-simple C-subgroups 
Gs A, 
Ds, As, X Aa, Ai XK Ao Az 
By, dy dy By KA 


E, | Be, A A, Ba Ag, As Ds, As; As; As A, 
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Added in Proof. In connection with the results in this paper, H. Samel- 
son has recently proved that the underlying space of an even-dimensional 
compact Lie group has always a homogeneous complex structure (to appear 
in Portugaliae Mathematica). 


ALABAMA POLYTECHNIC INSTITUTE, 
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INVARIANT AFFINE CONNECTIONS ON HOMOGENEOUS 
SPACES.* 


By Katsumi Nomizu. 


Introduction. The purpose of the present paper is to clarify and 
generalize the work of E. Cartan on the relationship between differential 
geometry and the spaces of Lie groups and the homogeneous spaces. We 
study the existence and properties of invariant affine connections on homo- 
geneous spaces of certain types, in particular, on symmetric homogeneous 
spaces. By a symmetric homogeneous space we mean a homogeneous space 
G/H where @ is a Lie group with an involutive automorphism o and H is 
the subgroup of fixed point of o. 

Our problem may be explained, roughly, as follows. Let G/H be a 
homogeneous space of a connected Lie group. It is well known that if the 
subgroup H is compact, one can introduce a positive definite riemannian 
metric on G/H which is invariant by G acting on G/H in the natural fashion. 
The affine connection which is canonically associated to this invariant metric 
is then an invariant affine connection on G/H. The question arises, there- 
fore, whether there exist invariant affine connections on G/H when H is not 
necessarily compact. Even when it is compact, there may exist invariant 
affine connections which do not arise from any metrics. In the case of a 
symmetric homogeneous space G/H with compact, H, the invariant affine 
connection arising from any invariant metric has certain remarkable properties, 
first noted by E. Cartan, and serves as an important tool for the investigation 
of the space. If one attempts to generalize his theory to any symmetric 
homogeneous space, it is of primary importance to establish the existence of 
an invariant affine connection on it, having properties similar to those of the 
riemannian conncetion in Cartan’s study. 

We shall investigate these questions for a fairly general class of homo- 
geneous spaces, which we call reductive and which includes the symmetric 
homogeneous spaces. In Chapter I, we shall give a general definition of affine 
connection, which is due to Koszul, and we shall formulate certain basic 
concepts in the theory of affine connections in a form convenient for our 


*This paper was written while the author was a Eugene Higgins Fellow at 
Columbia University. 
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purpose. In Chapter II, we shall study the existence and properties of 
invariant affine connections on reductive homogeneous spaces. We shall see 
that the problems which arise can be reduced to algebraic questions con- 
cerning the Lie algebra of G and that of 7. Specifically, we show that there 
exists a one-to-one correspondence between the set of invariant affine connec- 
tions on G/H and the set of bilinear functions, called connection functions, 
which are themselves of simple nature (Theorem 8.1). In particular, there 
exist always two important affine connections, called the canonical affine 
connections of the first and second kind (Theorem 10.1 and Theorem 10. 2). 
The well known theorem concerning the holonomy group of a riemannian 
symmetric homogeneous space is obtained for the canonical affine connection 
of the second kind on any reductive homogeneous space (Theorem 12.1). 

Chapter III is devoted to a treatment of affine symmetric spaces. 
Applying the results of Chapter II, we shall first clarify the existence and 
properties of a natural affine connection on any symmetric homogeneous space. 
We include a generalization of Cartan’s theorem concerning the largest con- 
nected group of isometries of a rlemannian symmetric space (Theorem 16. 1). 
We shall then give a theorem showing the equivalence of various definitions 
of affine symmetric spaces (Theorem 17.1); the proof of the most essential 
part is given as a special case of a more general theorem which is established 
in Chapter IV. In Chapter IV, we shall characterize the spaces with affine 
connection such that the covariant derivatives of the torsion and curvature 
tensor fields are both zero (Theorem 18.1). It turns out that such a space 
may be characterized in two ways, geometrically and group-theoretically. 
Geometrically, it is an affine connection which is invariant by its parallelism. 
Group-theoretically, such a space may be locally representable as a reductive 
homogeneous space with its canonical affine connection of the second kind. 

My interest in the subject arose from Dr. Koszul’s seminar on symmetric 
spaces given at the University of Chicago in the spring of 1952, and from 
the subsequent conversations I had with Dr. Koszul and Professor Chern. 
I have also had the privilege of having frequent discussions and receiving 
valuable criticism from Professor Chevalley during the preparation of this 
work. I would like to take this opportunity to express my sincere gratitude 
to these mathematicians. 


Chapter I. Affine Connection. 


1. Affine connection. Let M be a connected differentiable manifold of 
class C*. We denote by % the ring of all differentiable functions on M and 
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by & the set of all infinitesimal transformations on M.1 Xf, where X eX 
and fe %, denotes the function in % which is defined by (Xf) (p) = Xp(f). 
On the other hand, fX denotes the infinitesimal transformation which is 
defined by (fX)p—=f(p)X>. In this way, X can be considered as a module 
over %. X can be also regarded as a Lie algebra of infinite dimension over 
the reals with the usual bracket operation [X,Y]. We then have 


(1.1) [fX, = fo[X, + f(X9)¥ 


for f,ge F and X,Y eX. 
An affine connection on M is, by definition,? a rule which assigns to each 
YeX an %-endomorphism ¢(X) of X and which satisfies the following 


conditions: 


(1. 2) (a) ¢(X,+ X.) =t(X,) + t(X2) 
(b) = ft(X)(V) + (VA). 
Then the following condition is automatically satisfied. 


(c) If either Y or Y is identically zero in an open subset U of M, 
then so is t(X)(Y). 


If W has an affine connection, then any open submanifold U of MM has 
an induced affine connection. Namely, one can define an affine connection 
tr in U as follows: Let Y and Y be any given infinitesimal transformations 
in UV. For any given point p in U, there exist infinitesimal transformations 
X’ and Y’ on M which coincide in a certain neighborhood of p with XY and Y 
respectively. One defines (ty(X)(Y))p = (#(X’) (Y’))>», which is a tangent 
vector at p. This tangent vector is uniquely determined independently of 
the choice of extensions X’ and Y’ of X and Y, in virtue of the condition (c). 
It is a trivial matter to verify that p— (tv(X)(Y))>, is an infinitesimal 
transformation in U and that Y > ty(X) satisfies the conditions of affine 
connection in U. 

In particular, if UV is a coordinate neighborhood with local coordinates 


v,z*,- - -,a@", the induced affine connection ty in U can be expressed by 


1For the terminology concerning manifolds we follow Chapter III of [7] with 
obvious change of analyticity into infinite differentiability. 

* This definition is due to Koszul and has been communicated to me orally. For 
the standard treatment of affine connections, we refer to [6]. 
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where (@/dz') are infinitesimal transformations in U such that (0/0zr')f = 0f/dz' 
for any differentiable function f in U and I*,(t,j,4=1,2,°-°-,mn) are 
certain differentiable functions in U. By a coordinate transformation in U 
the I*;; undergo the transformation which is well known in classical differ- 
ential geometry; this follows easily from the conditions of affine connection. 
Conversely, given an affine connection in the classical sense by means of I*; 
in each coordinate neighborhood, one defines ty by (1.3) in each coordinate 
neighborhood and then defines (¢(X)(Y))p = (tu(Xv) (Vv) )p at each point 
p of M, where U is a coordinate neighborhood of p and Xy and Yvy are the 
restriction of X and Y to U respectively. It is easy to see that this process 
defines legitimately an affine connection X -¢(X) in our sense. We have 
thus seen that our notion of affine connection coincides with the classical one. 


2. Transformations. Let ¢ be a differentiable homeomorphism of M 
onto itself. We shall denote by (¢’), the linear mapping of the tangent 
space T(p) at p into the tangent space T(¢(p)) at ¢(p) which is induced 
by ¢. We shall also denote by ¢’ the mapping of X onto itself defined by 

X)p = Xo4-p, 


as well as the mapping of % onto itself defined by 


(¢’-f)(p) =f(¢"(p)). 


Then we have the following formulas: 

(2.1) (Xf) = 

(2.2) (FX) = f) X) 

for and XeX. X is called invariant by ¢ if 

Given an affine connection X >¢(X) on M and a differentiable homeo- 
morphism ¢ of M onto itself, one can define a new affine connection 
by 
(2.8) (X)(¥) = ¥)). 

If ¢(X) =?#’(X) for every Ye X, we say that the affine connection is 
invariant by ¢, or that ¢ is an affine transformation of M (with respect to 
the given affine connection). 

Let us consider the case where the %-module X has a finite base 


X,,X2,° +, Xn, that is, every X e X can be uniquely expressed as X = 
i=1 


where f; (t= 1,2,---+,m) are functions in %. Then an affine connection 
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X—t(X) is uniquely determined by the values (in X) of #(Xi) (Xj) 
(i, 7 =1,2,--+,n) as we see from the conditions of affine connection. 
Conversely, by assigning any arbitrary values (in X) to ¢(Xi)(X;) (4,7 =1, 
2,- +,m) one can define ¢(X)(Y) for all XY and Y in so that X ~t(X) 
is an affine connection. Let furthermore ¢ be a differentiable homeomorphism 
of M onto itself. Then an affine connection Y ~¢(X) is invariant by ¢ if 
and only if (¢(X;:) (X;)) =t(¢’: Xi) Xj) for all 1,7 =1,2,° n. 
In particular, if each X; is invariant by ¢, then the affine connection which 
is determined by ¢(X;)(X;) is invariant by ¢ if and only if all ¢(X;) (Xj) 
are invariant by ¢. 

In order to illustrate an application of the above considerations, we shall 
see how the left invariant affine connections on a connected Lie group are 
determined. Let G be a connected Lie group of dimension n and let 
X,, Xo,- + +,X, be a set of linearly independent left invariant infinitesimal 
transformations of G. They form an %-base of the space X over G. A left 
invariant affine connection on G@ means an affine connection which is invariant 
by the left translations of G. By the above remarks, any left invariant affine 
connection is determined by its values of ¢(X;) (Xj) =1,2,°°°,m) in 
X which are themselves left invariant infinitesimal transformations. Since 
the left invariant infinitesimal transformations are determined by their values 
at the identity element e, that is, by the corresponding elements of the Lie 
algebra g of G, we see that each left invariant affine connection gives rise to 
a bilinear function « on g X g with values in g and vice versa. 


(2.4) There is a one-to-one correspondence between the set of left 
imvariant affine connections on G and the set of bilinear functions «a on g X g 
with values in g, the correspondence being given by (t(X)(¥))e=a(¥, X) 
for X,Y eg. 


3. Torsion and curvature. We shall find the expressions for the torsion 
and curvature tensor fields in terms of our definition of affine connection. 
Let U be a coordinate neighborhood in MW with local coordinates x’, x?,---, 2". 
Let X,, X2,- - -,X, be any base of the module X(U) of infinitesimal trans- 


formations in U such that = uipX, (i = 1, where wip are 
p=1 


certain differentiable functions in U with the determinant | ui, | which is 
not zero at any point of U. If we set ¢(@/dx*) (0/024) = as 
| 


before, the torsion tensor field has coefficients 7*;; = I*;; We have 
easily 
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(3.1) (0/dx*) = t(0/dx!) (8/dx') — (0/023) 
k=1 
= (Xp) —t(Xp) (Xa) — Xa] }- 
This leads us to the definition of the following mapping T of X X X into &: 
(3.2) (X,Y) =t(V)(X) —t(X) (VY) — LX, X,Y 


It has the properties that 7(X, Y) =— 7(Y, X) and T(fX, gY) = fgT(X, Y) 
where f,ge%. It is a tensor field of type (1,2). Since 


T 0/02!) = T'y(0/ax*), 
k=1 


we see that 7 is nothing but the torsion tensor field. 
As for the curvature tensor field, we have the coefficients 


n 
m=1 


which may be expressed in the following fashion: 


(3. 3) (0/00! ) = t(t(0/dx") (0/dx*) ) (8/024) 
— t(t(0/ax') (0/da*). 


We define therefore a mapping R of X X X XK X into X, or what amounts to 
the same thing, a mapping of X X X into the space of mappings of X into 
itself by 

(3.4) R(X, V)-Z=t(t(Z)(V))X —t(t(Z) (X))¥ — t(Z) - [X,Y] 


for X,Y,Z in X. R has the properties that R(X, Y) =—R(Y,X) and 
R(fX, gY) -hZ =fgh R(X,Y)-Z for X,Y,Z in X and f,g,h in §. It is 
a tensor field of type (1.3). We see from (3.3) that R is nothing but the 
curvature tensor field. 


4, Covariant differentiation. The covariant differentiation A which 
arises from a given affine connection is the derivation of the algebra of tensor 
fields on M which is uniquely determined by the following conditions: 


(4.1) (a) Af—df for fe, where d is the exterior differentiation ; 


(b) For any XeX and for any differential form w of degree 1, 
wo) + <X, Awd = o>, where < , > denotes the con- 
traction of tensors; 
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(c) AX for XeX is determined as follows: in any coordinate 
neighborhood U with coordinates x1, x?,- -, a" 


k= 


§,k=1 


where © denotes the tensor product. 


From the last condition (c) and (1.3) we see that A(0/dz+) = t(0/02‘) 
and hence 
(4. 2) A(X) =t(X) for X eX. 


For each Y eX we introduce a mapping Ay of X into itself by 
(4. 3) Ay: X = t(X)Y, 


It is easy to see that the value Ay: XY at a point p depends, for each X, only 
on the value of Y at p. (3.4) may be written as 


(4. 4) R(X, Y) = [Ax, Ay] — Arx,y3. 


This shows that the linear mapping 1 — Ay of the Lie algebra X into the 
Lie algebra of endomorphisms of the linear space X is not in general a homo- 
morphism of Lie algebras and the ‘ defect’ is given by the curvature tensor 
field. 


Without difficulty, we can find the expression for AR as follows: 
(4. 5) AR(X,Y;Z)-W=Az- (R(X, Y)-W) — R(X, Y) - (Az: W) 
— R(Az:X,Y)-W— R(X, Az: Y) - W. 


We write this expression also as ((Az:R)(X,Y))-W. This corresponds to 
the classical formula: 


n 
p=1 


n n n 
n 
p=1 


p=1 p=1 


5. Parallelism. By a regular curve in M we shall mean a continuous 
mapping x(s) of the unit interval [0,1] into M which can be extended to a 
differentiable mapping of an open interval (—e,1-+.) for some « > 0 and 
whose image is a (1-dimensional) submanifold of M in the sense of [7]. 
By a curve we shall mean a continuous mapping of the unit interval into M 
which is composed of a finite number of regular curves. The parallelism is 
first defined along regular curves lying in a coordinate neighborhood and then, 
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by obvious extension, along any curves in W. Let 2(s) be a regular curve 
lying in a coordinate neighborhood U, and let X, be the family of tangent 
vectors to the curve. A given family Y, of tangent vectors at the points z(s) 


is said to be parallel along x(s) if 
(5.1) t(Y)(X) =0 on 2(s), 


where V and Y are infinitesimal transformations defined in U which coincide 
with .Y, and Y, respectively on z(s). Such extensions X and Y always exist 
and the condition (5.1) does not depend on the choice of extensions. It is 
easy to show that this notion of parallelism coincides with the classical one. 

A path is defined to be a regular curve such that the family of tangent 
vectors XY, to the curve is parallel along the curve itself. For any point p 
of M and for any given tangent vector X, at p, there exists a path x(s). 
defined for —e<s<e for some «>0, such that 7(0) =p and such 
that the tangent vector at p to z(s) is equal to the given >, that is, 
lim (1/s) (f(z(s)) —f(p)) =X>f for any function f. Such a parameter s 


is uniquely determined and is called the canonical parameter of the path z(s). 
M is called complete if every path can be extended for any large value of its 


canonical parameter. 


6. Holonomy groups. We shall not go into the concept of development 
of frames along a curve, which enables us to define the holonomy group 
associated to the given affine connection. When we choose an arbitrary point 
p, the holonomy group is realized as the group of affine transformations of 
the affine tangent space at p, that is, the tangent space T'(p) provided with 
a natural affine structure. However, the homogeneous part of the holonomy 
group can be defined only with the help of parallelism. In the present paper 
we shall agree that the word holonomy group means the homogeneous part 
of the whole holonomy group. Thus the holonomy group H* is the linear 
group in the tangent space consisting of the parallel displacements corre- 
sponding to all closed curves. The subgroup H*, of H* which corresponds 
to all closed curves homotopic to 0 will be called the restricted holonomy 


group of M. 

If M is n-dimensional, the holonomy group H* and the restricted 
holonomy group H*, are subgroups of GL(n,R). It has been proved 
recently by several authors that H*, is the identity component of H* and 
that H*, is a connected Lie group. We shall state a theorem of Chevalley 
which shows how the Lie algebra of the restricted hononomy group is deter- 
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mined by the curvature tensor field and all its successive covariant differentials. 

For each point p of M, let h*(p) be the Lie algebra of endomorphisms 

of the tangent space at p which is generated by all endomorphisms of the forms 

(R(X, ((Az* Y)) ps ((Aw: where X, Y, 

Z, W,: ++ denote arbitrary infinitesimal transformations. Let r(p) = dim h*(p) 

and r(M) = — r(p). If r(p) =r(J), p is called a regular point. Then, 
pel 


if the set of regular points of M is connected, the restricted holonomy group 
has the Lie algebra which is isomorphic with h*(p) for any regular point p 
(Chevalley).° 

If M admits a transitive group of affine transformations, then all §*(p), 
pe M, are isomorphic with each other so that WM consists of regular points. 
Therefore, the restricted holonomy group has the Lie algebra isomorphic with 
h*(p) for any point pe M. We shall use this result in Chapter II. 


Chapter II. Invariant Affine Connections on Homogeneous Spaces. 


7. Reductive homogeneous spaces. A homogeneous space G/H of a 
connected Lie group @ is called reductive if the following condition is satisfied : 
in the Lie algebra g of G@ there exists a subspace m such that g=m-4 h 
(direct sum of vector subspaces) and ad(h)m Cm for all he H, where § 
is the subalgebra of g corresponding to the identity component H, of H and 
ad(h) denotes the adjoint representation of H in g. G/H is reductive in 
either of the following cases: 


(a) H is compact; 

(b) H is connected and the subalgebra § is reductive in g, that is. 
ad(h) in g is completely reducible. This is the case if H is connected and 
semi-simple. 

(c) H is a discrete subgroup of G; h 0 and m=~g. 


From now on, we shall always assume a homogeneous space to be 
reductive and shall consider a fixed decomposition g—=m-+ of the Lie 
alyebra satisfying the above condition. 

G/H is considered to be the space of cosets rH, xe G, on which G acts 
to the left, that is, ae G maps the point 7H into the point arH. We denote 
by p the point represented by the coset H. Then the subspace m can be 
identified with the tangent space at po. Let X,,X2,- --,Xn be a base of g 
such that the first m elements span m and the last n —m elements span . 


8 This result is unpublished. 
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As is well known,‘ there exists a neighborhood V of the identity element 
e of G with a canonical system of coordinates 2’, 2*,- - -,2" such that V is 
the topological product of the sets 


{re V | gmt? — 0} 
and 
K = - 7" = 0}, 


where K is contained in Hy. If we denote by z the canonical projection of 
G onto G/H, x is a differentiable homeomorphism of NV onto a neighborhood 
N* of po in G/H. We have x(c) =7(c) - po for ce N, where r(c) denotes 
the transformation of G/H which is induced by c. N* is the set of points 
of the form r(c) po, ce XN. 

For each element 1 of m, we define an infinitesimal transformation * 
in N* as follows: 


(7. 1) = (7 (€) ) (ce), 


where X in the right hand side denotes the tangent vector at po which is 
identified with the given Yem. (X*),, is nothing but this element X of 
the tangent space at po. Since 7(c) is a differentiable homeomorphism of 
G/H onto itself and since N and \* are differentiably homeomorphic by z, 
we see that X* so defined is an infinitesimal transformation in N*. Since 
X,,X2,: + +,Xm is a base of m, the corresponding X,*, X.*,- --,Xm* are 
linearly independent at every point of NV*, that is, they form a base of the 
module of infinitesimal transformations on NV*. 

Let h be any element of H. Since r(h) +o = po, 7(h) induces a linear 
transformation (r(h))’,, of the tangent space at pp onto itself, which is the 
same as ad(h) on m. We shall show how each infinitesimal transformation 
X* (Xem) is transformed by 7(h). Take a subset N, of N such that 
hN,h- C N (the existence of such NV, follows from the condition ad(h)m C m) 
and let N,* = {r(c)-po|ceN,}. Then we have 7(h)-N,* C N* and 


(7. 2) (r(h))’- X* = (ad(h)X)* on r(h) - N,*. 
In fact, we have for any ce N, 


((7(h) X*) == (7 (h))’: (A*) = (r(he))’- X 
= (r(hch"))’> (r(h))’- X = (r(heh*) (ad(h)) -X 
= (ad (h)X)* 
since N and r(hch*) - po =7(h) po. 
For later use, we shall prove 


‘For example, see Chapter IV, § V of [7]. 
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(7.3) For any X and Y in m, [X*, Y*]p, = where 
denotes the m-component of the element [X,Y¥]eg=m- b. 


In fact, let x(t) and y(s) be the 1-parameter subgroup of G generated by 
X and ¥ respectively. For any function f in N*, we have 


(Y*f) ence) lim (1/s){ (f° — (fom) 
where 2*(¢) denotes the image of x(t) by z, and 
— lim (1/t){(¥"P) (2*(t)) — (po) 


From this expression and a similar one for Y*,,(X*f), we see that 
[x*, Y*]of = [X,Y]. (f or) = [X,Y] nf, which proves (7. 3). 


8. Fundamental existence theorem. Keeping the notation of the pre- 
ceding section, we shall now prove the fundamental existence theorem for 
invariant affine connections on G/H. An invariant affine connection means 
an affine connection on G/H which is invariant by all r(a), ae G. 


THEOREM 8.1. Let G/H be a reductive homogeneous space with a fixed 
decomposition of the Lie algebra g=m-+b, ad(H)m Cm. There exists a 
one-to-one correspondence between the set of all invariant affine connections on 
G/H and the set of all bilinear functions « on m X m with values in m which 
are invariant by ad(H), that is, ad(h) -a(X,Y) = a(ad(h)X,ad(h)Y) for 
X,YemandheH. The correspondence is given by 


(8. 1) a(X,¥) = (t(¥*) (X*)) 


For the sake of simplicity, we shall call such a bilinear function « on 
m X m a connection function on m X m (or, more precisely, a connection 
function with respect to the decomposition g = m-+ 6). 


Proof. Assume first that there exists an invariant affine connection ¢ on 
G/H. We define a bilinear function «(X,Y) on m X m with values in m 
by (8.1). Let A be any fixed element of H, and take a subset N, of N such 
that (7.2) holds. Since the affine connection is invariant by G, the induced 
affine connection in N, is invariant by r(h), that is, we have by (2.3) 


on 7(h)-N,*. Taking the values of both sides at py and using (7.2), 
we obtain 


(r(h))’ (t(Y*) (X*)) pp a(X, =ad(h) -a(X, Y) 
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and 
X*) (ad (h) ¥)*) (ad (h)X)*) ) 
= a(ad(h)X,ad(h)Y), 
which prove the invariance of a(X,Y) by ad(h). 

Conversely, we shall prove that any connection function a on m Xm 
gives rise to an invariant affine connection which is related to the original « by 
(8.1). For a base X,, Y2,---, Xm of m, the corresponding X,*, X,*,---,Xm* 
form a base of the module of infinitesimal transformations in N*. As we have 
remarked in Section 2, there exists an affine connection ¢* in N* such that 


Since @ is invariant by ad(#), it follows from (7.2) that 
for any he H. More generally, we have 
(8.3) For any infinitesimal transformations X* and Yt in N*, 


This can be proved easily by using (1.2), (2.1), (2.2) and (8.2). Now 
we define an affine connection ¢ in'‘G/H as follows. 
For any infinitesimal transformation X* and Y? on G/H, we define 


at any point p—r(a): po (ae G) of G/H. (8.3) implies that this tangent 
vector at » is independent of the choice of an element ae G such that 
p=1(a)- po. It is easy to show that p— (¢(¥7) (Xt) ), is an infinitesimal 
transforamtion on G/H and that Yt + ¢(¥7) defines an affine connection on 
G/H. For example, we have for any differentiable function f on G/H 
(setting p—r(a@) po and ¢ 

that is, (fXt) = ft(Y?t)(X*). The other conditions of affine connec- 
tion can be proved similarly. Finally, the invariance of this affine connection 
t is clear from the definition. Since 

(X*) ) og = (X*) = a (X, 

t is related to the given « by (8.1). 
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It is also clear that an invariant affine connection on G/H is uniquely 
determined by the corresponding connection function « We have thereby 
completed the proof. 


9. Torsion and curvature of invariant affine connections. Let ¢ be the 
invariant affine connection on G/H which is determined by a connection 
function @ on mm. We shall find the expressions of the torsion and 
curvature tensor fields of ¢ in terms of a We remark that these tensor 
fields are invariant by G and hence are determined by their values at po. 
The covariant differentiation commutes with r(a), ae G, that is, r(a) 
=A-7r(a)-F for every ae G and for every tensor field F. 

The torsion tensor field 7’ is easily determined. Let X* and Y* be the 
infinitesimal transformations which correspond to XY and Y in m respectively. 
We have from (3. 2) that T(X*, Y*) = t(Y*)(X*) — t(X*)(Y*) — [X*, Y*]. 
It value at po is then equal to a(X, Y) —a(¥Y,X) —[X, Y],, in virtue of 
(7.3) and (8.1). We shall denote this expression by 


(9. 1) T(X,Y) =a(X, VY) —«(¥,X) —[X,V] (X,Yem). 
It is a bilinear function on m XK m with values in m. 
As for the curvature tensor field R, we have by (3. 4) 
(9.2) R(X*, Y*) 
= t(t(Z*) (Y*))X* —t(t(Z*) (X*)) — t(Z*) - [X*, Y*]. 


In order to calculate this, let X,,X2,- + -,Xm be a base of m. We set for 
any (ce N) 
Xi*)g =D )Xu*,qeN*, t= 


where ¢,i are differentiable functions in p and gq which satisfy the following 
condition : 

(9. 3) $ui(Ps Po) = bui(Po, 9) =0 for any p, q in N*. 

Then we have 


= Po) (Ps Po)® Xs) 


(Ps Po) Po)) Xv} 


X1))*» + > (ps po) (Xo*) p 
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where (Xx*¢y1) (p, Po) is the function of p which is the result of applying 
X;* to the second variable of ¢y(p,q) at ¢g=po. We have, by a similar 
computation as above 


(t(t(Xi*) Xj") (4(a(Xe, + (Ds Po) Xj" 
— a(X;, X1)) + > (Xu*$yr) (Pos Po) Xx) 
4 (Xj*) (P, Po) Xv 


since (X;,*y1)(p, Po) = 0 by (9. 3). In the last expression Po) 
denotes the result of applying X;* at p= pp to the function (X;*¢,1)(p, po). 
Hence we get 


= a(X;, a( Xx, — a(X;, a(X;, 


We shall show that the last term is equal to — [[Xj, Xx]p, Xi], where 
[X;,Xz]_5 denotes the h-component of the element [X;,X,]eg. Let 
p=t(c)° po and g=—r(c’)- po with c,c’e N. Then 

= (ad(h*)X), 
where ce” ¢ N and he K are uniquely determined elements such that cc’ = c”h 
in the decomposition of V into the product of NV and K explained in Section 7. 
Therefore we see that (ad(h-*)X,)*, = > dyi(p,q7)(X»*)q. It is not difficult 

p=1 


to derive from this that 


(9.5) (P; Po) — (Xx*) (P, Po) }Xy 


Finally the term (¢(X;*) - [Xj;*, Xi.*])p, is easily seen to be equal to 
a([X;, Xe]m, X71). Using this and (9.4), (9.5) and (9.2), we obtain the 
following expression for the value at »,. of the curvature tensor field 


~ 


v=1 
( 
| 
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(9.6) R(X, ¥)-Z—a(X,a(¥,Z)) a(X,Z)) 


The covariant differential AR of the curvature tensor field is obtained 
as follows. Let X¥,Y,Z,Wem. By (4.5) we have 


(AR) (X*, Y*;Z*) - W* = (R(X*, Y*) - W*) — R(X*, (Age: W*) 
— X*, Y*) - W* — R(X*, Y*) - W*. 


We shall find the value at yo. Since F is an invariant tensor field, we have 
(R(X*, Y*) - W*) = (7(c))’ (R(X*, Y*) - W*),,. If we denote by 
Az the mapping Xem—>Az-X—a(Z,X) of m into itself, we find 
R(X*, Y*) - W*),, = Az: (R(X, Y)-W). Using the property of R 
that R(fX*, gY*) -hZ* = fgh R(X*, Y*) -Z* for any functions f, g, and h, 
we easily find (R(X*, Y*) - (Aze- W*))», = R(X, Y) - (Az: W) and similar 
expressions for the other terms. Therefore we obtain the following expression 
of the value at po of the covariant differential AR: 


(9.7) (R(X, Y)-W) —R(X, VY) 
— R(A,-X,¥)-W—R(X, Y)-W. 


10. Canonical affine connections. We shall consider the following 
properties of invariant affine connection on G/H. 


Let be the 1-parameter subgroup of G@ generated by an 
element X em, and let x*(s) be the image of x(s) by the projection w of G 
onto G/H. Then x*(s) is a path. 


(A2) Let a(s) and x*(s) be defined as above. Let Y be an element 
of m. Then the parallel displacement of the tangent vector Y at po along 
the curve 7*(s) is the same as the translation of Y by the 1-parameter sub- 


group 2x(s). 


Tf (A1) is satisfied, every path through jp» is the image by z of the 
1-parameter subgroup generated by a certain element of m. The affine 
connection being invariant, any path in G/H is either such a path or its 
translation by an element of G. Therefore, if an invariant affine connection 
satisfies (A1), then the affine connection is complete in the sense of Section 5. 

Since the tangent vectors to the curve x*(s) are equal to (X*)2«(s) 
= (r(a(s))’- X, we see that (A2) implies (A1). We shall now find the 
condition for a connection function # on m X m to define an invariant affine 


connection which satisfies (A1) and (A2) respectively. First we prove 
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(10.1) Let x(s) and x*(s) be as in the assumption of (A1). Then 
(t(¥*) (X*)) = (a(X, Y))* on a*(s) for every value of s such that x(s) N. 


We may assume that X and Y belong to a base X,, X2,- - +, Xm of m, 
say =X, and Y=X,. Let s, be any fixed value such that x(s,) N. 


We set 5,)))’- Xi* = dyid,* in a small neighborhood of po. Since 
x(s) is a 1-parameter subgroup, we have for sufficiently small s 
( (7 ) A;* = (r(2(— )’ (X4*) 
= (7(x(— s,))) (r(a(s + Xi= (7 Xi = (Li*) 
which implies (X;*) 2+*(s) = (Ss) that is Pui(z*(s)) = 
ost 


for sufficiently small s. In particular, we have ¢yi(po) =8,i. Since 
(X,*f)p, =lim (1/s) (f(z*(s)) —f(po)) for any function f in N*, we have 


yi) p, = 9. Hence we get 


= (7(z(s)))’> Xo) (%(X1, X2) ) 


which is the desired result. 
From (10.1) and from our formulation of the parallel displacement 
(5.1) we see that 


(10.2) The invariant affine connection defined by « satisfies (A1) 
if and only if «(X,X) =—0 for every X em. 

(10.3) The invariant affine connection defined by « satisfies (A2) if 
and only tf a(X,Y) =0 on m Xm. 

From (9.1) and (10.2) we have 


THEOREM 10.1. On a reductive homogeneous space G/H, there exists 
one and only one invariant affine connection which has a trivial torsion 
tensor and which satisfies (A1). It is defined by the connection function 
a(X,Y) = (1/2)[X, Y], on m X m. 


We shall call this invariant affine connection the canonical affine con- 
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nection of the first kind on G/H with respect to a fixed decomposition of the 
Lie algebra g =m - 

We have from (10. 3) 

THEOREM 10.2. On a reductive homogeneous space G/H, there eaists 


one and only one invariant affine connection which satisfies (A2). It ts 
defined by the connection function which is identically zero on m X m. 


We shall call this invariant affine connection the canonical affine con- 
nection of the second kind on G/H with respect to a fixed decomposition of 


the Lie algebra g =m -+ J. 


THEOREM 10.3. The canonical affine connection of the second kind on a 
reductive homogeneous space G/H has the following torsion and curvature 


tensors: 


T(X,Y) =— R(X, Y)-Z =— [[X, Z]. 
The covariant differentials of T and R are both zero. 


This follows from those expression for 7’, R and AR which have been 
given in Section 9. AZ = 0 is also easy to prove. 


11. Affine connections on a group space. We shall clarify E. Cartan’s 
study of affine connections on the space of a connected Lie group.® Let G 
be a connected Lie group with Lie algebra g. The three affine connections 
which were studied by Cartan are the following. 

a) (—)-connection: A left invariant affine connection on G which 
satisfies the condition (A2); it is defined by the corinection function a= 0 
on gXg. T(X,Y) =—[X,Y] and R=0. 

b) (+)-connection: A right invariant affine connection which satisfies 
the condition (A2) in the sense of right translations, that is, the parallel 
displacement along a 1-parameter subgroup 2(s).is the same as the right 
translation by x(s). If we consider g as the Lie algebra of left invariant 
infinitesimal transformations on G (as we do for the case a)), then 
this connection is defined by the connection function «(X, Y) = [X,Y] and 
T(X,Y) —[X,Y], 

ec) (0)-connection: A left invariant affine connection which satisfies 
the condition (Al) and which has a trivial torsion tensor; it is given by 
a(X,¥Y) = (1/2)[X, VY]. R(X, VY) =— (1/4)ad([X, Y]). 


5 See [2] and Chapter V of [8]. 


50 KATSUMI NOMIZU. 


All these connections have 1-parameter subgroups as paths. They are 
all bi-invariant, that is, invariant by right as well as left translations of G, 


since their connection functions are invariant by ad(@) in g. 


12. Holonomy groups. Let (/H be a reductive homogeneous space 
with a fixed decomposition of the Lie algebra g=m-+b. We first prove 


(12.1) Let §, be the subspace of h spanned by all elements of the 
form [X,Y ]5, X,¥em. Then is an ideal of §. 


In fact, for any Y, Y em and U ef we have 
[U, [X, Y]] = LU, + (U, [X, 


where [U,[X,¥],,]em and [U,[X,¥]5]eb. On the other hand, the 
Jacobian identity shows that 


[U, [X, Y]] [X, [U, Yj] 4- [[U, xX], 


[X, [U, 4- [X, [U, File [[U, X], YJ» LLU, xX]; 


Hence [U, [X, ¥]p] = (X, (U, + [[U. X]. which proves that 
h, is an ideal of §. 


We have from a special case of the theorem of Chevalley stated in 


Section 6 and from Theorem 10. 3. 


THEOREM 12.1. Let G/H be a reductive homogeneous space. The Lie 
algebra of the restricted holonomy group of the canonical affine connection 
of the second kind is isomorphic with ad(h,) acting on m, where h, 1s the 
ideal of h spanned by all elements [X, Y]y, X,Y em. The restricted holonomy 
group ts isomorphic with ad(H,) acting on m, where H, is the invariant Lie 
subgroup of Hy generated by the ideal };. 


This is a generalization of a well known theorem concerning riemannian 


symmetric spaces.® 
It is easy to prove 


THEOREM 12.2. Let D be a discrete subgroup of a connected Lie 
group G. The holonomy group of the canonical affine connection of the 
second kind on G/D is isomorphic with ad(D) acting on the Lie algebra g. 


13. Metric connection. It is well known that, given a symmetric non- 
degenerate tensor field of type (0,2), that is, the so-called (positive-definite 


* See [1], [5] and [10]. 
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or indefinite) metric ds*, there exists a unique affine connection without 
torsion whose parallelism leaves the given ds? invariant. 

Let G/H be a reductive homogeneous space with a fixed decomposition 
of the Lie algebra g=m-+b. Let B be a real-valued non-degenerate 
symmetric bilinear function on m X m which is invariant by ad(H), that is, 
B(ad(h)X,ad(h)Y) = B(X,Y) for em and he H. B determines an 
invariant (indefinite) metric B* on G/H. We shall find the connection 
function « which defines the invariant affine connection arising from the 
metric. 

Net XY, Y and Z be in m. Let z(s) be the 1-parameter subgroup 
generated by Z and 2*(s) the image of z(s) by z. Denote by X* and Yt 
the results of the parallel displacement of the tangent vectors X and Y, 
respectively, along the curve z*(s). Taking a base X,, Y2,- --,Xm of m, 


we write Xt = and Yi = where ¢; and y; are differentiable 
4=1 4=1 


functions such that Y = $i(po) Xi and Y = Wi(Po) Xi. Since the parallel 
i=1 4=1 
displacement leaves B* invariant, we have 


™ 


B*(X1, Yt) = X;*) = B(X, VY) =X Po) po) B(Xi, Xi). 
i,j=1 


4,j=1 


B* being invariant by the transformations of G, we have B*(X;*, X;*) 


n 


— B(X;, X;) and hence $iyjB(X;, Xj) = We apply 
i,j=1 


i,j=1 
(Z*),, to both sides and get 


Since (2*¢;) p.4i = — «(Z, X) as is easily seen, we get 
B(a(Z,X),Y) + B(X,a(Z, Y)) =0. 


We introduce U(X, Y) = a(X, Y) — (1/2)[X, VY]. Since the affine con- 
nection has a trivial torsion tensor, we have a(X, Y) —a(¥Y,X) =[X,YV]m 
and hence U(X, Y) =U(Y,X). Thus we get 


(1/2){B([X, Z] Y) + BLY, m, X)} = BUU(Z, X), Y) + B(X, U(Z, Y)). 


From this and the other two equalities obtained by the cyclic permutation 
of XY, Y and Z, we finally obtain 


(13.1) B(U(X,Y), Z) =(1/2){B([X, 2]... ¥) + BCX, [¥, Z],)}- 
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or 
ra) 


This equality determines U(X, Y) uniquely for given X and Y since B is 


non-degenerate. 


THEOREM 13.1. Let G/H be a reductive homogeneous space with a 
fixed decomposition of the Lie algebra g=m-+ h. Let B* be an invariant 
(indefinite) metric on G/H and B the bilinear function on m X m corre- 
sponding to B*. Then the connection function of the affine connection 


induced by the metric B* is 
a(X,¥) = (1/2)[X, + (X,Y), 
where U(X,Y) ts determined by (13.1). 


In order for B to induce the canonical affine connection of the first 
kind, it is necessary and sufficient that U(X, Y) is identically zero, that is, 


(13. 2) B((X, Z]m + B(X, [Y,Z]m) = 9; X,Y,Zem. 
In particular, we have 


THEOREM 13.2. Let G be a compact connected Iie group and H a 
closed subgroup. Let B be a positive definite quadratic form on the Lie 
algebra g which is invariant by ad(G@). Let m be the orthogonal subspace 
to the subalgebra § of H. Then the riemannian metric on G/H determined 
by the restriction of B to m Xm induces the canonical affine connection of 
the first kind on G/H. 


Proof. B is invariant by ad(G) and hence its restriction to m xX m 
satisfies (13. 2). 


Chapter III. Symmetric Affine Spaces. 


14. Symmetric homogeneous spaces.’ Let G be a connected Lie group 
and o an involutive automorphism of G, that is, oc? =the identity automor- 
phism. The totality of fixed points Ho of o is a closed subgroup of G. A 
homogeneous space G/H is called a symmetric homogeneous space (with 
respect to a) if H lies between Ho and the identity component of Ho. In 
certain cases it will be necessary to assume that @ is effective on G/H as a 
transformation group. 

Let G/H be a symmetric homogeneous space defined by o. The sym- 


7See [3]. In this Chapter, I have borrowed some of the ideas presented by Dr. 
Koszul at the seminar mentioned in the Introduction. 
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metry at the point po is an involutive differentiable homeomorphism of G/H 
onto itself which is defined by 
(14. 1) (p) po for p=71(a@)po, aeG, 
the right hand side being independent of the choice of ae G such that 
p=1(a)*po. A transvection is an element of G such that av =a. We 
define a mapping p of G/H into the set of transvections by p(p) = aa~’, the 
right hand side again being independent of the choice of ae G such that 
p=1(a)- po. If pis the image of a transvection ae G by =, then p(p) = a’. 
The symmetry at any point p is defined as the mapping 7(p(p)) +o» of G/H 
onto itself. 

A symmetric homogeneous space @/H is always reductive, and we take 
the following canonical decomposition of the Lie algebra. Let im be the 
1 of the involutive automorphism, denoted 


eigen-space for the eigen-value 
by the same oa, of the Lie algebra g which is induced by the automorphism o 
of G. Then g=m-+ 5, where § is the subalgebra of the identity component 
H, of H. We have X* = —X for Xem, U’ = U for U cf and ad(H)m C m. 
The 1-parameter subgroup generated by any X em consists of transvections: 
(x(s))% = (x(s))"* = If we take a neighborhood V of e with 
canonical coordinates x’, 7*,- -, 2”, a™*1,- - -, a2" with respect to g = m-+ 
as we did in Section 7, then any point with coordinates such that 
gmt? - == () is a transvection. For a point p with coordinates 
near in G/H, the point has coordinates 
(—a',—a@?,---,—a™). We see that po is an isolated fixed point of op. 

From now on we shall always use the canonical decomposition g = m + h 
for a symmetric homogeneous space. We have . 


(14.2) [X,Y]eh for any X,Y em. 
Keeping the notation in Section 7 and using the fact that o9 commutes with 
t(a) for every ae G, we can easily prove 


(14.3) (oo)’: X* = — X* for any X em. 


15. Canonical affine connection on a symmetric homogeneous space. 
By the results of Chapter II we can clarify the existence and properties of a 
natural affine connection on any symmetric space. From Theorem 10. 1, 
Theorem 10.2 and (14.2) we have immediately 


THEOREM 15.1. The canonical affine connection of the second kind of 
a symmetric homogeneous space G/H has a trivial torsion tensor. In other 
words, the canonical affine connections of the first and second kind coincide. 
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We shall call this invariant affine connection the canonical affine con- 
nection of a symmetric homogeneous space. 
Conversely, we have 


THEOREM 15.2. Let G/H be a reductive homogeneous space. If the 
canonical affine connections of the first and second kind (with respect to a 
fixed decomposition g=m-+h) coincide, then [m,m] Ch. If moreover 
G/H is simply connected, it can be considered as a symmetric homogeneous 
space Gt/Hi where Gi is the simply connected covering group of G and Ht 
is the subgroup of Gi generated by §. 

Proof. The first assertion is already clear. The linear endomorphism o 
of g defined by X¥° = — X for Yem and U’ =U for U ef is an involutive 
automorphism of the Lie algebra g. Let «7 be the involutive automorphism 
of G* induced by o. It is clear that H* is contained in the set of fixed points 
of « so that G*/H?* is a symmetric space defined by ot. Since G/H is simply 
connected, G/H = 

THEOREM 15.3. Let G/H be a symmetric homogeneous space. The 


canonical affine connection is the only invariant affine connection on G/H 


which is invariant by the symmetry at each point. 


Proof. let « be a connection function on m Xm. We shall prove that 
the invariant affine connection ¢ defined by « is invariant by the symmetry 
at each point if and only if « is identically zero. ; 


Suppose first that ¢ is invariant by oo. Then we have 
(t(Y*) (X*) = ¥*) ( (a0) X*) 


for any X,Y em. Using (14.3) we see that «(X,Y) =0. 

Conversely, we prove that the canonical affine connection is invariant 
by the symmetry at each point. It is sufficient to prove the invariance by oo. 
Let ?’ be the affine connection defined by 


(Xt) (Vt) = (t((oo)’- X*) ¥t)) 


for any infinitesimal transformations Yt and Y+ on G/H. Since op commutes 
with every r(a),aeG, we see that ¢’ is an invariant affine connection. We 
have (t’(X*)(Y*)),,—=0 in virtue of (14.3). Since an invariant affine 
connection is uniquely determined by the corresponding connection function, 
we see that ¢ and ?’ coincide, that is, ¢ is invariant by op. 


THEOREM 15.4. Let G/H be a symmetric homogeneous space with its 
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canonical affine connection. The covariant differential of any invariant tensor 
field on G/H is zero. In particular, every invariant differential form 1s closed. 


Proof. Let F be an invariant tensor field of type (1,7). Since the 
covariant differentiation commutes with r(a), ae G, AF is also an invariant 
tensor field of type (i,j-+1). A also commutes with oo. Therefore we 


have at po 
=o): A+ = 


and hence AF —0. The second assertion follows from the fact that the 
exterior derivative dw of a differential form w is the alternation of the covariant 
differential Aw of » in the case where the affine connection has a trivial torsion 


tensor. 


THeEoREM 15.5. If G/H is a symmetric homogeneous space with semi- 
simple G, the canonical affine connection is induced by an invariant (indefinite) 
metric on G/H. 


Proof. Let ¢(X,¥) —Trace of ad(X)ad(Y) be the fundamental 
bilinear form of g. Since it is invariant by every automorphism of 4g, 
we have in particular ¢(X,U) = ¢$(X°%,U") =¢(—X,U) and _ hence 
6(X,U) =0 for any Xem and Ue. Let B(X,Y) be the restriction of 
#(X,¥) tom Xm. It is clearly invariant by ad(#H). It is non-degenerate ; 
in fact, if B(Y, Y) = 0 for all Y em, then ¢(X, Y) for all Y eg and hence 
X =0 since ¢ is non-degenerate. B(X,Y) satisfies (13.2). The remark 
which follows Theorem 13.1 shows that the canonical affine connection is a 
metric connection. 


THEOREM 15.6. Let G/H be a symmetric homogeneous space. An 
invariant (indefinite) metric on G/H, if there exists any, induces the 
canonical affine connection. 


Proof. Wet B* be an invariant metric on G/H. Since it is an invariant 
tensor field of type (0,2), we easily see that it is invariant by oo. Therefore 
the induced affine connection is invariant by o») and coincides with the 
canonical affine connection in virtue of Theorem 15. 3. 


In particular, if H is compact, G/H admits an invariant positive-definite 
metric and any such metric induces the canonical affine connection. This is 
what is called a riemannian symmetric homogeneous space. 


16. Irreducible symmetric homogeneous spaces. Let G/H be a sym- 
metric homogeneous space with the canonical decomposition of the Lie algebra 


5d 
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g—=m--b. If ad() in m is irreducible, G/H is called irreducible. In this 
section, we assume that @ is effective on @/H as a transformation group. 
It follows then that ) does not contain any non-trivial ideal of g. This last 
condition is equivalent to the condition that the representation } — ad(bh) 
in m is faithful. 

We shall first prove 


(16.1) If G/H is irreducible, then either g is semi-simple or [m, m] = 0. 


In order to prove this, let ¢(Y, Y) be the fundamental bilinear form of g. 
We have ¢(Y,U) =0 for any Xem and Ueh as we have seen before. 
Let r= {NX eg] (X,Y) =0 for all Yeq}. r is an ideal of g which is 
contained in the radical of g. rm is a subspace of m which is invariant 
by ad(h). m being irreducible by assumption, we have either rM m= (0) 


Let rM m= (0). We shall prove that ¢ is non-degenerate, that is, g is 
semi-simple. Let X = Y-+U, where Yem and Uehb, be in r. For any 
Zem, we have ¢(X1,Z) =¢(Y,Z)+¢4(U,Z) and hence ¢(Y,Z) =0. 
Therefore YerNmand We have Y—UehnMr. For any Zem, 
we have [U,Z] emM r= (0) which means that [U,m] —0. ad(h) in m 
being a faithful representation of h, we must have U0. We have thereby 
proved that X —0 and that ¢ is non-degenerate. 

Next we consider the case where rN m= 1m, that is, m C r. We remark 
that [§,r] is contained in the nilpotent radical n of g since any derivation 
of g maps the radical of g, and a fortiori r, into n. m being irreducible, 
we have [§, m] =m and hence m C [h,r] Cn. Since n is nilpotent, there 
exists a positive integer k such that n® = [n,n®]—(0) and hence 
m® = (0). If & is even, then m®-» is a subspace of m which is invariant 
by ad(h) which means that either m®» =m or m®) = (0). In the first 
case we have [m, m] = [m, m*)] =m = (0). In the second case, we 
have m@-») = [m, m®-*)] where m-*) is a subspace of since k — 2 is even. 
ad(h) being faithful, we must have m®-*) = (0). This argument shows that 
fm, m] = (0). 

Next we prove 


(16.2) If g is semi-simple, then [m, m] =. 


Let 6, = [m, m]. It is straightforward to see that §, ~ (0) is an ideal of 
and that m+ h, is an ideal of g. g being semi-simple, there exists a supple- 
mentary ideal £ such that g is the direct sum of m + }, and f. It is sufficient 
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to prove that f is contained in ); then £ must be zero since } does not contain 
any non-trivial ideal of g. Let Y=Y-+U where Yem, Uech, be any 
element of f. We shall prove that Y=—0. For any Zem, we have 
[Z,X] = [Z, Y] + [Z, U] =0, where [Z,Y]e, and [Z,U]em. Hence 
[Z,Y¥] =—[Z,U] =0. This shows that [Y,m]—0. Next, we have for any 
[V,X] =[V,Y] + [V, U] =0 and hence [V, Y] = 0, which shows 
that [Y,}.]—0. Finally [Y,£]—0 is obvious since Yem. Thus 
[¥,q] 0 and Y —0 since g is semi-simple. We have completed the proof 
of (16. 2). 

If [m, m] = (0), the canonical affine connection of G/H is locally flat. 


We now prove the following theorem which is an extension of a theorem 
of Cartan in the riemannian case. 


THEOREM 16.1. Let G/H be an irreducible symmetric homogeneous 
space where G is semi-simple and effective on G/H. Then the largest con- 
nected group of affine transformations of G/H (of course, with respect to the 
canonical affine connection) is equal to G. 


Proof. Since G/H is complete, the group of all affine transformations 
of G/H is a Lie group.* Let Gi be the identity component of this group 
and let Gi/Ht = G/H. We shall show that Gi/H7 can be considered as a 
symmetric homogeneous space. Let oo be the symmetry as po and define an 
involutive automorphism of of Gi as follows: (at)?t =o -at-oo, which is 
again an affine transformation since the canonical affine connection is 
invariant by oo. ot so defined is obviously an extension of the automorphism 
o which defines G/H. Let Ki be the subgroup of fixed points of ct. We 
show that Ht lies between AY and its identity component KA‘. 


Let at be any element of Ht. Since at: po = po, at induces a linear 
transformation of the tangent space m at po, and so does (at)*t. These two 
linear transformations of m coincide since (o))’- — X for XY em. at and 
(at) being affine transformations, they coincide.® This proves that Ht C Kv. 
In order to prove that K*, C H?, it is sufficient to prove that every element 
in a sufficiently small neighborhood of e in Kt lies in Ht. Let N* be a 
neighborhood of p» in which pp is the only fixed point of oo, and let U be a 
neighborhood of e in G* such that at- N for everyae U. LetateUN Kt. 
Since (at)ot = we have pp = at + p at: po, which implies 


8 See [9]. 
®*See [9]. 
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that at: po = po since at - pye N* and since py is the only fixed point of op 
in N*, This proves that Kt) C H. 

Let gi = mt + 67 be the canonical decomposition of the Lie algebra 
of the symmetric homogeneous space G*/H?. Since G/H = G1/H*, we have 
mi =m. Since C bi, ad(§t) in m is irreducible. Since G/H = Gt/H? is 
not locally flat by assumption, (16.1) and (16.2) imply that [m, m] = pj. 
On the other hand, [m,m]—b. Hence hf =} and gi=g. We have thus 
proved that Gi = G. 

As a consequence we have 


THEOREM 16.2. Let G/H be an irreducible riemannian symmetric 
homogeneous space where G is semi-simple and ejfective on G/H. Then the 
largest group of affine transformations and the largest group of isometries 


of G/H have the same identity component. 
Proof. The identity components of these groups coincide with G. 


We remark that the same conclusion has been proved for any compact 


riemannian space by K. Yano.’° 


17. Affine symmetric spaces. Just as in the riemannian case, we can 
define an affine symmetric space geometrically as follows. Let Mf be a mani- 
fold with an affine connection. For any pe J, there exists a neighborhood V, 
of p with normal coordinates x',z*,- --,2". We define the symmetry op 
at p by for any point (z',2*,-- -, a") e Np. o is an 
involutive homeomorphism of NV, onto itself with p as the only fixed point. 
M is called an affine symmetric space if o, for every pe M is an affine 
transformation of N, into itself. We wish to prove 


THEOREM 17.1. The following conditions about an affinely-connected 
manifold are mutually equivalent: 

a) M is an affine symmetric space ; 

b) T=0 and AR=0; 

ec) For any peM, there exists a neighborhood N of p and an affine 
transformation of N onto a neighborhood of a certain symmetric homogeneous 
space with its canonical affine connection (we say simply that M is locally 
representable as a symmetric homogeneous space). 


10 He proved that any 1-parameter subgroup of affine transformations of a compact 
orientable Riemannian manifold is in fact a 1l-parameter subgroup of isometries. 
See [12]. 
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The equivalence of these conditions in the riemannian case has been 
shown by Cartan. ‘The affine case has been partly treated by him and 
J. H. C. Whitehead.” 


Proof. Theorem 15.3 shows that c) implies a). We shall prove here 
that a) implies b). The proof of the most essential part b) > c) will be given 
as a special case of Theorem 18.1 in Chapter IV. 


In order to prove that a) implies b), let p be any point of M with a 
normal neighborhood N,. For each tangent vector X at p, we define an 
infinitesimal transformation X* in N, by the parallel displacement along 
paths, that is, if ge N, lies on a path C joining p and gq in Np, we define 
(X*), as the result of the parallel displacement of XY along C. Since 
(op)’: X =—JZX at p and since oy, is an affine transformation of N, into 
itself by assumption, we easily see that (o,)’-X¥* —=—X*. The torsion 
tensor field T is invariant by op, that is, 


for the infinitesimal transformations 1* and )* corresponding to any tangent 


vectors X and Y at p. Using (o,)’: ¥* = —X*, we obtain 
((op)’- T(X*, Y*))>—=— (T(X*, Y*))> 


and, on the other hand, 


that is, (T(X*, Y*)), = 0. Since X and Y are arbitrary tangent vectors at 


any arbitrary point p of J/, we have thus proved that T is zero everywhere. 
AR = 0 can be proved in a similar fashion by making use of (4. 5). 


Chapter IV. Affine Connection Invariant By Its Parallelism. 


18. Affine connection invariant by its parallelism. We shall define a 
type of affine connection which is the simplest next to that treated in 
Chapter III. Let J/ be an affinely-connected manifold. Let C be a path 
from a point po to a nearby point p, We denote by (7(C))’ the parallel 
displacement of vectors along C, which is a linear mapping of the tangent 
space 7'(.) onto the tangent space T(p,). Let NV, be a normal neighborhood 
of p, and let N, be a sufficiently small normal neighborhood of ‘po. Each 


11 See [2] and [11]. 
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point go e N, lies on a certain path with tangent vector, say X, at po. Let q, 
be the point which is on the path with tangent vector (r(C))’-X at p, and 
which corresponds to the same value of the canonical parameter as qo. We 
denote the mapping which sends gq, into g, so determined by r(C) and call 
it a parallel transformation determined by the path C. 7(C@) is a local 
homeomorhpism of N, into N, and clearly induces the linear mapping 
(7(C))’ of T(po) onto T(p:). 

If the affine connection of WV is invariant by all parallel transformations, 
we say that the affine connection is invariant by its parallelism. 


We shall now establish 


THEOREM 18.1. The following conditions about an affinely-connected 


manifold are mutually equivalent: 
a) The affine connection is invariant by its parallelism ; 
b) AT and AR=0; 
c) It is locally representable as a reductive homogeneous space G/H 


with its canonical affine connection of the second kind (with respect to. a 


certain decomposition of the Lie algebra). 


Proof. Let G/H be a reductive homogeneous space with its canonical 
affine connection of the second kind. We see from property (A2) that a 
parallel transformation along a path 2*(s) with origin pois nothing but 
the transformation by the 1-parameter subgroup x(s) whose projection is the 
given z*(s). This fact and the invariance of the affine connection by G proves 


that c) implies a). 


a) —b) is an easy consequence of the following alternative definition 
of the covariant differentiation of tensor fields in any affinely-connected mani- 
fold. Let C(s) be a path with tangent vector X at the origin p where s 
is the canonical parameter. Let +(C(s))’ denote the parallel displacement 
along the path from p up to the point with parameter s. If F is a tensor 
field of type, say (1, 2), then 


(Ax: 2) 
= lian FLY, 2) — F((C(@)))’ - ¥, ((C@)Y’ - 2)} 


for any tangent vectors X, Y and Z at p. 
In order to prove that b) implies c), we need some preparations, which 
we shall do in the next Section. 
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19. Certain identities derived from AT =—0 and AR =—0. In the 
following, we shall denote by X, Y and Z any infinitesimal transformations 
in an affinely-connected manifold M. Tx denotes the mapping Y ~ T(X, Y) 
of X into itself and S{P(X,Y,Z)} denotes the sum of the expression 
P(X, Y,Z) over the cyclic permutations (Y, Y,Z), (Y,Z,X) and (2, X,Y). 
We shall derive certain identities from the conditions AT —0 and AR = 0. 


(19. 1) If AT = 0, then [ Ax, Ty] == ia 
Here Ay is the mapping Z > 1(Z)X of X into itself which was defined in 
Section 4, and [Ax, Ty] is the bracket product of endomorphisms of the 
linear space X. If AT —0, we have 
Az: T(X, Y) — T(Az- X, Y) — T(X, Az: VY) 
—Ty:Az: Y —Ty,,.x:Y =0 


which gives the desired result. 


(19.2) If AT=0, then =0. 


Proof. We introduce the following affine connection ¢’ associated to the 

given affine connection 
Y = t(X)Y + (1/2)T(X, Y).” 

Since the torsion tensor of the new connection ¢’ is equal to 0, its curvature 
tensor R’ satisfies the well known identity S{R’(XY, Y)-Z}—=0. Let A’y be 
the mapping X > ?’(X)Y of X into itself. From A’y = — (1/2)Ty + Ay 
we obtain, by using (19.1) and (4. 4) 
R’(X, Y) = [A’x, A’y] — 

= [— (1/2) Tx + Ax, — (1/2)Ty + Ay] + (1/2)T — 

= R(X, Y) + (t/4)[Tx, Ty] — (1/2){[Tx, Ay] + [Ax, Ty] — Tix,¥3} 

= R(X, Y) + (1/4)[Tx, Ty] — (1/2)T x,y). 
We easily see that 

S([Tx, Ty] -Z} =—2S{T(T(X, ¥),Z)} 

and hence we get (19. 2). 
(19.3) If AR—0, then ©{R(T(X, Y),Z)} 


Proof. Using the identity R(AzxY, Y) — R(AzY, X) = [Az, R(X, Y)] 
which is an immediate consequence of AR = 0, we have 


72 If ¢ is defined by means of I*,,, then ?¢’ is defined by I’*,, = (1/2) (IT, + T*,,). 
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(X, Y), Z)} =S{R(Ax: ¥ X—[X, Y],Z) 
= R(Ay: Y,Z) —R(Ay:X, Z) —R([X, VY], Z) 
4 X) — 21, X) 
+ R(dz° — R(x: Z, ¥) —R([Z, X], Y) 
— S{[Az, R(X, Y)] — Z)) 
= S{[Az, R(X, Y)] — Az] + 
= S{[Az, R(X, Y) + 
= ©{[Az, [Ax, Ay]]} = 0, 
where we have used the Jacobian identity S{[[X, Y],Z]} —0 in & and the 
Jacobian identity S{[Az, [Ax, Ay] ]} —0 for endomorphisms. 


We are now in a position to prove that b) implies c). Let M be an 
affinely-connected manifold such that AJ —0 and AR=0O. Let p be any 
arbitrary point of J/ and m the tangent space at p. We denote by § the Lie 
algebra consisting of endomorphisms of m which, when extended to a deriva- 
tion of the tensor algebra over m, leave the tensors T and F at p invariant. 
Since AJ —0 and AR=0, Ay for any Xem belongs to h, and so does 
R(X, Y) = [Ax, Ay] — Acx,y) for any Y and Yem. Now we define a Lie 
algebra g in the following fashion. As a vector space g is the direct sum of 
m and h}: g=m-+ 5b, where we define the bracket operation as follows: 


For X,Y em, [X,Y] = (—T(X, Y), —R(X, Y)) 
For Xem, Uch, [X, UV] —=—U-X (U operating on X) 
For U,V eb, [U, V] =UV— VU. 


The Jacobian identity is satisfied in virtue of (19.2) and (19.3), and g is a 
Lie algebra of which § is a subalgebra. 

Let G be the simply connected Lie group having g as the Lie algebra 
and H the Lie subgroup of G generated by §. Since § consists of all endo- 
morphisms of m which leave 7 and # invariant, we see that H is a closed 
subgroup. Clearly G/H is a reductive homogeneous space with the above 
decomposition g—=im-h. The canonical affine connection of the second 
kind on G/H has the same values of the torsion and curvature tensor fields 
at po (the point represented by the coset H) as those of the given affine 
connection of M at p. Therefore we shall be able to conclude the proof of 
b) —c) if we prove 


(19.4) Let M and M’ be two manifolds with affine connection which 
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both satisfy the conditions b) of Theorem 18.1. Let pe M and p’e M’ and 
let m and nv’ be the tangent spaces at p and p’ respectively. If A is a linear 
isomorphism of m onto m’ which transforms the torsion and curvature tensors 
at p into those at p’, then there exists an affine transformation of a neighbor- 
hood of p onto a neighborhood of p’ which induces the given A at p. 


Or what amounts to the same, 


(19.5) An affine connection such that AT =0 and AR=0 ts uniquely 
determined, locally, by the values of T and R at a given point. 


(19.5) is well known in the riemannian case. We shall prove the general 
ease by a technique similar to that of Cartan.1% Let po be any point of M 
and X,, X2,- + -,X, a frame at po. Let N be a normal neighborhood of p, 
and define the infinitesimal transformations X,*, X.*,---,X,* by the parallel 


displacement along paths as we did before. Let (Xj*)(Xp*) =D ytyXe* 
i=1 


n 
and set = >) y‘qpw% where - are linearly independent differ- 
q=1 


ential forms such that = If T (X;*, X;*) = is the 
i=1 
expression for the torsion tensor field in N, then T+ = > T+; Ao* is, for 
each 1, a. differential form of degree 2. It is easy to verify 


p=1 


(19. 6) (1/2) = > w'» Aw? — dut for each 1. 
If R(X;*, X;,*) = is the expression for the curvature tensor 
i=1 


field in NV, then Ao is, for each i and 1, a differential form 
j,k=1 


of degree 2 and 


n n 
(19.7) S (1/72) Rt joi Nok = ¥ — for each i and 1. 
= 


j k=1 p=1i 


We remark that we have used the formula 
dwt (Xj*, = [Xj*, X,*]). 


Since AJ —0 and AR=0O, T and R are invariant by the parallel 
displacement. From the choice of Y;*, we see that the coefficients T%;, and 
Ri, are constants. We shall show that (19.6) and (19.7) determine wo! 


13 See Chapter X, IT, pp. 236-238 and Chapter XI, II, pp. 262-265 of [4]. 
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and w‘, and hence the afline connection uniquely. We consider coordinates 

- -,a";t) instead of the normal coordinates (z+) in N. The coordi- 
n 

nates (a‘) remain constant along the path with tangent vector > a‘X; at po. 
i=1 

It is easy to see that w‘, when expressed in terms of a‘ and ¢, can be written 

as w'(t, a‘; dt, da‘) =a‘dt + o*!(t, a’; da‘) where w* does not contain dé. 

On the other hand, ‘,(t, a‘; dt, da‘) = w*‘,(t, a‘; da‘) does not contain dt 

either. We substitute these expressions into (19.6) and (19.7) and take the 

terms which contain dt. We then obtain the following system of differential 


equations : 


(19. 8) dat +S aPo*t, + 
p=1 


n 
w** 


n 
Ow**,/dt == > Riya" 
j,k=1 


with the initial conditions 
(19.9) w**(0,a*; dat) =0, w*!,(0, a‘; da‘) = 0. 


The differential equations (19.8), together with the initial condi- 
tions (19.9), determine wo** and w*‘, uniquely as linear expressions in 
da’, da*,--- , da” with coefficients which are entire functions in ¢, a1, a*,---, a". 
Therefore w' and w‘, and hence the affine connection in N are uniquely 
determined. The existence of the solutions follows from the fact that we 
can construct a homogeneous space @/H with its canonical affine connection 
of the second kind as we have shown before. We have thereby completed 
the proof of Theorem 18. 1. 


20. Final remarks. First we shall complete the proof of b) ->c) in 
Theorem 17.1. By Theorem 18.1, we see that the affine connection in 
question is locally representable as G/H with its canonical affine connection 
of the second kind. Since, in particular, 7 — 0, we see from the construction 
of the Lie algebra g —m-§ that [m,m] Ch. This means that G/H is 
locally representable as a symmetric homogeneous space as we see from the 
proof of Theorem 15.2. This concludes the proof of Theorem 17. 1. 

To sum up, we have the following table which shows a close relationship 
between the simple types of affine connection and the groups spaces or 
homogeneous spaces. 


a) T=0, RO (locally flat) = locally representable as an abelian 
group with its canonical affine connection. 
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b) AT =0, R=0O=locally representable as a group with its (—)- 
connection. 

c) 7 =—0, AR =0 (affine symmetric space) = locally representable as 
a symmetric homogeneous space. 

d) AT +0, ARO (affine connection invariant by its parallelism) 
= locally representable as a reductive homogeneous space with its canonical 
affine connection of the second kind. 


CoLUMBIA UNIVERSITY. 
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SOME THEOREMS CONCERNING SPIRALS IN THE PLANE.* 


By B. J. Batt. 


Throughout this paper the space under consideration will be the Euclidean 
plane. 

The definitions given by R. L. Moore in [6] are essentially equivalent 
to the following, also due to Professor Moore. An arc ¢ is said to spiral 
down on a point O provided there does not exist a finite sequence of rays? 
each starting from O such that the first has no point other than O in common 
with ¢, the last is a straight ray, and no two adjacent rays of the sequence 
have a point other than O in common. The are ¢ is said to go n steps 
toward spiraling down on O provided there does not exist a sequence of n 
rays satisfying the above conditions. An are which spirals down on some 
point is called a spiral and a point set which contains no spiral is said to be 
spiral free. 

In this paper an example is given of a compact, totally disconnected, 
closed point set such that every arc containing it spirals down on each of 
uncountably many points. In order that the compact, totally .disconnected, 
closed point set W should be a subset of a spiral free are it is sufficient that 
the projection of JJ onto some straight line be totally disconnected and it is 
necessary and sufficient that IJ be a subset of the sum of the elements of a 
continuous and equicontinuous collection G of mutually exclusive spiral free 
ares such that G, with respect to its elements as points, is totally disconnected. 
A number of related results are also obtained. 


Definitions. 1. A finite sequence F of simple discs? is called a weak 
chain provided that every two adjacent terms of F have only one point in 
common and every two nonadjacent terms of F are mutually exclusive. The 
terms of F are called its links and the first and last terms of F are called its 
end links. The sum of the links of F' will be denoted by F*, and in general, 
if G@ is a collection of point sets, G* will denote the sum of the sets of the 


* Received April 13, 1953; revised August 18, 1953. 
1 The term ray is here used to denote a closed topological image of a straight ray. 
* A simple closed curve plus its interior is called a simple disc. 
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collection G. 2. The weak chain F is said to have property Y with respect 
to the simple disc K provided there are two points A and B of the boundary 
of K such that (1) there is only one link of F containing A and only one 
containing B and these are the end links of F and (2) F*— (A+B) isa 
subset of the interior of K. A point z of a link f of F will be called a 
special point of f with respect to F and K if x belongs either to the boundary 
of K or to a link of F different from f. 3. If K is a simple disc and n is a 
positive integer, then the weak chain F is said to have property X, with 
respect to K provided that (1) F has property Y with respect to K, (2) F has 
only three links and (3) if O is a point of the center link of F and ¢ is an 
are which intersects both the boundary of K and the center link of F and 
does not go n steps toward spiraling down on O, then if f denotes either of 
the end links of F', ¢-f contains an are which separates the special points of 
f from each other in f. It can be shown that if A and B are two points of 
the boundary of a simple disc K and n is a positive integer, then there exists 
a weak chain F such that F* contains A + 3 and F has property X, with 


respect to K. 


THEOREM 1. If D is a domain, « is a positive number and n is a positive 
integer, then there exist a compact, totally disconnected closed point set M 
and a collection G of mutually exclusive simple discs such that (1) G* + M 
is a closed subset of D, (2) each element of G has diameter less than « and 
(3) if t is an are containing M, there is an element g of G such that t goes 


n steps toward spiraling down on every point of 9. 


Proof. Let K be a simple disc of diameter less ‘than e lying in D and 
let F be a weak chain having property X, with respect to K. If f is an end 
link of F, there exists a weak chain c¢ having property Y with respect to f 
such that c* contains the special points of f and each link of c has diameter 
less than 1. Let C, denote a collection consisting of one such weak chain 
for each end link of F. If q is a link of an element of C,, there exists a weak 
chain h having property X, with respect to q such that h* contains the 
special points of g. Let H, denote a collection consisting of one such weak 
chain for each simple disc which is a link of an element of C;. For each 
end link f of an element of H, there exists a weak chain ¢ having 
property Y with respect to f such that c* contains the special points of f 
and each link of c has diameter less than 1/2. Let C. denote a collection 
consisting of one such weak chain for each simple disc which is an end 
link of an element of 7,. Jf q is a link of a weak chain of the collection 
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C., there is a weak chain h having property X, with respect to q such that h* 
contains the special points of g. Let H. denote a collection consisting of one 
such weak chain for each simple dise which is a link of an element of C,. This 
process may be continued. Hence there exist two sequences Ho, H,, H,, - 
and C2, such that Hy is the collection whose only element is # and 
for each positive integer i, the following statements are true: (1) C; and H; 
are finite collections of weak chains, (2) every link of an element of C; has 
diameter less than 1/i, (3) if ¢ is an element of C;, there is an end link f of 
an element of Hj, such that c has property Y with respect to f and c* 
contains the special points of f, (4) if f is an end link of an element of Hj, 
there is exactly one element c of C; such that c has property Y with respect 
to f and c* contains the special points of f, (5) if h is an element of Hj, 
there is a link q of an element of C; such that h has property XY, with respect 
to g and h* contains the special points of g, (6) if q is a link of an element 
of C;, there is exactly one element h of H; such that h has property X, with 
respect to g and h* contains the special points of q. 


For each non-negative integer 7, let 1/; denote the sum of all simple discs 
f such that f is an end link of an element of H;. Then I; is closed and con- 
tains .V;,,. Let 1/ denote the common part of the point sets Mo, M,, M2,---. 
Then M is closed and compact, and contains all the special points of the end 
links of the elements of Zo, Hi, H2,---. For each positive integer 7, every 
component of 1/; is a subset of the sum of two intersecting links of an element 
of C; and consequently has diameter less than 2/i. Hence JM is totally dis- 
connected. Let G denote the collection of all simple discs g such that for 
some non-negative integer t, g is the center link of an element of H;. The 
elements of G are mutually exclusive. If 7% is a positive integer and g is the 
center link of an element of Hj, then every point of g is at a distance less 
than 1/i from some point of @. Hence G* + J is closed. 

Suppose ¢ is an are containing WV and there is no element g of G such 
that ¢ goes n steps toward spiraling down on each point of g. There exists 
a ray r starting from a point O of the center link of F and not intersecting ¢ 
such that no are of r goes n steps toward spiraling down on O. Let f, denote 
one of the end links of F and let x and y denote its special points. Since F 
has property ‘, with respect to K and r contains an are intersecting both 
the boundary of K and the center link of F, r contains an are which separates 
x from y in fo. . 

Since r does not intersect J/, there exists a positive integer 7 such that 
for no end link f of an element of H; does r contain an are which separates 
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the special points of f from each other in f. Let 7 denote the smallest such 
integer i. There exist a simple disc f, two points x and y of f, and an arc u 
of r such that f is one of the end links of an element of Hj, x and y are 
the special points of f, and w is a subset of f which separates x from y in f. 
Let c be the element of C; which has property Y’ with respect to f. Since x 
and y belong to c* and wu separates x from y in f and does not contain any of 
the special points of the links of c, there exists a link q of ¢ such that u 
contains an are which separates the special points of g from each other in 4. 
Let z and w be the special points of g and let v be a subare of w which is 
irreducible with respect to the property of separating z from w in g. Let h 
denote the element of H; which has property X, with respect to q and let f, 
and f, denote the end links of h containing z and w, respectively. Let L 
denote the component of g—v which contains z and let N denote the other 
component of g—v. Since the special points of f, belong to M and v does 
not intersect J/ and contains no are which separates the special points of f, 
from each other in f;, both of the special points of f, belong to Z. Similarly, 
both special points of f. belong to VN. Since both special points of f, belong to 
L and one of them belongs to the boundary of g and the other belongs to the 
center link of h, ¢ contains an are ¢’ which is a subset of Z and intersects 
both the boundary of g and the center link of h. Since h has property XY, 
with respect to g and there is a point of the center link of h such that t’ does 
not go n steps toward spiraling down on it, /’ contains an are which separates 
the special points of f, from each other in f2. Since ?’ is a subset of Z and 
LE does not contain either of the special points of f., L-f, separates the 
special points of f. from each other in f, and hence so does v- fz. This involves 
a contradiction. Hence if ¢ is an arc containing M there is an element g of G 
such that ¢ goes n steps toward spiraling down on every point of g. 


CoroLuary 1.1. Jf A and B are two points, then for every positive 
integer n there exists a compact, totally disconnected, closed point set M,, 
not containing A or B such that every are from A to B which does not 
intersect M, goes at least n steps toward spiraling down on some point. 


Proof. The argument for Theorem 1 can be used almost without 
modification to establish that if P and Q are two points of the boundary of 
a simple disc K, then for every positive integer n there exists a totally dis- 
connected, closed subset H, of K such that every are lying in K —/H], and 
separating P from Q in K goes at least n steps toward spiraling down on 
some point. For any two points A and B there exist simple discs K, and K, 
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with only two points, P and Q, in common such that K, + K, separates A 
from B. If Hp; and Hy,» are compact, totally disconnected, closed subsets of 
K, and K,, respectively such that for 1 = 1 or 2, every arc lying in Ki; — Hy 
and separating P from Q in K; goes n steps toward spiraling down on some 
point, then every arc from A to B which does not intersect Ay; + Hnz goes 
n steps toward spiraling down on some point. 


THEOREM 2. There exists a compact, totally disconnected, closed point 
set such that every arc containing it spirals down on each of uncountably 


many points. 


Proof. It follows from Theorem 1 that there exist a compact, totally 
disconnected, closed point set 1, and a collection H, of mutually exclusive 
simple discs such that (1) JM, + H,* is closed and compact, (2) each element 
of H, has diameter less than 1, (3) if ¢ is an are containing M,, then there 
are two elements of H, such that ¢ goes 1 step toward spiraling down on 
every point of their sum. If g is an element of H,, there exist a closed, 
bounded, totally disconnected point set M, and a collection Hy of mutually 
exclusive simple discs such that (1) M,-+ H,* is a closed subset of g, (2) 
each element of H, has diameter less than 1/2, (3) if ¢ is an are containing 
M,, there are two elements of H, such that ¢ goes 2 steps toward spiraling 
down on every point of their sum. Hence there exist a point set M. and 4 
collection H, of mutually exclusive simple discs each of diameter less than 
1/2 such that (1) M, + M, is closed and totally disconnected (2) H,* con- 
tains M, + H,.* and M, intersects every element of H, (3) if g is an element 
of H, and ¢ is an arc containing g- M2, then there are two elements of H; 
lying in g such that ¢ goes 2 steps toward spiraling down on every point of 
their sum. 


Thus there exist two sequences M,, M2, M;,- - - and Hi, He, H3,- - - such 
that if n is a positive integer, then (1) M, is a point set and H, is a 
collection of mutually exclusive simple discs each of diameter less than 1/n, 
(2) M,+M.+M,+---:-+M, is closed and totally disconnected and 
M,+M.+M@;+:--+M,-+H,* is closed and compact, (3) H,* con- 
tains Mn + Hnu.* and M,,, intersects every element of Hn, (4) if g is an 
element of H, and ¢ is an are containing g- /,,,,, then there are two elements 
of H,., lying in g such that ¢ goes n steps toward spiraling down on every 
point of their sum. 

For each positive integer n, let M,’=—M,+ M.+---+M,-+ 
Then M,’ is closed and compact, each component of M,’ has diameter less 


t 
e 
i 
0 
I 
h 
il 
0 
t 
t 
a 
d 
8} 
u 
ly 
st 
al 
it 
al 


CONCERNING SPIRALS IN THE PLANE. 71 


than 1/n, and M,’ contains M,,,’.. If M denotes the common part of the 
point sets M,’, M.’, M;’,---, then M is closed, compact and totally dis- 
connected. Suppose ¢ is an are containing M. Since ¢ contains J/,, there 
exist two simple discs g, and g» of H, such that ¢ goes 1 step toward spiraling 
down on every point of their sum. Since ¢ contains M.-g, and M,- g2, there 
exist four simple discs 911, 912, “. and gs. of H» such that g; and giz lie in 
iN 91, Jor and Gee lie in ge, and ¢ goes 2 steps toward spiraling down on every 
point of their sum. This process may be continued. Hence there exists a 
sequence G,, G2, G;,- - - such that if n is a positive integer, then (1) G, is a 
collection of 2” mutually exclusive simple discs, (2) each element of Grit 
is a subset of some element of G, and each element of G, contains two elements 
of Gy and (3) ¢ goes n steps toward spiraling down on each point of G,*. 
Let K denote the common part of the point sets G,*,@.*, Gs*,---. Then 
K is an uncountable subset of M. If O is a point of K, then for every positive 
integer n, t goes n steps toward spiraling down on O. Hence ¢ spirals down 


on every point of K. 


THEOREM 3. If the arc t spirals down on the point O, then O belongs 
to t and every subarc of t which contains O spirals down on O. 


THEOREM 4. Jf two arcs have only the point O in common and one of 


them spirals down on O, so does the other. 


Definition. Suppose is either a domain or a nondegenerate continuous 
curve. The reversibly continuous transformation 7 of M into some point 
set is said to be spiral preserving provided that if uw is an are lying in M 
and P is a point of VM, then wu spirals down on P if and only if T(w) spirals 
down on T(P). 


THEOREM 5. A reversibly continuous transformation of a domain D 
into a domain is spiral preserving provided every point of D belongs to a 
spiral free arc which is transformed into a spiral free are. 


Proof. Suppose wu is an are lying in D and P is a point of D such that 
u does not spiral down on P. There exists a sequence w;, U2,° * +, Un of arcs 
lying in D such that u,-u = P, = + +, Un1 =P and wu, is a 
straight line interval. Suppose 7’(w) spirals down on T(P). Since T(w) 
and 7'(u,) have only 7(P) in common and 7 (u) spirals down on T(P) 
it follows from Theorem 4 that T(u,) spirals down on T(P). Since T'(u;) 
and 7'(u,) have only 7(P) in common, 7'(uz) spirals down on T(P). Hence 
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T(u,) spirals down on T(P). Let g be a spiral free are lying in D and con- 
taining P such that T(q) is spiral free. There exists a sequence 9, J2,°** 5 9m 
of ares lying in D such that gi: g =P, 92-9: 9m* =P and 
gm is a straight line interval. Since T7(g) and 7(g,) have only T(P) in 
common and 7(g) does not spiral down on 7(P), it follows from Theorem 4 
that 7'(g,) does not spiral down on T(P). Since T(g,) and T(g:) have 
only 7(P) in common, 7(g2) does not spiral down on 7'(P). By induction, 
T (gm) does not spiral down on T'(P). Since 7'(u,) spirals down on T(P) 
and T(gm) does not, T(u,) and T'(gm) have a point different from 7'(P) 
in common. Consequently, since u, and gm are straight line intervals, there 
exists an interval v which contains P and is a subset of both wu, and gy». 
Since 7(v) is a subare of T(u,) containing 7'(P) and 7'(u,) spirals down 
on T(P), by Theorem 3, T(v) spirals down on T(P). It follows that 7(gn) 
spirals down on T(P). This is a contradiction, and hence 7T(w) does not 


spiral down on T(P). 


It follows from the above argument that if v is an are lying in T(D) 
and Q is a point of 7(D) such that v does not spiral down on P, then 
T-*(v) does not spiral down on 7-1(Q). Hence if wu is an are lying in D 
and P is a point of D such that wu spirals down on P, then T(w) spirals down 
on 7'(P). Hence the transformation 7 is spiral preserving. 


CoroLuary 5.1. Every conformal mapping of one domain onto another 


is spiral preserving. 


This is an immediate consequence of Theorem 5 and the fact that no 
conformal mapping transforms a straight line interval into an are which 


spirals down on a point. 


Definitions. A collection G of points sets is said to be continuous pro- 
vided it is true that if the element g of G contains a point of the sequential 
limiting set of a sequence of elements of G, then g is the sequential limiting 
set of that sequence. <A collection G of arcs is said to be equicontinuous 
provided that for every positive number « there exists a positive number 6 
such that if # and y are two points of an are g of G at a distance apart less 
than 6, then the diameter of the interval zy of g is less than e. 


TueoreM 6. If A, B, D, and C are four points lying in that order on 
the simple closed curve J and no arc of J spirals down on any point, then 
there is a continuous and equicontinuous collection of G of mutually exclusive 
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spiral free arcs filling up J plus its interior such that each are of @ ts 
irreducible from AB to CD. 


Proof. Wet I denote the interior of J and let J’ and I’ denote a circle 
and its interior, respectively. There is a reversibly continuous transformation 
T of 1+ J into I’ + J’ which represents J conformally on I’ ([2], pp. 81-86). 
(The use of this theorem was suggested by Professor E. J. McShane.) By 
Corollary 5.1, 7 is spiral preserving over 7. Since no are of J spirals down 
on any point, T is spiral preserving over J+J. Let A’, B’, C’, and D’ denote, 
respectively, 7(A), T7(B), T7(C) and T(D). It can be shown that there is 
a continuous and equicontinuous collection G of mutually exclusive spiral free 
ares filling up J’-+ J’ such that each are of @ is irreducible from A’B’ to 
C’D’. The collection 7-*(G) is a continuous and equicontinuous collection 
of mutually exclusive spiral free ares filling up J + J and each are of T-'(G) 
is irreducible from AB to CD. 


TuHeorEM 7%. Let J denote a rectangle with vertices A, B, D, C, such 
that the interval AB is vertical, A is above B, C is above D, and D is to the 
right of B. Let I denote the interior of J. Suppose G is a collection of 
mutually exclusive arcs such that (1) each are of G 1s irreducible from AC 
to BD and is a subset of J +1, and (2) each pair of points of J +I which 
lie on the same horizontal line are separated in J +I by an element of G. 
Then there exists a continuous and equicontinuous collection G’ of mutually 
exclusive arcs such that (1) @*=J-+-I, (2) every are of G’ is irreducible 
from AC to BD, (3) no horizontal line contains two points of any are of G’, 
(4) G is a subcollection of G’. 


Lemma 7.1. Under the hypothesis of Theorem 7, G is equicontinuous. 


Proof. Suppose G is not equicontinuous. It follows that there exist 
a sequence 9;, J», of ares of G and three sequences 2%, V2, %3,° * *, 
Yis Yo, * aNd 2, Zo, 23,° Such that (1) for each n, 2p, 2n, Yn lie in 
that order on g, with x, above z, and Z, above yn, (2) as N00, Lp and Yn 
converge to a point x and z, converges to a point z different from x. Let /! 
denote the horizontal line through x. Since for each n, 2, is above z, and Zp 
is above yp, it follows that z is on 1. Since 2 and z are on the same horizontal 
line, there is an arc of G@ which separates x from z in J + TJ. This involves a 


Y 


contradiction. Hence G is equicontinuous. 


Proof of Theorem %. Suppose g is the sequential limiting set of a 
sequence @ of arcs of G. Then g is a continuum lying in J + I which inter- 
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sects both AC and BD. Suppose g’ is an element of G which intersects g. 
If g is not a subset of g’, there exist points 2 and y on the same horizontal 
line such that « belongs to g and y belongs to g’. Let g” be an element of G 
which separates x from y in J + J and let K, and Kk, be the components of 
(J +1) —g” containing x and y respectively. Since K, is an open set 
relative to J + J and contains a point of the sequential limiting set of a, and 
the boundary of K, relative to J + J does not intersect more than one element 
of a, K, contains all but a finite number of the elements of «. But this is 
impossible since K, contains g’ and g’ intersects g. An almost identical 
argument shows that g cannot contain two points of any horizontal line. It 
follows that g is an are irreducible from AC to BD and is either an element 


of G or does not intersect G*. 


Let G’ denote the collection of all ares g such that g either belongs to G 
or is the sequential limiting set of some sequence of ares of G. It is easily 
see that the collection G’ satisfies the desired conditions. 


THEOREM 8. Let A, B, C, D, J, and I be defined as in the hypothesis 
of Theorem % and suppose M is a closed, totally disconnected point set lying 
in J +I such that the projection of M onto AB is totally disconnected. Then 
there exists a continuous and equicontinuous collection G of mutually exclusive 
arcs such that (1) G* =J +1, (2) each arc of G is irreducible from AC to 
BD, (3) no horizontal line contains two points of any arc of G, (4) the 
common part of BD and the sum of the elements of G which intersect M is 
totally disconnected. 


Lemma 8.1. Under the hypothesis of Theorem 8, if h and k are two 
arcs lying in J +-I, each irreducible from AC to BD and neither containing 
two points of any horizontal line, then there exists an are t lying between 
h and k such that t does not intersect M, is irreducible from AC ta BD 
and does not contain two points of any horizontal line. 


Proof. There exists a sequence M,, M2, M;,- - - of points sets such that 
(1) for each positive integer n, M, contains M,,,, (2) for each n, M, has 
only a finite number of components and if m is a component of M,, then m 
is a simple disc bounded by an interval of AB, an interval of CD, and two 
horizontal intervals, (3) the common part of M,, M2, Ms3,- - - is.the set of 
all horizontal intervals with end points on AB and CD which intersect M. 
Suppose that for each positive integer n, there is a component m of M, such 
that there is no vertical interval lying in m—m-M and separating m-h 
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from m-k in m. It follows ([5], 56, Theorem 78) that there exists a 
sequence Ms, such that for each positive integer n, (1) my is a 
component of M,, (2) mp contains mn,1, (3) every vertical interval separating 
m,*'h from m,:k in m, intersects M. If 1 is the horizontal line containing 
the common part of m,, m2, ms3,° then every point of between and 
1-k belongs to 1. But this is not possible since M is totally disconnected. 
Hence there exists an integer m) such that if m is a component of M,,, there 
is a vertical interval lying in m and not intersecting M and separating m-h 
from m-k in M. Hence there exists an are ¢ irreducible from AC to BD 
such that (1) ¢ lies between h and k, (2) if m is a component of MJ,,, then 
t-m is a vertical interval not intersecting J/ and separating m-h from 
m:k in m, (3) if D is a component of J—-I- Mo, then ¢- D is an are which 
does not contain two points of any horizontal line. It follows from (2) and 
(3) that no horizontal line contains two points of t. 


Proof of Theorem 8. Suppose H is a finite collection of mutually 
exclusive ares each irreducible from AC to BD and lying in J+ TI and no 
are of H contains two points of any horizontal line. Then the maximum 
width of a component D of (J + 1) — H* will mean the length of the longest 
horizontal straight line interval lying in D. Let d denote the length of AC. 
With the aid of Lemma 8.1 it can be shown that there exists a sequence 
G1, Gs, Gs,* + - such that for each n, (1) Gn is a finite collection of mutually 
exclusive arcs each lying in (J + 7) — WM and irreducible from AC to BD, 
(2) no element of G, contains two points of any horizontal line, (3) Gy is a 
subcollection of G1, (4) if D is a component of (J + 7) —G,*, then the 
maximum width of D is less than (3/4)"-d. Let G=G,+G62+-: 


Suppose x and y are two points of J + J lying on the same horizontal 
line. There exists an integer n such that (3/4)"-d< p(a,y) and hence 
some element of G, separates x from y in J+J. Hence G@ satisfies the 
hypothesis of Theorem 7 and consequently there exists a continuous and equi- 
continuous collection G’ of mutually exclusive ares such that (1) =J +], 
(2) every are of G’ is irreducible from AC to BD, (3) no horizontal line 
contains two points of any element of G’, (4) @ is a subcollection of G’. 
Since G* does not intersect M and is dense in J +7, the common part of 
BD and the sum of the elements of G’ which intersect / is totally disconnected. 


THEOREM 9. Jf M is a compact, totally disconnected, closed point set 
and the projection of M onto the y-axis is totally disconnected, then there 
exists a reversibly continuous, spiral preserving transformation T of the plane 
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into itself which carries each horizontal line into itself and is such that the | 


projection of T(M) onto each of the coordinates axes is totally disconnected. 


Proof. There exists a rectangle J with interior J and vertices A, B, D, C, 
such that VM is a subset of J and AB is vertical, A is above B, C is above D 
and D is to the right of B. There exists a continuous and equicontinuous 
collection G of mutually exclusive ares satisfying the conditions of Theorem 8 
with respect to J, J, A, B, C, D, and M. There exists a reversibly continuous 
transformation 7 of the plane into itself which carries each are of G into a 
vertical straight line interval and which carries each horizontal line into itself 
(See [4], Theorem 1). By Theorem 5, T is spiral preserving. It may be 
easily seen that the transformation 7 carries JJ into a point set whose pro- 


jections onto the coordinate axes are totally disconnected. 


THEOREM 10. Suppose R is a rectangle with horizontal and vertical 
sides and G is a finite collection of mutually exclusive rectangular discs with 
horizontal and vertical edges such that G* is a subset of the interior of R. 
Let AB and CD denote the vertical sides of R. There exists a finite collection 
H of mutually exclusive arcs such that (1) each are of H has one end point 
on AB— (A+B) and the other on CD— (C + D) and lies except for its 
end points wholly in the interior of R, (2) no vertical line contains two points 
of any arc of H, (3) if g ts an element of G, there exist two arcs hg and k, 
of H such that h, contains the upper edge, and kg the lower edge, of g and 
(4) no element of H intersects two elements of G. : 


THEOREM 11. If the totally disconnected point set M is the common 
part of a sequence M,, Mo, M3,: - - of point sets such that for each n, M, is 
the sum of a finite number of mutually exclusive rectangular discs with 
horizontal and vertical edges, then there exists a spiral free arc containing M. 


Proof. Under the above hypothesis, M is the common part of a sequence 
M,’, M.’, M;’,- - - of point sets such that for each n, (1) M,’ is the sum of 
a finite number of mutually exclusive rectangular discs, (2) every component 
of M,’ intersects WV and (3) every component of M,,,’ is a subset of the 


interior of some component of J,’. 


Let R denote a rectangule with horizontal and vertical sides whose 
interior, J, contains M,’. Let A, B, D, C denote vertices-of R such that A 
is above B, C is above D, and D is to the right of B. Let M,’ denote R +1] 
and let G, denote the collection of all components of M,’. There exists a 
finite collection H, of ares which satisfies the conditions of Theorem 10 with 
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respect to 2 and G,. If m is a component of M,’, there exists an arc t which 
is the sum of the left edge of m and two ares ¢, and ¢, each of which is 
irreducible from m to AB and is a subset of some are of H,. Hence there 
exists an are u, from A to B which is a subset of AB + H,* + M,’ and which 
contains the left edge of each component of M,’. Let K, denote the point 
set + 

Let m,,m.,* Mm, denote the components of J/,’.. For each positive 
integer i not greater than n, let R; denote the boundary of m; and let xy; 
denote the left edge of m;. Let G2; denote the collection of all components 
of M.’-m;. There exists a finite collection H.; of arcs which satisfies the 
conditions of Theorem 10 with respect to R; and G.;. By an argument similar 
to that given above, it can be shown that there exists an are ux; from a; to yi 
which is a subset of ayy; + H.;-+ m;- 1.’ ard which contains the left edge 
of each component of m;- M.’. Let K. denote the point set 


This process may be continued. 

Hence there exists a sequence K,, K»2, Ks,- - - of continua such that for 
each positive integer n, (1) Ky contains Ky,,, (2) Ky is the sum of WM,’ and 
an are u, from A to B which contains the left edge of each component of 1,’, 
(3) if P is a point of K, which is not in the interior of any component of 
M,’, then there is a ray r starting from P such that no vertical line contains 
two points of r and such that r— P does not intersect Ky. 

Let K denote the common part of the point sets K,, Ke, K3,- - +. Then 
K is a compact continuum containing 31J-+ A-+ B. It may be shown that 
K contains no continuum of condensation and hence is a continuous curve. 
If P is a point of K—(M-+A+B), then for some n, P does not belong 
to M,’. It follows that P separates A from B in M,’ and therefore separates 
A from B in K. Since for each n, every component of 1,’ separates A from 
B in K, and K is a continuous curve, it follows that every point of VW 
separates A from B in K. Hence K is an are from A to B. 

If P is a point of K, there is a ray r starting from P such that no 
vertical line contains two points of r and such that »— P does not intersect 
K. Consequently K does not spiral down on any point. 


THEOREM 12. If G@ is a continuous and equicontinuous collection of 
mutually exclusive, spiral free arcs such that G* is closed and compact and 
does not separate the plane, then there exist a rectangle ABDC and a reversibly 
continuous, spiral preserving transformation of the plane into itself which 
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carries each arc of G into a straight line interval which has its end points on 
AB and CD and is parallel to BD. 


Proof. There exist a simple closed curve J and four points A, B, D, C 
lying in that order on J such that every are G is irreducible from AB to CD 
and lies except for its end points wholly in J, the interior of J ({1], Theorem 12). 
With the aid of Theorem 6, the arguments for Theorems 11 and 13 of [1] 
can easily be modified to show that there exists a continuous and equicon- 
tinuous collection G, of mutually exclusive arcs such that (1) G.* =J +], 
(2) each are of G, is irreducible from AB to CD, (3) @ is a subcollection 
of G,, (4) no are of G, spirals down on any point of J. 


There exist two mutually exclusive ares wu, and uw, such that (1) A and B 
are the end points of u,, wu; contains G*- AB, and u, — (A + B+ G*: AB) 
is a subset of J, (2) C and D are the end points of ws, u. contains G*- CD, 
and uw.— (C+ D+ G*-CD) is a subset of J, (3) no are of G, contains 
two points of wu; or of u2. Let G. denote the collection of all arcs g such that 
g is a subare of some are of G,, and is irreducible from wu, to uz. Then G, is 
a continuous and equicontinuous collection of mutually exclusive spiral free 
ares, G.* is a simple disc, and G@ is a subcollection of G». 

Let g, and g. denote the ares of G, from A to C and from B to D, 
respectively. Let 7, denote the boundary of G,*. Then Jo=u,+u.+9:+92. 
Let J, denote a rectangle which has G.* in its interior. Let R denote the 
connected domain bounded by J, + J. and let R’ denote a connected domain 
bounded by two concentric circles J,’ and J,’ with J,’ in the interior of Jy’. 
There exists a reversibly continuous transformation 7’ of R+J,-+ J, into 
R’ + J,’+ J.’ which is conformal over R ([2], pp. 72 and 86). Since no 
are of J, or of J. spirals down on any point, it follows that T’ is spiral pre- 
serving over R + J,+ J.. The vertices of J, may be labeled A’, B’, D’, C’ 
in such a way that the points 7’(A’), T’(B’), T’(D’), T’(C’) lie in that 
order on J,’ and are in the same sense on J,’ that the points T’(A), T’(B), 
T’(D), T’(C) are on J’. Let A” and B” be points lying in the order 
A’A”B’B’ on A’B’ and let C” and D” be points lying in the order C’C’’D’’D’ 
on C’D’. There exist four mutually exclusive spiral free arcs each lying in 
R’ except for their end points and such that one of them has end points 
T’(A) and 7’(A”), one has end points 7’(B) and 7’(B’’), one has end 
points 7’(C) and 7’(C’’), and one has end points T’(D) and T’(D”). The 
inverses of these ares are four mutually exclusive spiral free ares AA”, BB”, 
CC”, and DD”, each lying in R except for its end points. These ares divide 
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R into four simple domains. By applying Theorem 6 to each of these and 
adding together certain ares, it can be shown that there exists a continuous 
and equicontinuous collection G’ of mutually exclusive, spiral free arcs such 
that (1) G’* is the sum of J, and its interior, (2) each are of G’ is irre- 
ducible from A’B’ to C’D’, (3) each are of Gs is a subare of some arc of G’. 

Suppose A’B’ is vertical. There exists a reversibly continuous trans- 
formation 7 of the plane into itself which carries each are of G’ into a 
horizontal straight line interval with end points on A’B’ and C’D’ and which 
does not change the x-coordinate of any point which is not in the interior 
of J, (See [4], Theorem 1). It follows from Theorem 5 that T is spiral 
preserving. 


THEOREM 13. In order that the compact, totally disconnected, closed 
point set M should be a subset of a spiral free arc tt is necessary and sufficient 
that M be a-subset of the sum of the elements of a continuous and equicon- 
tinuous collection G of mutually exclusive spiral free arcs such that G with 
respect to its elements as points is totally disconnected. 


Proof. The conditions are clearly necessary. The following argument 
shows that they are also sufficient. 


With the help of the fact that every sequence of point sets in the plane 
which has limiting set contains a subsequence which has a sequential limiting 
set ([3], Theorem 5), it may be shown that if G’ is the set of all elements 
of G which intersect 1/, then G’* is closed and compact. Since G with respect 
to its elements as points is totally disconnected, every compoent of G* is an 


Wk 


arc of G. Hence G’* does not separate the plane. 

By Theorem 12, there exist a rectangle ABDC and a reversibly continuous 
transformation 7, of the plane into itself which carries each are of G into a 
straight line interval which has its end points on AB and CD and is parallel 
to BD. Clearly, the projection of 7,(J) onto AB is totally disconnected. 
Suppose AB is vertical. Since the projection of 7,(J/) onto AB is totally 
disconnected, it follows from Theorem 9 that there exists a reversibly con- 
tinuous spiral preserving transformation 7, of the plane into itself which 
carries each horizontal line into itself and which carries 7,(1/) into a point 
set whose projection onto each of the coordinate axes is totally disconnected. 
Let 7 denote the transformation T,7, and let N denote the point set T(1). 

Since the projection of N onto each of the coordinate axes is totally 
disconnected, if g is a component of N and « is a positive number, then there 
exists a rectangle R, with horizontal and vertical sides which encloses q and 
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does not intersect N and is such that every point of it is at a distance less 
than « from g. From this it may be shown that NV satisfies condition (2) 
of the hypothesis of Theorem 11. 

Hence there exists a spiral free are K containing VY. Since V = 7'(J/) 
and the transformation 7 is spiral preserving, 7-*(K) is a spiral free are 
containing M. 


UNIVERSITY OF VIRGINIA. 
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ON PARABOLIC CURVES ON SURFACES.* 


By Puitip HARTMAN and AUREL WINTNER. 


Let S be a piece of a surface (of class C?, at least) in an (2, y, z)-space, 
and let A be a parabolic are on S, that is, an are consisting of parabolic 
points, points at which the Gaussian curvature K of S vanishes. According 
to a consideration of P. Franklin [2], pp. 254-256, such an A (on an 8S of 
class C*, say) is a line of curvature of S if the following three conditions are 
satisfied: (a) no point of A is “singular,” and (b) no point of A is a flat 
point of S (that is, a point at which the matrix of the second fundamental 
form of S is the zero matrix), finally (c) at no point of S does a normal 
section of S possess a flex point. 

It is not clear what Franklin means by “ singular” in assumption (a). 
In this regard, it was shown in [3], Theorem (*), pp. 610-613, that if (a) 
is interpreted as meaning (a*) grad K ~0 along A (which makes (b) super- 
fluous), while (c) is retained, then Franklin’s assertion is certainly true but 
its truth is of a drastic nature; in fact, (a*) and (c) imply that A is a 
plane curve and, what is more, the tangent plane of S does not vary along A 
(so that A is an asymptotic line, as well as a line of curvature, of 8S). In 
what follows, this result of [3] will be paralleled by a similar but deeper 
fact, as follows: 

(**) Let S: z(x,y) be a surface of class C? defined over an open 
(x, y)-domain D. Suppose that there exists in D a Jordan are of class C’, 
(1) —l<i<l, 
which divides D into two domains Dt, D- on which the respective inequalities 
K>0, K <0 hold for the Gaussian curvature K = K(2,y) of S (so that, 
in particular, 

(Ibis) (ay, 2) = y(t))) 


is a parabolic are of class C1 on 8). Suppose further that, if P is any point 
of A, the tangent plane T—T(P) of S at P is a supporting plane of S 
(near P). Then A is a plane curve and, what is more, 7 is independent of P. 
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The assertion of this theorem (**) is the same as that of theorem (*) 
of [3], quoted above. But S is required to be of class C* in (*), and just 
of class C? in (**). Moreover, the assumption grad K #0 of (*) is reduced 
in (**) to the much weaker assumption which requires merely the division 
(by D°) of D into D+ and D-. Finally, assumption (c) of (*) is replaced 
by the last assumption of (**), requiring that S be supported by T(P) at 
every point P of A. It is understood that (even though P is a cluster point 
of points of negative curvature) the tangent plane T(P) is said to support 
S at P if § is contained in either of the closed half-spaces having the boun- 
dary T(P) (at least if S is replaced by a sufficiently small neighborhood of 
P on 8). 

A theorem of A. D. Alexandroff (Theorem 1 in [1]) is concerned with 
parabolic ares on certain closed surfaces and is a “theorem in the large,” 
a fact underlying the proof given by Alexandroff. In what follows, his proof 
will be modified so as to become adjusted to the situation presented by the 
“local theorem” (**) above. Thus the proofs of the facts corresponding 
to those in Sections 5-6 of [1], p. 72, will have to be quite different from the 
arguments used (loc. cit.) by Alexandroff. On the other hand, the end of 
the proof of (**) will be identical with the proof in Section 7 of [1], pp. 72-73. 


Remark. It is worth emphasizing that (**) is not a theorem on a 
piece of surface S with the properties that K >0 on S and that a part 
of the boundary is a (smooth) are A on which K ~0. For example, let 
S: X =X(u,v), where Y = (2, y,z), be a surface of class C? defined on the 
partly closed set (u= 0, u* + v? < r?) in such a way that K = K(u,v) 29 
according as u=0 and that the tangent plane at a point (u,v) = (0,v), 
where v* < r’®, supports S. It does not follow from (**), nor is it true, that 
(i) A: X =YX(0,v) is a plane curve or that (ii) if A is a plane curve, the 
tangent plane does not vary along it. This will be shown by two examples. 


Ad (i). Consider the surface S: + (x -+ 4y’)*, where (2, y) is 
in a vicinity D of a point (Xo, yo) 4 (0,0) of the parabola Do: f(z, y) =9, 
where f=2-+ Since = 2 + 12f?, zry = 12yf?, = + 12y°f?, 
the curvature is 


K =K (x,y) = 4f2(2e + Vy? + 12f%)/(1 + 


Hence, if D is sufficiently small, K = 0 according as (z,y) is not or is on 
Do. Theare A: (— 4y’, y, y*/4), for y near yo, is not a plane curve, although 
S is convex, hence every tangent plane is a supporting plane. 


ist 


ON PARABOLIC CURVES ON SURFACES. 83 


In view of Franklin’s statement quoted above, it is worth pointing out 
that A is not a line of curvature. In fact, = 2, = 0 on A, 
so that the directions of principal curvature at a point of A are determined 
by dz =0 and dy =0 and, therefore, A is not in a direction of principal 
curvature (except at the point (7, y) = (0,0), which is not in D). 


Ad (ii). Consider the surface S: z=2?-+ (r+ y)*. Since 
too = 2+ 12%, = yy 


where f—=2-+y, the curvature K is 24f?/(1+2,?+2,?)*. Hence K 20 
according as (z,y) is not or is on the line Do: x+y=0. The arc 
A: (a,— 2, x’) is a plane curve but is not in a direction of principal curva- 
ture at any point. In particular, the normal vector (2z,0,—1)/(1-+ 42°) 
is not constant along A. 


Proof of (**). Let the notations be so chosen that D is a neighborhood 
of the point (z,y) = (0,0) and that this point is on D®. If r>0 is so 
small that the circle B= B(r): 2?+y? <r’ is in D, let H—=E(r) be a 
neighborhood of (0,0) contained in B and having the following properties: 
E has a decomposition similar to that of D, that is, # = H* + H° + E-, where 
E* are (non-empty) subsets of D*, respectively, and H° is a subset of D® and 
consists of an (open) subare of D®. Thus z—2z(z,y) is defined on the 
closure of H, and the segment joining any pair of points in F is in D. 


It can be supposed that the point (7,y) = (0,0) of D® corresponds to 
t = 0, that 
(2) 2(0,0) =z,(0,0) =z,(0,0) =0, 
and, since the tangent plane z = 0 at (0,0,0) supports S, that 
(3) z(@,y) on D. 
The normal vector V = N (a, y) = (— 22, — 2y, 1) /1 + 22? + 4?)4 is of class 
C, since z is of class C?. Hence, the assumption that D° is of class C1 implies 
that N = N(t) = N(2x(t), y(t)) is also of class Ct. 

Suppose that the assertion of (**) is false, i.e., that dN/dts40. It 


can be assumed that 
(4) dN/dt ~0 


at t= 0 and hence along H° = F°(r), if r is sufficiently small. According 
to (4), no point of H° is a flat point of S. 
It will be shown that, if r(> 0) is sufficiently small, and if (z,y), 
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(x(t), y(t)) are points of respectively, then N(a,y) AN(t). It will 
be clear from the proof that it is sufficient to prove N(a,y) AN(0,0) for 
(x,y) in E*. 

Suppose, if possible, that N(x, yo) = N (0,0) = (0,0,1) holds at a 
point (2, yo) of Et. Since K(x, > 0, the function z has a strict 
extremum at (2, yo). This extremum is a maximum. In order to see this, 
it is sufficient to note that, since K (0,0) = 0, the point (0,0) is not a flat 
point and (3) holds, it can be supposed, after a rotation of the (2, y)-plane, 
that z(2, y) + + y?) as (x, y) > (0,0), wherea > 0. Hence 
Zr2(0, 0) = —a*® < 0, and so 22, < 0 on £ if r is sufficiently small. 

After another rotation of the (2, y)-plane, it can be assumed that 2 > 0 
and = 0. The function z(2, y) is defined on the segment y=0,0 Sx =a, 
which is in D. It follows that z(z,0) has a maximum at 7 = 0 and at x = 2%. 
Consequently, the curvature K = K(a,y) cannot satisfy K(z,0) >0 for 
0<a<= 2, for otherwise the curve z=2z(2,0) in the plane y—0O would 
be concave downwards. Let zx, be the greatest value of r(< x) at which 
K(2z,0) =0. Thus K(2z,0) >0 for 7, << and (2%, 0,2(2:,0)) is a 
point of D°®. Clearly, < 0) = 0, > 0, since the plane 
curve z= 2(2,0) is concave downwards for 7, = 272. The tangent line 
y =0, L(x,z) =0, where L(x, 2) = 0) — (ex — (21, 0), sepa- 
rates (r,z) = (0,0) from (2, 2) = (2, 2(%, 0)) in the (x, z)-plane. In fact, 
L (Xo, 2(Xo, 0) ) = 0) 0) — (% — 21) 22 (41,0) < 0, by the con- 
cavity of the curve z = 0), while Z(0,0) = — 2(2,, 0) + 0) > 0. 
Since the tangent plane to S at the point (2, y) = (2z,,0) contains the line 
L(a,z) =0, y =0, the assumption of (**), according to which the tangent 
plane of S§ at the point (7, 0,2(21,0)) of D° supports 8, cannot hold unless 
that tangent plane is y=0. But the latter case is impossible if r(>0) is 
sufficiently small, since NV (0,0) = (0,0,1). This contradiction proves that 
N yo) N(0, 0). | 

Since the proof of the last inequality depended only on the continuity 
of N and of the second partial derivatives of z at (x, y) = (0, 0), and since the 
continuity is uniform on 2? + y? = 7°, it follows that N(a, y) ~ N(2(t). y(t)) 
if r > 0 is sufficiently small and if (7, y), (a(t), y(t)) are points of Et, F°, 
respectively. 

Let r>0 be fixed and so small that N(2,y) AN(2x(t),y(t)) if 
(x,y) A (x(t), y(t)) is on the closure of E+ and (x(t), y(t)) is on F°; 
cf. (4). Let 


—1l<i<l. 
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be an arc of class C* in such that (x1(0), y*(0)) = (0,0), that (2*(¢), 
is in E* according as ¢ 20, that H* crosses (does not touch) LH at 
(x,y) = (0,0), and that dN (a'(¢), y'(t))/dt A dN (a(t), y(t))/dt (as to 
the last condition, note that a directional derivative of N at (0,0) can be 0 
only in the direction of principal curvature 0). 

If ¢>0 is sufficiently small, then no point (2*,y*) of H* can satisfy 
N(a*, y*) = N(a"(— 1), y(—t)). In order to prove this, suppose, if possible, 
that the last relation is satisfied by a sequence of values t = (,, to,- - 
satisfying t,->-+ 0 as n—>0oo, where (2*, y*) = Yn) is in Then 
N (@n; Yn) > N (0,0) as 00, hence (2p, Yn) (0,0) by virtue of the above 
choice of r and H(r). Let (2*(tn), y*(tn) ) and (Xn, Yn) be joined by a smooth 
Jordan are J, contained in #*. Then the normal image of J, is a locally 
smooth (possibly self-intersecting) are which has the same end points as does 
the normal image of that subare of #* on which —?f, [¢St,. When n is 


> of the are which 


sufficiently large, these end points are on “ opposite sides ’ 
is the normal image of #°. On the other hand, the normal image of J, has 
no point in common with the normal image of H°. This leads to a contra- 
diction, since it can be assumed that the arc-length of J, and, hence, of its 
normal image, tends to 0 as n—>00. 

What remains to be proved in order to complete the proof of (**) depends 
on the conclusions made by A. D. Alexandroff in Section 7, pp. 72-73 of his 
paper [1]; so that the balance of the proof of (**) will now be given only 
for the sake of completeness. 

The plane 7 = T; tangent to S at a point (2,y) = (a(t), y'(t)) of 
E- must intersect S, since K(a'(t), y'(t)) <0; ef.,,e.g., [8], pp. 617-619. 
If —¢ <0 is sufficiently small, T; contains no points (x,y,z) of S, where 
(z, y) is on the boundary of H. For otherwise To, the tangent plane at the 
point (0,0,0) of S, would contain such a point (2%, y). In view of (3), 
this would imply that N (2, yo) = N(0,0) and that K(a,y) 20. But 
this is impossible in view of the above choice of r and H = E(r). 

Since 7; contains no points of (2, y,z) of S at which (z,y) is on the 
boundary of £, there is a (connected) domain S; on S which lies on one 
side of the plane 7; and is such that its boundary points are on T; (the 
projection of S; on the (z,7)-plane is in #). Furthermore, S; has a sup- 
porting plane parallel to T;, and this supporting plane touches S; at some 
point (Zo, Yo, 20). Clearly, K (2, yo) 20. But K(x, yo) = 0 is impossible, 
since the normal images of the ares E° and £ have only the point 
N = (0,0,1) in common, and K yo) > 0 is impossible, since N (2, yo) 
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y'(—t)) for yo) in H* and < 0 sufficiently small. 
Since this contradiction resulted from the assumption (4), it follows 
that dN/dt =0, which proves (**). 


THE JOHNS HOPKINS UNIVERSITY. 


REFERENCES. 


[1] A. Alexandroff, “Uber eine Klasse geschlossener Flichen” (in Russian, with a 
German abstract), Recueil Mathématique, vol. 46 (1938), pp. 69-77. 

[2] P. Franklin, “Regions of positive and negative curvature on closed surfaces,” 
Journal of Mathematics and Physics, vol. 13 (1934), pp. 253-260. 

[3] P. Hartman and A. Wintner, “On geodesic torsions and parabolic and asymptotic 
curves,” American Journal of Mathematics, vol. 74 (1952), pp. 607-625. 


Th 


i 
is 
is 

a 
a 
a 


ON CONTINUOUS AREA-PRESERVING AND LEGENDRE 
TRANSFORMATIONS.* 


By Puitie HartTMAN and AUREL WINTNER. 


This note is concerned with minimal assumptions of differentiability for 
some classical theorems dealing with continuous, one-to-one mappings 


(1) Ti: u=u(t,y),v= (ay) 
which are of the Legendre or “ semi-Legendre ” type. 


In what follows, all domains (i. e., open sets) are simply connected and, 
in fact, “small” vicinities of a point. 


1. Area-preserving transformations. The first theorem to be proved 
is the following: 

(1) Let u=u(x,y), v=v(a,y) be continuous functions on some 
(x, y)-domain. Necessary and sufficient in order that the transformation 
T,: (x,y) — (u,v), defined by (1), has the properties 

(i) ZT, ts one-to-one ; 

(ii) u(a,y) is an increasing function of y for fixed x; 

(iii) TJ, ts orientation-preserving ; 


(iv) T, ts area-preserving, 
is the existence of a function f =f (a, u) having the properties 


(«) f has continuous partial derivatives fr, fu; 

(B) fe(x,u) is an increasing function of u for fixed x; 

(y) fu(av,u) is an increasing function of x for fixed u; 

(8) TT, ts representable as the product T, =T;T.-1, where and T's 
are the respective (one-to-one) semi-Legendre transformations (2, u) 
and (x,u) — (u,v) defined by 


(2) ©=2,y = f2(2, u) 
and 
(3) U== U, Vv = — fy (2, 
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This theorem is classical ([1]; ef. [4]) if it is assumed that wu and v 
are of class C' (and, correspondingly, f is of class C?) and that (11) is replaced 
by u, > 0 and (y) by fue > 0. 

If a funetion f(2,u) with the properties («), (8), (y) is given, then 
(2), (3) and the relation 7, determine a unique transformation 
(1) with the properties (i)-(iv). Conversely, if (1) is given with the 
properties (i)-(iv) (on a simply connected (x, y)-domain), then an f satis- 
fying (2)-(8) is uniquely determined up to an additive constant. 


Proof of (1). Let (1) define a continuous transformation T, with the 
properties (i)-(iv) and let the inverse of 7’, be denoted by 


(4) Ty: 


Since u(az,y) in (1) is an increasing function of y for fixed 2, the 
equation wu =u(2,y) has a solution y=y(x;u) for y. Let D denote the 
domain of definition of y(x;u) (so that D is simply connected). Clearly, 


(5) T;: 2, y (ZT; 
defines a topological mapping (z,u) (2, 


If y=y(x;u) is substituted into the second of the functions (1), 
there results a function v = v(a,y(@;u)) =v(x;u) of (a,u) on D. The 
transformation 


(6) Te: u=U,v = 


is topological, since T, = T7,T;. This shows that the property (ii) of the 
topological mapping 7, implies that the inverse 7, = 7,” has the following 


property : 
(ii’) w—-2(u,v) is a strictly monotone function of v for fixed uw. 


Actually, the monotone function v(x;u) of x is decreasing, since 7, 
is orientation-preserving. For the fact that y(xz;w) is an increasing function 
of u shows that 7; is orientation-preserving and hence T,—=T,T; is also. 
But 7, is orientation-preserving if and only if the monotone function v(r; u) 
of x is decreasing. 

Let R denote a rectangle u,;SuSu, in D. Let R; and 
R, denote the (z, y)- and (u, v)-images of R under T; and T,., respectively. 
Thus PR; is a domain bounded by the two ares y= y= Y (3 U2), 
where S =», and two segments of the lines 2,, — 22, and is 
a similarly described domain. Hence, the areas of R; and R, are given by 
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the respective line integrals fy(x;u)dz, fv(x;w)du around the boundary 
of R. Since FR, is the image of R; under (1), the area-preserving character 
of 7, means that 

(7) fy(v; u)dx — v(x; u)du = 0, 


where the line integral is taken around the boundary of any rectangle f& the 
closure of which is in D. 
Since D is simply connected, there exists on D a function f(z, wv), unique 


up to an additive constant, satisfying 
(8) fe(z,u) =y(2z; fu(z, wu) =—v(@3u). 


Hence (2) is identical with (5), and (3) with (6). Clearly, f has the 
properties (a)-(8). 

In order to complete the proof of (1), the converse of the statement 
just proved must be verified. Let f(z,w) have the properties (a)-(y) on 
some domain D; so that T, and T,; are continuous, one-to-one mappings of 
D onto (a, y)- and (u,v)-domains, respectively. Let (1) be the mapping 
(x,y) > (u,v) given by T,;=T;T.". It must be shown that 7, has the 
properties (i)-(iv). That 7, has the property (i) is obvious; that 7, has 
the property (ii) is equivalent to the fact that f has the property (8) 
(cf. (2) and (5)); that 7, is orientation-preserving follows from the fact 
that (8) and (y) imply that 7 and T, are. 

There only remains the verification of the statement that 7, is area- 
preserving. By reversing the reasoning based on (7), it follows that 7, 
preserves the area of any domain of the type of R;. Since FR; is the image 
under 7’; (= T:) of a rectangle in the (2, u)-domain D, it is clear that the 
collection of sets R; forms a “ basis” for the additive set-funetion which is 
the area. Hence 7’ is area-preserving. This proves (1). 


2. Two lemmas. Condition (y) in (1) does not imply (8).° For the 
function f(z,w) =2*u has the partial derivative f,— 2°, which is an 
increasing function of x, and the partial derivative f, = 3x?u, which is not 
an increasing function of wu if 0. On the other hand, a weakened form 
of (y) does imply a weakened form of (£). 


LemMA 1. Let f(a,u) bé a function possessing continuous partial 
derivatives and let fu(x,u) be a non-decreasing function of x for fixed u. 
Then fr(x,u) is a non-decreasing function of u for fixed x. 


Proof. Let the segment r=-2,, u,; Suu, be in the domain D of 


definition of f(w,w). It will be shown that uw.) = fe(a1,u,). Let 
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(> 2,) be chosen so that the rectangle a, uy SuS is in D. 


Put 
Q =f (a1, U1) —f(@2, Ui) + f (x2, U2) —f (21, U2). 


The mean value theorem of differential calculus, applied to the function 
f (x2, u) —f(a,u) of u, shows that there exists a number u* satisfying 


Q = (fu(%2, u*) — u*)) (U2—u,) and < u* < 


The assumption that f, is a non-decreasing function of x implies that Q = 0. 
On the other hand 


0S Q/(#2— 211) > fe(@1, U2) — U1), AS > 


This proves Lemma 1. 
It was noted in the proof of (1) that if (1) is a continuous, one-to-one 


mapping and u(z, y) is an increasing function of y for fixed z, then, in the 
inverse mapping (4), the function z(u,v) is strictly monotone in v for 
fixed wu; cf. (ii’). A corresponding fact holds if u(z,y) is only monotone 


(not strictly monotone). 


LemMaA 2. Let (1) be a@ continuous, one-to-one mapping and let (4) 
be its inverse. If u=u(x,y) in (1) is a monotone function of y for 
fixed x, then x =2(u,v) tm (4) ts a monotone function of v for fixed u. 


Proof. Since it is assumed that (1) is a continuous, one-to-one mapping 
of an (zx, y)-domain onto a (u, v)-domain, the set of points u(2, y) = const. 
is either empty or is an open Jordan are. In the latter case, let = x(t), 
y =y(t), where 0<¢t< 1, be a parametric representation of this are. If 
two points (21, 4:1), (%, y2), where y2 > y;, are on this arc, then the entire 
segment 2,, 1s on the arc, since w is a monotone function 
of y for fixed z This shows that either dr(t) 20 or dzx(t) =0 for 
0<t<1. | 

Since (1) is a topological mapping, v—v(z,y) cannot have the 
same value at two distinct points of the are u—const. Hence either 


dv(x(t), y(t)) > 9 or dv(z(t), y(t)) <0 for O0< t<1. Thus, v can be 
used as a parameter on the arc uw = const. and z is a monotone function of v. 


This proves Lemma 2. 


3. Semi-Legendre transformations. There will now be proved a 
theorem dealing with a semi-Legendre transformation of the type occurring 


in (2) and (3). 
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(II) Let z=2(x,y) possess continuous partial derivatives on some 
(x, y)-domain and let z2(x,y) be an increasing function of x for fixed y; 
so that 


(9) Ty: p= y= 
is a one-to-one mapping (x,y) > (p,y). Then 


(10) g = tp —%, 


as a function of (p,y), possesses the continuous partial derivatives 


(11) Jp = Gy = — 


Proof. By virtue of 75, any segment p; S pS po, y = y, in the (p, y)- 
plane corresponds in a one-to-one manner to a segment 2%,l¢@%@l%, 
y=y, in the (z,y)-plane. For any function f of (p,y) or (2,y), let 
Af =f (pe 41) —f(pi, 41). Then Az = p*Az, where p* = 2z,(a*,y,) and 
< pi < p* < po. Hence 


Ag = piAz + — Az = + (p, — p*) Ar < 2,Ap, 
Ag = poAxr + 2,Ap — Az = 2,Ap + p*) Ar > Ap. 


Since p.— p, implies > 2,, it follows that Ag/Ap—> 2, as po —>p,. Thus 
gp exists and is 2. 

Let p = p, be fixed in such a way that p(z, y) = p, is a non-empty set 
of points (z,y). This set is a Jordan are with a parametric representation 
t=ax(y). Let (a1:,4:) and (2, y2), where y2 > yi, be two points of this 
are. For any function f, let Af y2) —f(p1,y:). Then Az can be 
written in the following two ways: Az = p*Az + q*Ay and Az = p°Az + qg°dy, 
where p* = yi), = Zy(%2,y*) and = y2), 9° = 2y(%, 9°), 
and x*, lie between x, and 2», finally y, <y*, < ys. Since py = 
= 2Z,(@2, y2) and since z,(x, y) is an increasing function of z, it follows that 
pi S p* and p, = p® if x. = x, and these inequalities are reversed if 7, = 2. 
Hence 

Ag = p,Av — Az = — g*Ay + (pi — p*) Az, 


Ag = pidr — Az = — + (pi — Az 


show that Ag/Ay is between — q* and --- g°. Consequently, g, exists and is 
—q=—~2z,(z,y). This proves (II). 


Remark. A part of assertion (II) remains true if z,(z,y) is a non- 
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decreasing (instead of an increasing) function of a for fixed y. In this case, 
(10) remains a function of (p,y). For if B denotes the set of points (p, y) 
which are images of points (z,y) under Ts, then the pre-image of a point 
(p*, y*) of B is a segment (open, closed, half-closed or a point) on the line 
y=y* in the (z,y)-plane. It is clear that g, as a function of (2, y), is 
constant on this segment, since z(, y*) is linear in x and 2,(2, y*) = p* is 
constant on that segment. Also, the first part of the proof of (II) shows that 


(12) 9 (P25 Y1) — 9 (Prs Ys) = (P2 — pr) 


holds for some point (x*, y,) on any segment of the line y = y, (in the (2, y)- 
plane) which is mapped onto the segment p, S pS po, y=y, under Ty 


The name semi-Legendre transformation for (9) is justified not only in 
the obvious manner but also by the following corollary of (II): 


(III) Let z=2z(2x,y) possess continuous partial derivatives. Let the 
mapping Ts: (x,y) > (p.y), given by (9), and the Legendre mapping 
Tis: > (p,q), gwen by 


(13) T 13: Zy(2, Y)> 


be one-to-one. Then the Legendre transformation Ty; ts the product 
Ti3 =T1T,,T, of the (one-to-one) semi-Legendre transformations given by 
(9) and 

(14) Lis? P= =— 


where g is defined by (10). 


In order to see that (14) defines a one-to-one transformation, it is 
sufficient to remark that g,(p,¥) =— q(=—%,) is a strictly monotone 
funnction of y for fixed p. For, in view of the assumptions that 7, is one- 
to-one and that p(z,y) is increasing with z, it follows that q is a strictly 
monotone function of y for fixed p; cf. the remarks preceding Lemma 2. 


4, Legendre transformations. The following theorem will be proved: 


(IV) Let z=2z(xz,y) possess continuous partial derivatives on some 
(x, y)-domain and let the Legendre transformation T,3: (2, y) (p,q) be 
one-to-one. In addition, suppose that either 


(15) 2z(z, y) is a montone function of x for fixed y 
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or 

(16) 2,(x,y) is a monotone function of y for fixed a. 
Then, as a function of (p,q); 

(17) h=ap+ yq—z 


has continuous partial derivatives; the transformation inverse to Ty, 1s the 


Legendre transformation 


(18) Ts: ©=hp(p, = ha(P, 9); 


finally, in the respective cases (15) or (16), 


(19) ha(p, 7) is a monotone function of q for fixed p 
or 
(20) hq(p, 7) is a monotone function of p for fixed q. 


Thus the relation z—>h is involutory. 


It is understood that the sense of the monotony in (15), (16), (19) and 
(20) is independent of y, x, p and q, respectively. It is also clear that if it 
is assumed that the monotony in (15) is strict, then the monotony in (19) is; 
if the monotony in (16) is strict, then (20) can be improved to “hp(p, q) 
is a strictly monotone function of q for fixed p.” 

A theorem corresponding to (IV) in which it is assumed that z is of 
class C? and that the Jacobian 0(p, q)/0(2, y) = Ze2%yy — Zay” is not 0 is 
classical. Also, if it is assumed that z is convex (instead of assuming merely 
that z, is monotone with respect to one of its arguments), then (IV) is 
known ([2]; for a similar theorem, see [3], pp. 365-366). 

The particular case of (IV) in which it is assumed that z,(7,y) is a 
strictly monotone function of a for fixed y is contained in (IIT). 

It will remain undecided whether or not (IV) remains true if the 
assumption on the monotony of z, and the assertion on the monotony of hg 
are omitted. 


Proof of (1V). The theorem (IV) will first be proved under the assump- 
tion (16); say, p= 22(2,y) is a non-decreasing function of y for fixed z. 
By Lemma 1, the function ¢ = 2,(2, y) is non-decreasing in x for fixed y. 


If the number p = 7, is so chosen that the set of (2, y)-points p(x, y) = px 
is not empty, then it is a Jordan are. Let J be a closed subare of this arc; 
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so that J has a parametric representation z = 2(t), y = y(t), where 4; St St, 
and dx(t) 20. It will be shown that 


(21) f pdx + qdy = dz, 
7 


where Az = 2(2o, —2(%1, 91) and for k —1,2. 

The are J is not stated to be rectifiable, but the line (or Stieltjes) integral 

(21) exists, since p is constant and q is a monotone function of ¢ along J. 
Consider a subare of J. For convenience, it will be supposed that the 

end points of this subare are also denoted by (21, ¥:), (2, y2), Where %, = 2. 

Then 

(22) Az = + q*Ay = + q°Ay, 


where p* = 22(2*, 41), q* = 2y(%2, y*) and p® = 22(2*, 41), 9° = %y(%1, 9"), 
with z, S 2*, x° = 2, and with y*, y° between y,; and y. 

Since 7’; is a one-to-one mapping, q(t) = z,(x(t), y(#)) is a (strictly) 
monotone function of ¢. For the sake of definiteness, suppose that it is an 
increasing function. On the subare under consideration, there is at least 
one value ¢* of ¢ satisfying y(¢*) = y* (and, of course, z(t*) = 2,). Since 
q(x, y) is a non-decreasing function of 2, it follows that qg* = q(t*). Hence 
g* =q(t*) Similarly, < q(tz). 

Thus (22) shows that, if Ay = 0, 


Az = p(t)Ar + + (p* — ) Az, 
Az S p(tz) Av + q(t2) Ay (p® — p(te) ) Az, 


and similar inequalities hold if Ay = 0. 
Consider a mesh on the are J and the corresponding inequalities for each 
subare of the mesh. If the corresponding inequalities are added, there result 


the inequalities 


Az = pdt + qdy + 0(1) + 0(1)3| 
J 


pdr + qdy + 0(1) +0(1)3| 
J 


where the 0(1)-terms refer to the degree of fineness of the mesh tending to 0. 
The first 0(1) in each of the inequalities is the difference between the integral 
and a suitably chosen approximating sum %(p,A*r + q,A*y). The second 


il 
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0(1) in each of the inequalities depends on the degree of continuity of 
p=%,(2,y). Since dx(t) = 0 implies that = | Ar | =a, —%, the relation 
(21) follows from the last two inequalities. 


An integration by parts, applied to (21), shows that 


Az = + yq) — (dp = 0); hence Ah = ydq (dp=0O), 


by the definition (17) of h. Consequently, 0h/dq exists and is y. 


Since the conditions on z;(z,y) and z,(z,y) are symmetric by virtue 
of Lemma 1, it follows that (21) holds if J is a closed subare on which 
q(x, y) =const. Hence dh/dp exists and is 2. 

Finally, if p, q are read in place of u, v in Lemma 2, it follows from 
(16) that 2 —h,(p,q) is a monotone function of g. Hence (20) is implied 
by Lemma 1. This proves the case (16) of (IV). 

Assume that (15) holds; for instance, that p = z,(a, y) is a non-decreasing 
function of 2 for fixed y. If J is a closed are on which p(z, y) = p,, then J 
has a parametrization = a(t), y= y(t), where StS and dy(t) 20. 
By an argument similar to that used above, it will be shown that (21) holds. 

First, the equations (22) hold, where z*, x° are between x, and 2, and 
where y; S y*, S yo. Since pi = p(X, = y2) and since p(a, y) 
is a non-decreasing function of x for fixed y, it follows that p* = p, = p® 
or p* = p, S p® according as 72 = 2, or Thus, if 


Az = p(t)Ar + q(t)Ay + (q* — Ay, 
Az S p(t.) Av + q(tz)Ay + (9° — q(te) ) Ay, 
and similar inequalities hold if 7, 2,. Since Ay = 0, it can be concluded 
that (21) holds. Consequently, 0h/0q ‘exists and is y. a 
Tt will now be shown that 0h/dp exists and is x. An integration by parts 
applied to the first term in (21) gives Az =A(pzx) + ( qdy (dp=0O). 
Since dy = 0, it follows that J 


(23) A (px —z) =— q*Ay (dp=0). 


where g* is the value of g =z, at some point of JJ. 
In view of the Remark following the proof of (IT), it follows by (12) 
that the function g = pr —z of (p,y) satisfies 


(24) A(px — = Ap, (dy =0), 
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where z* is the abscissa of some point of a line segment y = const. on which 
p varies from p, to p, (and Ap = p, — p,). 

Consider two points (21,41), (2, yz) of an are q(z,y) =q, and let 
Pi = p(t, Yi) < po = p(x2, y2). If the two points are sufficiently close 
together, they can be joined by a path consisting of an are on which p= p, 
and of a line segment on which y = ys, and also by a path consisting of a 
line segment on which y = y, and of an are on which p = p.. Correspondingly, 


(23) and (24) give 
A(ap—z) =— q*Ay + = v°Ap — 


Since y is a monotone function of q for fixed p (that is, dy = 0 and Ag+ 
on the ares p= p, and p=pz,), it follows that q, is between g* and 
Hence Ah = q,Ay + A(ap—z) is between and Consequently, 
0h/dp exists and is 2. 

Since (19) follows from (15) and Lemma 2, this completes the proof 
of (IV). 
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ON THE NON-PERIODICITY OF THE ZEROS OF THE 
RIEMANN ZETA-FUNCTION.* 


By C. R. Putnam. 


In the half-plane « > 0 of the complex (z =o + ?f) plane, the Riemann 
Zeta-Function, £(z), can be represented as 


(1) =2 j ; 
7 0 
cf., e.g., [1], p. 14. The following theorem will be proved: 
(*) The sequence of the consecutive positive zeros of €(4-+ it) does 
not contain any periodic subsequence. 


Here, by a periodic sequence is meant any arithmetical progression of 
the form 
(2) d, 2d, (d>0). 
(Needless to say, the assertion of (*) remains valid if the word “ positive ” 
is replaced by “negative.”) The theorem states that every sequence (2) 
contains an infinity of elements which are not the imaginary parts of zeros 


of £(z) on the critical line o-=}. It should be noted that £¢(4-+ it) 
possesses an infinity of positive zeros << (Hardy); ef. [1], p. 214. 
Moreover, one has even lim inf | tsi — ty | =0,as k-—>oo. This follows from 


Selberg’s result that V (7) > const. T log T, where “ const.” denotes a positive 
constant and V(7') denotes the number of zeros of (4 -+- i/) on the interval 
0<=tST; see [1], p. 226. 

Actually, the theorem (*) holds if the role of the line o = } is played 
by any line o ~const., whether or not the Riemann hypothesis is assumed. 
However, the assertion of (*) in this case is an easy consequence of the 
relation Ns(7) = o0(7) (Bohr-Landau), in which S > $ and Vs(T’) denotes 
the number of zeros o + it of £(z) in the rectangle SSoZ1,0StST. 


Proof of (*). Another expression for £(z), easily deduced from (1), 
is given by 


(3) f(z) =2(z—1)1—2 g(a) ec ttedz, 
e 0 


* Received July 11, 1953. 
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where, for n= 1, 2,-- -, 
(4) g(x) = (e*— [e*])e-* for lognS 2 < log(n+ 1). 
If z=o+ it is a zero of £(z), it follows from (3) that 


(z—1)? = 
e 0 


and hence, 
(5) 9(w)e0s (te) da = (9 —1)/((o 1)? + #). 


In order to prove (*), suppose, if possible, that the assertion of the 
theorem is false; then there exists some number d>0 such that, for 
n =1,2,:--, the numbers z = 4-+ idn are zeros of £(z). Henceforth, it 
will be supposed that o 4. Next, extend the domain of definition of g(z) 
to —w< <o, by putting g(— 7) g(x), 0S 4 and define G(z) 


on —o< hy 


(6) — > 4 2nk/d). 


In view of (4), one has | g(x)| S e4l*!, —o < # <oo, and it is clear that 
the series (6) is uniformly convergent on any finite interval. Moreover, 
G(x) is periodic with period 2x/d, so that G(a + = G(x). In view 
of the relation 


G (x) eidne dy J g (x) 
e 0 


(see [2], p. 37), it follows from (5), where now o + it = 4-t idn, that, 
for n=1,2,---, 


2a/d 
J G(x) cos(dna) dx = — (4 + 
0 


and 
* 


G(x)sin(dnz) dz = 0. 
0 


Thus, the Fourier coefficients of G(x) are O(n-*), hence o(n-'). Therefore, 
the function G(x) cannot possess a simple discontinuity; [2], p. 28. 

Let h>0. Since g(a+0) (=limg(x+h) as h>+ 0) exists for 
all x, it follows from the uniform convergence of the series (6) that 
G(z+0) exists; similarly G(a—0) exists. Next, it will be shown that 
G(« +0) —G(x—0) <0 for at least one value of 2, in contradiction 
with the assertion at the end of the last paragraph. 
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In order to show this, let = log m, where m = 2 is an integer to be 


determined later. Then 


G(log m + h) — G(log m — h) = g(log m + h) — g(log m — h) 
+ (log m + 2ak/d + h) —g(log m + 2rk/d —h)]. 
ko 


Since, by (4), g(logm +0) —g(logm—0) <0, and since the jumps 
g(« + 0) —g(«—0), which occur only at the values x = + logn, n=2, 
3,° °°, are positive or negative according as x = —logn or x= log n, it is 
clear that it is sufficient to show that m can be chosen so that 


(8) log m + 2xk/d ~— log n 1, + +). 


(If (8) holds, then none of the terms of the summation of (7) can make 
a positive contribution as h—>-+ 0.) Thus, it is enough to show that, for 
some integer m = 2. 


(9) 2ark/d ~log(mn) holds for k= 1,2,--- 


Suppose though, if possible, that (9) fails to hold for every integer 
m=2. Then, if m=m;=2(j—1,2) denote arbitrary integers, there 
exist integers k =k; = 1, n=nj=22 such that 2rk;/d =log(mjnj) and 
hence (m2n2)* = (m,n,)*. However this last equality must fail to hold, 
a contradiction, provided only that m, and m, are chosen so that mz is 
relatively prime both to m, and to n, (note that n, depends only on m,). 

Hence (9) must hold for some integer m =m’ =2 and therefore 
G(x + 0) —G(x— 0) < 0 for r= log m’. As pointed out above, this leads 
to a contradiction and the proof of (*) is now complete. 


PURDUE UNIVERSITY. 
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A GENERAL THEORY OF RADICALS.* 
II. Radicals in rings and bicategories. 


By S. A. AmItTsuR. 


In the introduction of part I ([1])* we have pointed out the relation 
between some of the well known radicals of rings and properties of rings 
which are invariant under homomorphisms (compare [3], Theorem 2). The 
object of the present paper is to give an axiomatic foundation of these radicals. 
If z is a property of rings, we generally mean by a z-radical of a ring Ra 
unique maximal ideal N of R possessing the property 7 and such that the 
quotient R/N does not contain non-zero z-ideals. The main properties which 
a radical is generally required to satisfy is: the existence in every ring; the 
radical of an ideal A should be the intersection of A and the radical of the 
whole ring; the radical should contain also one-sided ideals; the radical of 
a matrix ring over a ring R should be the matrix ring over the radical of R. 
Necessary and sufficient conditions for these results to be satisfied by a property 
m7 are given, among which the most important is the fact that + should be 
invariant under homomorhpism. If only this condition is imposed on 
(and another of far less importance) one can obtain generalizations of Baer’s 
Lower Radical ([5]). Some methods of constructing properties which yield 
radicals from non-yielding-radicals properties are given in section 7 and 8. 
This method will be applied to prove some new results about some known 
radicals, in particular Baer’s Radical. 

The most important tools in studying this homomorphic-type radicals 
are undoubtedly the main two Isomorphism Theorems. This is the reason 
why the whole theory (except some particular ring-structure problem, e. g. 
radicals of matrix rings) can be developed in quite a general class of bicate- 
gories, namely : the lattice ordered bicategories, in the sense of MacLane ([7]), 
which contain a zero element and which satisfy some additional axioms. In 
particular, the theory will hold for a big class of abstract algebras containing 
rings (associative or non-associative), semi-groups, groups, loops, ete. The 


* Received September 10, 1952; substantially revised March 7, 1953. 
* The results of part I ([1]) will be quoted by the number I followed by the nota- 
tion of the result quoted, e. g. definition I. 1 mean definition 1 of the paper [1]. 
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greatest part of the present paper deals only with rings (not necessarily asso- 
ciative) since the main applications will be for the theory of rings. Never- 
theless, in section 8 we give a list of conditions to hold in bicategories so that 
the theory developed in the preceding section will hold for them. 

The last section is devoted to the study of the relation between the 
theory of radicals as developed by B. Brown and N. H. McCoy in [3] and [4] 
and the present theory. The type of radical dealt with by these authors con- 
stitutes an important class of the radicals whose theory is developed here. The 
incorporation of their theory and the present theory yields a slight extension 
of the results of [3] but nevertheless it makes use of Zorn’s lemma which 


was avoided in [3]. 


1. HI- and RI-properties. In the present section we restate some of 
the main results of part I ({1])! in the language of ideals and homomorphisms 
of rings. This is obtained by applying the results of that paper to the lattice 
of all ideals of rings. These restated results are needed for further appli- 
cations and for a simplification of the theory developed in the preceding 
paper when applied to rings. 

Let z be a property of ideals in rings. By a z-ideal we mean an ideal 
which possesses the property z. 

The sequence of ideals VU, = U)(f) is defined as follows: 


1) IF — 0. 

2) Uy, = for limit ordinals 

3) For non limit ordinals A, U) is the union of all the ideals A of R 
such that A/U)_, is a z-ideal in R/U _,. 


The ideal U) is known as the A-th (z-) radical of R The upper r-radical 
U(R,7r) =U(R) of F is defined to be the limit ideal of this sequence ; 
namely the minimal ideal UV) such that U, = U),,. The following notions 


are fundamental: 
m-semi simple: a ring R which does not contain non-zero z-ideals. 


7-ideal: an ideal P in a ring R for which the quotient R/P is z-semi 
simple.” 


a-radical: an ideal N in a ring R which is both a z- and a 7-ideal in R. 
That is, N is a z-ideal and the quotient R/N is z-semi simple. 


? These ideals correspond to radical ideals in the sense of Baer [5]. 
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A property z of ideals will be called an HI-property if w satisfies the 
following two conditions: 


(1A) The zero ts a z-ideal. 
(1B) Every homomorphism maps z-ideals onto x-ideals. 


For a property z of ideals, we define a relation z in the lattice of all 
ideals of a ring R as follows: let A and B be two ideals in Rk. Using the 
notation of part I ([{1]), we write AaB if AB and A/B is a z-ideal in 
R/B. Since the lattice of all ideals of a ring is known to be complete, the 
results of part I ([{1]) can be applied to this lattice in view of the following 


fundamental lemma: 


Lemma 1.1. Jf x ts an HI-property then the relation x defined in the 
lattice of the ideals is an H-relation in the sense of definition I. 1.1 


Indeed, conditions IJ. (A) and I. (B) of [1] are evidently satisfied. To 
prove I. (C), we consider three ideals A, B, C in RF such that AwB and 
CB. The homomorphic mapping of 2/B onto R/C maps A/B onto 
AU C/C; hence by (1B) it follows that A U C/C is a z-ideal in R/C. Thus 
(AU C)rC, q.e.d. 

The application of Theorems I. 1.2, I. 1.1, I. 1.3, and Corollary I. 1.4 
yields 


THEOREM 1.1. Jf ~ is an HI-property then the intersection of any set 
of x-ideals of a ring R is a 7-ideal; the intersection of all 7-ideals of R, which 
ts the minimal 7-tdeal, 1s exactly the upper z-radical U(R) of R. Further- 
more, if a m-radical N of R exists, then N=U(R) and N is uniquely 
determined as the maximal x-tdeal of R. 


In the latter case N is called the z-radical of R and will be denoted 
by a(R). 


Properties for which the z-radical always exists must satisfy: 


(1C) For every ideal A ina ring & which is not a w-ideal, there exists 
an ideal BCA such that there are no non zero x-ideals in R/B contained 
in A/B. 


Properties which satisfy the three conditions (1A), (1B) and (1C) will 
be called Ri-properties, and for these properties we have by Theorem I. 2.1: 


THEOREM 1.2. Let x be an HI-property, then a necessary and sufficient 
condition that the x-radical exists in every ring is that x be an RI-property. 
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As a consequence of Theorem I. 2. 2 we obtain the following fundamental 
result concerning #J-properties: 


THEOREM 1.3. The union of any set of x-ideals ts a w-ideal. 


The conditions of Theorem I.2.3 can be simplified in the present 
application as follows 


THEOREM 1.4. If x is an HI-property then the following two conditions 
are equivalent to (1C) 


(1C,) If ADB are ideals in R such that B is a z-tdeal in R and A/B 
is a 1-ideal in R/B, then A is also a r-ideal in R. 


(1C.) If A,C ts @ non decreasing well ordered sequence 
of x-ideals then the union U A, is also a z-ideal.* 


This theorem is a consequence of Theorem I. 2.3 since (1C,) implies 
I. (D,) and (1C,) implies I. (D.). Indeed, if AaB and BC in the lattice 
of ideals of R then, by (1B), (A/C)z(B/C) in the lattice of ideals of R/C 
since (A/C)/(B/C) =A/B. Applying now (1C,) to the ring R/C we 
obtain A which proves J. (D,). Similarly, if BC A,C A,C:-:: isa 
sequence of ideals in & such that A, zB then applying (1C,) to the ring R/B 
we see that U(A,/B) is a z-ideal. Now, U(A,/B) = (UA,)/B. It follows, 
therefore, that (U A,)aB which proves I. (D2). 

The homomorphism invariance of z immediately yields the following 


results: 


THEOREM 1.5. 1) Let & be an isomorphism between R and R®, then # 
induces a one to one correspondence between the z-ideals of R and R?. 


Hence, 

2) The isomorph ic images of x-semt simple rings are x-semt simple. 
We are now able to prove: 

THEOREM 1.6. A subdirect sum of x-semi simple rings is x-semi simple. 


Indeed, if R is a subdirect sum of z-semi simple rings {R,} then 
01 A, where A, are ideals in R such that R, = R/A,. From the pre- 


In the proof of the equivalence of these conditions the well-ordering theorem is 
used. One can use Zorn’s lemma instead by replacing (1C,) by the requirement that 
the union of every linear system of z-ideals is a 7-ideal. 
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ceding theorem it follows that A,7#R#* in the lattice of ideals of Rk. Hence, 
by Lemma I. 1.3 it follows that which evidently means that P is 
m-semi simple. 

Thus, Theorem J. 1.5 yields: 


TuroremM 1.7. Jf the kernel 0 of the homomorphic mapping of R onto 
29 is contained in the upper n-radical U(R), then U(R®) =U(R)?. 


In particular, if # is an automorphism we obtain: 


Coronary 1.1. U(R) is invariant under the group of automorphisms 
of R. 


Example. For associative rings, properties like regularity, strong regu- 
larity, and z-regularity ° are HJ-properties. The property of quasi-regularity 


is an example of an #/-property. 


2. Ring properties. ‘The rest of this paper deals with a less general 
class of properties. We consider henceforth only properties of rings (not of 
ideals in rings!) ; thus nilpotency of associative rings is such a property. 

By a x-ring R we shall mean a ring #& which possesses the property z. 
A ring-property 7 is turned into a property of ideals in rings by defining a 
a-ideal to be an ideal which is also a z-ring. 

In contrast with the properties of ideals considered in the preceding 
section, the ring properties z have the characteristic that if A is a z-ideal 
in some ring # then A remains a z-ideal in every ring containing A as an 
ideal. This fact yields some modification of the conditions of the preceding 
section. 

For the convenience of the reader we give here a list of all conditions 
which will be used in the present paper: 


(A) The zero is a z-ring. 
(B) Homomorphic images of z-rings are z-rings. 


(C) Every non z-ring is homomorphic to a non zero z-semi simple ring.® 


‘ The notation 7 is used here in the sense of section I.1. That is: BrA means that 
A D> B and the quotient A/B does not contain non zero z-ideals of R/B. 

5 For definition see e.g. I. Kaplansky, “ Topological representation of algebras II,” 
Transactions of the American Mathematical Society, vol. 68 (1950), p. 67. 

* This amounts to the fact that every ring which is not a 7-ring properly contains 
a 7-ideal. 
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(D) All ideals of a z-ring are 7-ideals. 

(E) All ideals of a 7-semi simple ring are z-semi simple rings. 

For simplification we introduce the following names: properties which 
satisfy (A), (B) will be called H-properties (homomorphism invariant 
properties). If (A), (B), (C) are satisfied, the property will be called an 
R-property (a radical-property). H-properties which satisfy (D), (E) will 
be called SH-properties. If all conditions are valid, the property will be 
called an SR-property. 

The following two conditions will be seen to be equivalent to (C): 

(C,) (Transitivity) If A is an ideal in R such that R/A and A are 
a-rings then is also a z-ring. 

(C.) If 4, C A.C: is a well-ordered sequence of z-ideals then 
U A, is a 7-ideal. 

For problems relating to one sided ideal and subrings we shall need the 
following conditions (section 3): 


(D,) Every right ideal of a z-ring is a z-ring. 
(F,) -semi simple rings do not contain non zero right z-ideals. 
(Ds) Every subring of a z-ring is a 7-ring. 


(F,) 7z-semi simple rings do not contain non zero z-subrings. 


For determining the radical of a matrix ring R, over a ring R, we shall 


need the following conditions (section 4) : 


(G,) If # is a z-ring then so is R,. 


(G.) If R is z-semi simple then so is R,. 


Two more conditions of a less general nature will appear in sections 6 
and 8. 

The classes of H-properties and R-properties defined above are subclasses 
of the HI-properties and Rl-properties respectively as will be shown in the 
following lemma: 


LEMMA 2.1. 1) An H-property 1s an HI-property. 
2) R-properties are also RI-properties. 


Before proving the lemma, we note that (C,) is readily seen to be 
equivalent to condition (1C,) of the preceding section, and (C.) is exactly 
the condition (1C.). 
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The proof of (1) is evident. To prove (2) we shall show that if 7 is 
an H-property, then a necessary and sufficient condition for the existence of 
the z-radical is that 7 satisfies (C); and thus the proof of (2) will follow 
by Theorem 1.2. To this end we observe that if z is an H-property whose 
x-radical always exists then the quotient ring modulo this radical satisfies 
(C) ; and hence z is an R-property. Now suppose that 7 is an A-property. 
Let A > B be two ideals in a ring R such that A/B is a a-ring in R/B and 
B isa n-ring in R. Consider A as a ring; if it were not a z-ring then, by (C), 
A would properly contain a z-ideal D. Since B is a z-ideal it follows, by 
Lemma I. 1.2, that D> B. In view of the fact that (A/B)/(D/B) = A/D 
it follows that D/B is also a 7-ideal in A/B; but A/B is a z-ring, hence 
DA. This is a contradiction, which proves that w satisfies (1C,). To 
prove that x satisfies (1C.), we consider a well ordered non decreasing sequence 
of z-ideals A, C A,C-+:+:. If A=U A, ts not a z-ideal, then considering 
A as a ring we would obtain, by (C), the existence of a 7-ideal D properly 
contained in A. Since A, is also an ideal in A, it follows by Lemma I. 1. 2 
that DA, Hence DU A,=—A which is a contradiction. The rest of 
the proof follows now immediately by Theorem 1. 4. 

An immediate consequence of this theorem is that H-properties zw yield 
lower z-radicals and #-properties yield z-radicals, which have the properties 
of the radicals of the preceding section. 


The ring-properties satisfy also the following theorem: 


THEOREM 2.1. Jf z is an H-property, then U(S) D U(R) for every 
subring S which contains U(R). 


The proof will follow by showing inductively that U(S) > U)(R), where 
U,(R) is the A-th z-radical of R. Indeed, since 0CS, U(S) U,(R). 
Suppose U(S) ~ U,(f) for every y << A. For a limit ordinal d, it is evident 
that this implies that U(S) > U)(R). 

If A is a non limit ordinal and P/U),(P) is a z-ideal in R/U)_,(R), 
then since SO U(R) DU)(R) DP, is also a -z-ideal in 
S/U)1(R); hence U(S) > P. By the definition of U,(R) it follows now 
that U(S) 2 U)(R). 

In particular, if S = U(R) we have: 


2.1. U(U(R)) =U(R). 


Consider now the set of all subrings of a ring R. This set is turned 
into a complete pseudo-lattice, in the sense of definition I. 4, by defining a 


of 
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relation: A = B, between subrings, if B is an ideal in the subring A.’ The 
intervals [B, A] of this complete pseudo-lattice can be indentified with the 
complete lattice of the ideals of the quotient A/B. Let w be an H-property ; 
as before, this property determines a relation a in this pseudo-lattice. 
Condition I. (C,) of section I.3 is readily seen to be a consequence of the 
first isomorphism theorem and Theorem 1.5. Thus z determines an H- 
relation in the complete pseudo-lattice. Stating Theorem I. 3.1 for the 


present case we have 


THEOREM 2.2. Let « be an R-property. A necessary and sufficient 
condition that r(A) =A x(k) for every ideal A is that w be an SR- 
property, i.e. should satisfy the five conditions (A), (B), (C), (D) and (E). 


The first isomorphism theorem together with (B) imply that SR- 
properties satisfy the conditions of Theorem |. 4.1. Hence, 


CorOLLARY 2.2. The mapping A—>1(A), where r(A)/A ts the x-radical 
of R/A, is an idempotent meet-homomorphism of the lattice of ideals of R. 


Another result is that H-properties which satisfy (D), satisfy the con- 
ditions of Theorem |.5.2. Thus, each of these properties determines a 
dual radical. 

We conclude this section with proving the necessity of the conditions 
(A)-(E) for the existence of the radical. That is: 


THEOREM 2.3. Let be an tsomorphism invariant property: of rings. 
Then: 

1) If every algebra R contains a maximal x-ideal r(R) such that 
is the minimal ideal P in R with the property that R/P is a-semi simple, 


then w+ is an R-property. 


2) If the preceding ideal x(R) satisfies: r(A) =AM 2(R) for every 
ideal A ina ring R then x is an SR-property. 


Proof. Since z(0) = 0 for the zero ring, it follows that zm satisfies (A). 
To prove (B), let R be a z-ring and let R’ be a homomorphic image of R. 
Denote by N the origin of x(k’) in R. Note that the isomorphism invariance 
of + implies that isomorphic images of z-semi simple rings are also w-semi 
simple. Applying this fact to the case R/N = R’/r(R’), where the latter is 
m-semi simple by the condition of the theorem, we obtain that R/N is r-semi 


7 This is the pseudo-lattice given as an example in section I. 3. 


=] 
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simple. Hence, by the assumption of the theorem, V  7(f). Since # isa 
a-ring and +(/) is the maximal z-ideal of R, 7(R) = R = N, which proves 
that N’ =7(hk’) =f’. Thus FP’ is a z-ring and (B) is proved. By now x 
has been proved to be an H-property, hence by the proof of Lemnma 2. 1 it 
follows that + is an f-property. The rest of the theorem follows now 


immediately from Theorem 2. 2. 


3. Subrings and one-sided ideals. ‘The present section deals with the 
relation between the upper z-radical +(#) of a ring FR and its one sided 
ideals. We shall deal only with right ideals. 


THEOREM 3.1. If x is an H-property of rings, then the upper x-radical 
U(R) of every ring R contains every right x-ideal of R if and only tf x 
satisfies : 


Every z-semi simple ring does not contain non zero right x-ideals, 


Put U=U(R). Let J be any right z-ideal of R. J/J MU is a homo- 
morphic image of J, hence it is a z-ring. This immediately implies that 
JUU/U is a z-ring, and, therefore, a right z-ideal in R/U. The z-semi 
simplicity of the latter and (F,) shows that U > J. 

The necessity part of this theorem is a simple consequence of the appli- 
cation of the result to the case U(R) = 0. 

Another result for these properties is: 


THEOREM 3.2. If a is an R-property satisfying (F,) then the union of 
any set of right x-ideals is also a right x-ideal. 


Let {J,} be a set of right z-ideals of R. Put J =U J,; each J, is also 
a right z-ideal in J, hence This implies r(J) D U J, =J which 
evidently means that J is a z-ring. 

Thus, 


3.1. If is an R-property satisfying (F,) then is the 
unique maximal right x-ideal of R. 


The same methods yield similar results for left ideals under a similar 
Condition (F)) imposed on left ideals. Furthermore, the same methods yield 
similar results for subrings. Namely: 


THEOREM 3.3. 1) If « ts an H-property which satisfies: 
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(F,) Every x-semi simple ring does not contain non zero a-subrings, 
then U(R) contains every a-subring of R. 


2) If x is an R-property of this type, then the union of every set of 
a-subrings is a w-subring and is the maximal x-subring of R. 


In some cases the following is useful in proving the validity of (F;). 
Denote by 2(R) the ring of all linear transformations of R: aj: 7— ax, 
ae R. Then: 


LemMa 3.1. A property x satisfies (F,) tf it satisfies the requirement 
that for any right x-ideal J in R, the two sided ideal 2(R)J ts a w-rdeal. 
This requirement is also necessary if x is assumed to satisfy (D). 


Indeed, if the requirement of this theorem is satisfied and R is z-semi 
simple, 2(R)J —=0. This means that J is two sided, and since it is a z- 
ideal, it follows that J = 0, q.e.d. The necessity part follows immediately 
by (D) and by the fact that 7(R) DJ implies r(R) 2 Q(R)J. 


4. The radical of matrix rings. The present section deals with the 
relation of a radical of a ring FR and the radical of a matrix ring over R. We 
use the notation FR, for the ring of all square matrices of order n on R, and 
R, for the ring of finite-rowed infinite matrices (or finite columns). We prove: 


TrroreM 4.1. Let x be an H-property of rings. Then U(R,) = U(R), 
for every ring R if x satisfies the following conditions: 


(G,) If R is a a-ring then so is Ry. 


(G.) If Ris a x-semi simple ring then so is also Ry. 


Proof. Consider- the sequence U, = U)(R) of the A-th z-radicals. Put 
Uyn = (Uy)n. Our first step in the proof is to show inductively that 
U(R),»> Uy. For A=0, this is evident. If A is a limit ordinal, 
U(R), > Uy» is a simple consequence of the induction and of the fact that 
Dyn = n = Let be a non limit ordinal and assume 
that U(R) DU)... If P is an ideal in R such that P/U)_, is a z-ideal, 
then by (G,) it follows that (P/U)-_,), is a z-ring. Since z is an H-property 
this implies that P,,/U)., » is a w-ideal in R,/U)_, »3 hence, by Theorem 1. 7, 
it is readily seen that U7 (R,) © P,. This being valid for every such P yields 
that U(R,) DU = (U P)n =U yn, q.e.d. It follows now by Theorem 
1.1 that U(R,) Since R,/U(R), = (R/U(R))n and R/U(R) 


= 
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is x-semi simple it follows by (G,) and by Theorem 1.1 that U(R), > U(R),. 
Thus U(Rn) = U(R)n. 


Remark 4.1. If is an R-property then (G,) and (G2) are also neces- 
sary conditions for the validity of the preceding theorem, since then (G;) 
is the assertion of this theorem for 7(R) =F and (Gz) is the assertion for 
a(R) =0. 


As an immediate consequence of (G,) and of the fact that 7(#) n = 7(R,) 
= R, for z-rings R, we obtain: 


CoroLtiary 4.1. If 2 is an R-property which satisfies (G,) and (G2) 
then R, is a x-ring tf and only if R is a x-ring. 


The converse of this corollary is generally not true. But suppose 7 is 
an R-property which satisfies (D) and the condition of the preceding corollary ; 
then (G.) must hold. If not, there exists a 7-semi simple ring # such that 
R,, contains a non zero z-ideal A. Consider the ideal in R, generated by the 
matrices p(aq), (pa)q, p,qgeR, and ae A. This ideal is contained in 4, 
hence by (D) it is a z-ideal. One readily observes that this ideal equals P,, 
where P is the ideal in R generated by the elements: r(st), (rs)t, where 
t,r eR and s range over all the elements of R appearing in the matrices of A. 
Since # is z-semi simple it follows by the condition of the corollary that 
P=0. In particular this yields: 


THEOREM 4.2. If aw is an R-property which satisfies (D) and the 
condition that a x-semi simple ring R cannot contain a non zero ideal P 
such that R(PR) = (RP)R = 0, then (G,) and (G.) are equivalent to the 
condition that R,, 1s a x-ring tf and only if R is such. 


Note that in the whole section no use was made of the relation between 
the two operations of the rings. Hence the preceding results hold also. for non 
associative and non distributive rings. 


5. Dependence of conditions. The conditions listed in section 2 are 
generally independent, but not if one restricts oneself to particular classes of 
rings. The object of the present section is to show that in the case of 
associative rings condition (E) is superfluous. 


Lemma 5.1. Let w be an R-property of rings (not necessarily asso- 
ciatwe). If Q is an ideal in a x-semi simple ring R such that Q? =0 then 
Q is also r-semi simple. 


my 
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Proof. Denote by {(f) the ring of all transformations of the additive 
group of R generated by the identity and by the transformations a,: x— za, 
a: «—>ax, ae R. Suppose that Q contains a z-ideal P in Q. For every 
re X(R) the totality of the elements pr,peP constitutes an additive sub- 
group of Q. We note that since Q? 0, the ideals of Q are its additive 
subgroups. Now the mapping p— pr defines a homomorphism of the additive 
group P onto Pr and, evidently, this is also a homomorphism of the ideal P 
onto the ideal Pr; hence condition (B) implies that Pr is also a z-ideal in Q. 
By Theorem 1.3 it follows that the union P* of all the ideals Pr, re Z(R), 
is a 7-ideal in Q and, therefore, it is also a w-ring. The set P* is readily 
seen to be an ideal in R. Thus the z-semi simplicity of R implies that P* = 0 
which proves that P = 0, q.e.d. 


THEOREM 5.1. Jf x is an R-property of associative rings which satisfies 
condition (D), then x satisfies also (FE), 1. e. 1s an SR-property. 


Proof. Let R be a x-semi simple ring and let P be an ideal in R. If Q 
is a 7-ideal in P then PQP C Q, and it is a z-ideal in R by (D). By the z-semi 
simplicity of R it follows that PQP =0. Let P,= {«|reP, Px = 0}; then 
P, is an ideal in R such that P? =0. Since QGP CQN P,CQ, it follows 
by (D) that Q/N P, is a z-ideal in P,. 

Thus the preceding lemma implies that P,NQ=—=QP=0. _ Let 
P, = {x | ae =0} then P; is an ideal in R such that P?,=0. Since 
Q C P, the preceding argument yields that @ = 0 which proves (E). 


Remark 5.1. If one restricts oneself to associative rings possessing 
operators and considers only admissible ideals, then the same proof is valid 
also in this case, with the additional remark that in the proof of the first 
lemma one should note that if P is admissible each of the ideals Pr and also 
P* is admissible. In the proof of the preceding theorem, one has to note 
that together with P and Q, also POP, QP, P, and P, are admissible ideals. 


Remark 5.2. At the end of this paper we point out algebraic structures 
to which the theory of radicals can be extended, except for the parts which 
depend on ring operations. It is worth remarking at this stage that the 
theory can be developed without any changes for semi-groups (with or without 
operators) and in particular this section will hold for associative semi-groups. 


6. Z-properties. Many of the properties of rings which yield radicals 
are properties which satisfy the following condition: 


(Z) very zero ring, i.e. a ring R such that R? = 0, is a a-ring. 
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A great deal of the theory can be simplified if (Z) is assumed. This is carried 
out in this section. 

H-properties and R-properties which satisfy (Z) will be referred to as 
HZ-properties and RZ-properties respectively. 

Condition (Z) itself can be chosen as a definition of a new ring property 
in the following way: a ring F# is a g-ring if R?=0. This property is 
evidently an HZ-property and yields, therefore, an upper ¢-radical. For 
asociative rings this radical is well known as Baer’s Lower Radical defined in 


[5]. Additional results on this radical will be given in a subsequent paper. 


THEOREM 6.1. Jf a is an HZ-property, then the upper x-radical 
U(R) 


This is an immediate consequence of Theorem I. 1. 6._ 

One readily verifies that the converse of the preceding theorem holds for 
R-properties That is: if for every ring then is an 
RZ-property. The RZ-properties satisfy the following famous property of 


radicals. 


THEOREM 6.2. Jf a is an HZ-property of associative rings, then an 
element re R belongs to U(R) tf and only if RrR CU(R). 


Proof. Let Q={x|reR,RrR CU(R)}. Then Q is an ideal in R 
such that @? C U(R#). The natural homomorphism of R onto R/U(R) maps 
@ on a nilpotent ideal of R/U(R). By (Z) it is readily séen that x-semi 


simple rings do not contain nilpotent ideals, hence Q C U(R). 


The property (Z), when satisfied together with conditions (D,) and (F,) 
of section 4, yields the existence of the radical of a matrix ring: 


THEOREM 6.3. Let x be an RZ-property which satisfies (D,)* and ee 
then = 


Proof. Let R be a a-ring and let {ci} be a matrix unit base of R,. 
Denote by R™ ‘the ring of all one rowed matrices: + ° + Trin}. 
This ring contains the two sided ideal P of all the matrices rzc,;. + +--+ Tnein, 
and P? = 0. The quotient = Rc,, and the latter is isomorphic with R. 
Since Ff is a z-ring and P is a z-ideal, by (Z), it follows by (C,) that R@ 
also a z-ring; hence R™ is a right z-ideal in R,. Similarly each of the eis 
ideals R® = ++ t= is a right z-ideal in R,. 


° By Theorem 5.1 it follows that m is also an SRZ-property. 
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Since R, is the sum of all these ideals, it follows by Theorem 3.2 that R, is 
a m-ring, which proves (G,) of section 4. Conversely, let R, be a a-ring. 
sv (D,) it follows that the right ideal R® is a w-ring. As it has already 
been pointed out the latter is homomorphiec with A, hence by (B), F is also 
a-ring. The theorem follows now immediately by Theorem 4.1 in view of 
the facts that RZ-properties satisfy Theorem 4.2 and that (D,) implies (D). 

One readily observes that the theorems used above do not require the 


finiteness of n and we may similarly obtain: 


THEOREM 6.4. Jf x is an RZ-property which satisfies (Dr) and (F;) 
then +(R);=2(R;), where, e.g. Ry denotes the ring of all finite matrices 
over R or the ring of all finite rowed matrices over R. 


THEoREM 6.5. Let 2 be an RZ-property of associative rings which 
satisfies (D,) and (F,). Then the z-radical of a right ideal J is the set 
Jp={x| red, Cr(R)}. 


Proof. Evidently J, is an ideal in J. Since JJ,J Ca(R) it follows by 
(D,) that JJ,J is a right z-ideal; hence JJ,J Cx(J). It follows now, by 
Theorem 6.2, that J, Co r(J). Now is a two sided ideal in J, hence 
by (D) we obtain that r(J)J is a z-ring. This means that x(J)J is a right 
r-ideal in R, which implies by Theorem 3.1 that x(J)J hence 
Cd,, thus =J;,. 


In particular, if R is z-semi simple we obtain: 


CoroLuaRy 6.1. The x-radical of a right ideal in a x-semi simple ring 


is its left annthilator. 


A short cut in the proof of existence of the z-radical can be obtained 
for properties of associative rings which satisfy (D) with the aid of the 


following theorem: - 


THEOREM 6.6. Jf a is an HZ-property which satisfies (D) and the 
condition that every non a-ring is homomorphic to a non zero ring which 
does not contain non zero right w-ideals, then a is an RZ-property which 
satisfies (F;). 


Evidently the condition of this theorem implies the validity of (C) 
which proves that + is an RZ-property. To prove (F,), let R be a w-semi 
simple ring and let J be a right z-ideal in R. If J ~0, Jo = (J, RJ) is a 
non zero ideal in R which is not a z-ideal. Therefore, J) is homomorphic 
with a non zero ring Jo/Q which does not contain right z-ideals. Thus 
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JCQ. This vields J*, = (J*,JRJ, RJ*, (RJ)*) CQ, since JoJ CQ and 
(RJ)? C (RJ)JI CQ. Hence is a a-ring. Contradiction! 


7. Constructing new properties. Let z be any property of rings. The 
only restriction imposed is that the zero ring should be a z-ring, namely, 
that x satisfies (A). This property gives rise to an H-property by defining: 


DEFINITION 7.1. A ring R is to be called a z,-ring tf every homomor phic 


image of R (including R) is a x-ring. 


One readily observes that z, is now invariant under homomorphism and 
satisfies (A). That is x, is an H-property. If one starts from an H-property 
x, the new property z;, is the same as 7; this implies (a; ), = 

In some cases z, is even an /?-property. Thus, for example: 


LemMMA 7.1. Jf x is an isomorphism invariant property satisfying (A), 
(C,) and (C.), then zp, is an R-property. 


Proof. By Theorem 1.4 and Lemma 2.1, it suffices to show that z=, 
satisfies also (C,) and (C.). To prove (C;), we consider an ideal A in a 
ring R such that both A and R/A are z,-rings. Let R® be a homomorphic 
image of R. # induces then a homomorphism of R/A onto R®/A%, Since 
the latter is also a homomorphie image of ? and A® is a homomorphic image 
of A, it follows by (C.) and by the definition of z, that R® is also a 7-ring. 


This proves that z, satisfies also (C,). 


To prove (C.), let A, C A.C---+ be a non decreasing well ordered 
sequence of z,-ideals in R, and let A =U A,. Let &@ be a homomorphism of 
A onto A®. Then # induces a well ordered sequence of ideals A,? C A,®C-: - - 
in A®, Each A,* is a z-ideal by the definition of z,. It follows now by 
(C.) that, since U A,? = (U A,)® = A®, A® is a z-ring, which proves (C2). 

The following is a method for constructing properties satisfying the 
conditions of this lema. 

To each property we can define a property of being “-semi simple ” 
which we shall denote by z.s.s. That is: a ring Ff is a z.s.s-ring if it does 


not contain non zero 7-ideals. 
LemMaA 7.2. If a is an H-property satisfying (D) then z.s.s. is an 
isomorphism invariant property which satisfies (A), (C,) and (C.). 


By definition and by Theorem 1.5 it follows that z.s.s. is an isomor- 
phism invariant property which satisfies (A). A proof similar to the proof 
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of Lemma I. 2. 3, yields (C,). To prove (C2), let A, C Az C: - - be a well 
ordered sequence of 7-semi simple ideals in a ring R. Let A—UA,. If Q 
is a r-ideal in A, then since QD@QN A, and z satisfies (D) it follows that 
QA, is a z-ideal. This implies that QM A, is also a x-ideal A), and since 
A, is z-semi simple, we have 9M A,=0. Now 


=U(QNA,) =0, 


which proves that A is a z.s.s. ring, q. e. d. 

lt follows now that one can apply Lemma 7.1 to the property (7.8.8.) a, 
which we denote shortly by z.s.h. The definition of this property is the 
following: 

DEFINITION 7.2. A ring R is a w.s.h.-ring if every homomorphic image 


of R is x-semt simple. 
Thus by the preceding two lemmas we have: 


THEOREM 7.1. Jf 7 is an H-property satisfying (D) then .s.h. is an 
R-property. 
If we restrict ourselves to associative rings, we can prove more: 


THEOREM 7.2. If x is a ZH-property of associative rings and satisfies 
(D), then the property w.s.h. is an SR-property, t.e. an R-property satis- 
fying (D) and (FE). Furthermore, if m satisfies the condition that R, ts a 
z-ring if and only if R is a x-ring then x.s.h. satisfies also the condition (G,), 


(G.) for the radical of the matrix ring. 


Proof. First, we note that under the conditions of the theorem, z-semi 


simple rings cannot contain nilpotent ideals. 


Let R be a z.s.h.-ring and let Q be an ideal in R. Suppose Q is not a 
7.8.h.-ring; then it contains an ideal P such that Q/P is not z-semi simple. 
From the condition of the theorem we can readily deduce that Q/QPQ is 
also not z-semi simple. Let T/QPQ be a non zero z-ideal in Q/QPQ. 
It follows by (D) that QTQ/QPQ is also a z-ring; but since R is a 7.s.h.- 
ring, R/QPQ must be x-semi simple and, therefore QTQ CQPQ. Conse- 
quently, the set {x | xe Q,QrQ CQPQ} constitutes a non zero ideal in R 
(containing 7’) which is nilpotent modulo QPQ. This is impossible since 
in the present case we know that 7.s.s.-rings cannot contain non zero nilpotent 
ideals. Hence we obtain the validity of (D). The rest of the first part 
of the theorem follows now from Theorem 5. 1. 
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Before proceeding with the proof of the second part, we show that if 
R is a z-semi simple ring then so is R,. Indeed, let Q be a z-ideal in R,; 
then R,QR, CQ is by (D) a z-ideal. Now R,QR, =T, where T is the 
ideal in R generated by the elements: rqs,7r,s,e R and where q ranges over 
all the elements of R which appear in the matrices of Q. By the condition 
stated in the second part of the theorem it follows that T is a z-ideal, 
hence T==0. The set {2|reR,RrR =0} is a nilpotent ideal in R; it 
follows, therefore, that this ideal is zero. Consequently, @ =0 and our 
assertion is proved. We turn now to the proof of (G,): let R be a z.s.h.- 
ring and suppose FR, is not a z.s.h.-ring. Let Q be an ideal in R, such that 
R,/Q is not a x-semi simple ring. By (Z) it follows that R,/R,QR, is also 
not x-semi simple. The latter is isomorphic with (R/T), and R/T is by 
assumption z-semi simple, hence our assertion implies that (R/T), is also 
a 7.s.s.-ring. Contradiction! To prove (G.), we note first that if R, is a 
z.s.h.-ring then R must also be a z.s.h.-ring. Indeed, for every ideal T in R, 
(R/T), =R,/T, and if R/T were to contain a non zero z-ideal then, by 
the condition of the theorem, the latter would also contain a non zero z-ideal 
which is impossible. Now let R be a z.s.h.-semi simple ring and suppose FP, 
is not such a ring. Let Q be an ideal in R, which is a z.s.h.-ring. By (D) 
it follows that R,QR, =T,(CQ) is also a z.s.h.-ring. This implies, by 
the preceding argument, that T is also a z.s.h.-ring; hence T—0. But if 
this is the case, Q®? 0; but then Q cannot be a 7z.s.s.-ring and, evidently, 
not a z.s.h.-ring. 

The preceding theorem can be applied to many ring properties, and in 
particular; quasi regularity, nillity and semi nilpotency. The respective 
radicals will be dealt with in part ITI. 

A by-result of Lemma 7. 2 is the following : 


THEOREM 7.3. If z is an H-property which satisfies (D), then every 
ring R contains maximal x-semi simple ideals P and cach of them is a x-semi 
simple radical in the sense of section 1. 


The proof follows immediately, since the relation between ideals deter- 
mined (as defined in section 1) by the property of “ -semi simplicity ” 
satisfies the requirements of Corollary I. 2. 1. 

We can prove more: 


THEOREM 7.4. If is an R-property which satisfies (or (F;)), 
then every ring R contains maximal x-semi simple subrings (right ideals), 
and r(R) 1 S=0 for every such maximal subring (right ideal 8). 
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The proof of this theorem is similar to the proof of Lemma 7.2. By 
proving that if 8, S.,C--- is a well ordered sequence of z-semi simple 
subrings of PR, then U S, is also z-semi simple. This implies the existence 
of maximal 7z.s.s.-subrings S. It follows by (D) that 8S C is 
a w-ideal in and, therefore, in 8; we obtain N S —0, q.e. d. 

In the next section, we turn to another useful method of constructing 
new radical properties, a method which will be seen to yield a generalization 


of Baer’s Lower Radical ([5]). 


8. Additive properties. Many H-properties of rings, like nilpotency, 
are not /-properties, yet their upper radicals behave very similarly to radicals 
of R-properties. Those are the H-properties which satisfy the condition of 


additivity. Namely: 

(Add) Jf A, B are two z-ideals in a ring R then their union (A, B) 1s 
also a x-ideal. 

Such properties will be called A-properties. Evidently, -properties 
satisfy this condition. Furthermore, this implies that the union of any 
finite number of z-ideals is a 7-ideal. 


With the aid of a property z, we define a property z* as follows: 


DEFINITION 8.1. A ring R is a x*-ring if every non zero homomorphic 


image of R contains non zero z-ideals. 
An equivalent definition is given by the following lemma: 


Lemma 8.1. R is a x*-ring if and only if R=U(R), where U(R) is 


the upper w-radical. 


Indeed if R  U(R), R/U(R) is a non zero z-semi simple homomorphic 
image of R; hence R cannot be a r*-ring. Conversely, if R is not a *-ring 
then FR properly contains an ideal P such that R/P is z-semi simple. Hence, 
by Theorem I. 1.2, R 0 PD U(R), and the proof is completed. 

An associative ring R was called by Levitzki® an Z-ring if R coincides 
with its Lower Radical (in the sense of Baer [4]). Noting that Baer’s 
Lower Radical is an uppper radical (in the sense of the present paper), of 
the property of nilpotency, the preceding lemma shows that the notion of 
a*-rings is a generalization of the notion of L-rings. 


® J. Levitzki, “A theorem on polynomial identities,” Proceedings of the American 
Mathematical Society, vol. 1 (1950), p. 335. 
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THEOREM 8.1. If a is an H-property which satisfies (EK), then x* is 
an R-property which satisfies the same condition; and the x*-radical coincides 


with the upper x-radical. 


Proof. Evidently, it follows from the definition of ~* that it is an 
H-property. First we assert that 7*-semi simplicity is equivalent to z-semi 
simplicity. Indeed, if R is w-semi simple, then it follows by (1) that 
U(P) =0 for every ideal P in Rk. In view of Lemma 7. 1, this implies that 
R does not contain non zero z*-ideals, i.e., & is 7*-semi simple. The con- 
verse is evidently true since z-ideals are also 7*-ideals. Consequently, the 
validity of (E) for + implies that ~* also satisfies (E). Another result 
obtained by our assertion is that if R is not a 7*-ring, R/U(R) is a non zero 
7*-semi simple image of R. This proves that z* satisfies (C), hence +* is an 
R-property. By Theorem 1.1 it follows readily that U(R) D r*(R). Since, 
by Corollary 2.1, U(R) is a x*-ideal, it follows by Theorem 1.1 that 
a*(R) DU(R); hence = U(R), q.e.d. 

A similar theorem about condition (D) is not true in the general case. 
But for properties which satisfy the condition of additivity we can show: 


THEOREM 8.2. If x is an A-property which satisfies (D) and (§), 
then x* is an SR-property. Furthermore, if a satisfies also (D,) or (Ds) 
then x* satisfies also the same condition. 


Proof. From the preceding theorem we see that it remains to prove the 
validity of (D). We shall prove the second part of the theorem for (Ds) 
and the proof of (D) and (D,) is similar. 


Let R be a x*-ring and let S be any subring of R. The proof will 
follow by showing that U(S) > SN U(R), since then U(S) = 8 if U(R) = R. 
This is carried out by proving inductively that U(S) DSOU)(R) for 
every 4. For A= 0 it is evident. If A is a limit ordinal, then 


SA =8 N(UrerU,(R)) = N U,(R)) C U(8). 


Let A not be a limit ordinal. Put V=—U).,(R). By definition, U,(R) is 
the union of all the ideals P such that P/V is a z-ideal; hence, if 
aeU,(R) OS, then ae P} UP,U---UP,, for a finite number of ideals 
{P;} such that P;/V are z-ideals. By the condition (Add), P = U P; is also 
a w-ideal over V. The quotient (P18) U V/V is a subring of P/V, hence 
by (D,) it is a z-ring. Since P  V, 


VU (PN 8)/V=PNS8/[PN 
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Thus, the latter is also a z-ring. By Theorem 1.7%, and from the fact that 
U(S) D SOV, it follows readily, that since PO S/VNS is a z-ideal, 
U(S) DPNS. Thus aeU(S) for every aeU,N8. Consequently 
U(S) DUNS, q.e.d. 

The proof of (D) (similarly of (D,)) follows in the same way by 
replacing S by an ideal (right ideal) B of R and noticing that VU (PN B) 
is an ideal (right ideal) in P. 

In particular we have by Theorem 2.3: 

CoronLary 8.1. Jf is an A-property, then U(A) for 
every ideal A wm R. 

We conclude this seection with showing that: 

THEOREM 8.3. If x is an H-property which satisfies (C,) then x satisfies 
the condition (Add). 

Indeed, let A and B be two z-ideals in R. Since AU B/B=A/ANB 
and the latter is homomorphie with the z-ring A, 4 U B/B is also a a-ring. 
B is a z-ideal in AU B as well as in R; it follows, therefore, by (C,) that 


AU B is also a z-ring, which proves (Add). 


9. Generalization. \ close survery of the proofs of the preceding sec- 
tions (except section 6) shows that no use whatsoever has been made of the 
operators of the rings and, except for one point (on 4-properties), even the 
fact that rings and ideals contain elements. The whole theory can be’ developed 
in a far wider class of mathematical objects. The largest field in which this 
can be done is that of the Lattice-ordered Bicategories of S. MacLane defined 
in [7], which satisfy some additional axioms so that the two main isomor- 
phisms hold and some minor properties of ideals in rings. 

We do not intend to give here the list of axioms such a category has to 
satisfy, but only a list of conditions, some of which may be considered as 
axioms, others as lemmas, to be valid in such a category in order that the 
whole theory can be developed in it. It is worth noting that the whole theory 
of radicals is just a relation between injections and projections of a bicate- 
gory. The proof in the general case can be obtained by a simple change of 
‘objects ’ and ‘ normal objects,’ ‘ supermaps’ instead of ‘ rings,’ ‘ ideals’? and 
‘homomorphisms.’ 

In giving the list of conditions we shall make use of notions and 
theorems given in [7], with the only difference of using the zero symbol 04 
instead of the multiplicative symbol 14 used in [7]. 
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Let € be a lattice-ordered bicategory satisfying the axioms of part II 
of [7] (pp. 495-507). Such a category contains objects denoted by 4A, 
B,: and mappings «, B,y,: °°. Recall the following facts of 
[7]: The set of all subobjects A C F form a complete lattice €(R) with 
the zero Or, and F as zero element. 

If «:R—R’, then a, denotes in [6] the mapping of G(R) induced 
by « (for details see [7], p. 500). We shall use the notation @ instead of a, 
and put «A for the image @,A. 

The kernel K, of a is by definition the l.u.b.{A | A C R, aA = Op}. 
The set of all kernels of R is exactly the set of all subobjects of R normal 
in R (in the sense of [7]). With each normal object A of R, there is 
associated a unique projection i4: R-R/A, with the unique quotient 
object R/A. 

A simple observation shows that the set 3¢(#) of all normal subobjects 
of R constitutes a complete lattice. This is not necessarily a sublattice of 
©(f). It is true that the intersection of normal subobjects in 9() is the 
same intersection as in G(R) ([7], p. 507), but the union U NV, in N(R) 
is defined as the intersection of all normal subobjects of R containing all V,. 


This is the general set-up in which one can state the results of the preceding 
theory. The objects R will replace the rings and the kernels (= normal 
subobjects) will replace the ideals. In order for the proofs to hold, one has 
to assume that the complete lattice satisfies some additional requirements. 


I) Ita: R-—cR is a supermap (we call it a homomorphism (onto) to 
make it similar to the language used here), then @ maps normal subobjects 
of R onto normal subobjects of and N;) = U (aj). 


Il) If a: R-—»aR is a homomorphism, and «1A denotes the inverse 
image of a normal subobject A of aR ([7], p. 507) then a?° is a lattice 
isomorphism between the lattice of all normal subobjects of R lying in 
the interval K, C R (Ka the kernel of a) and the lattice N (aR); and 
R/a"B aR/B for every Be N(aR). 


III) If RR is a homomorphism, A e 3(R), then « induces a homo- 
morphism R/A > «R/aA. 


IV) 9,4, B,eN(k). If ROADB then BeN(A),™ and so 
QNAER(Q). 


1° Denoted by a*, in [7]. 
11 This fact was pointed out in [7], p. 507. 
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V) If ROD BDA,A,BeN(R), then there is a homomorphism: 
R/A — R/B such that for every C D A, Ce N(R), C/A exists, and its image 
is (CU B)/B. Furthermore, by (II) we have: (R/A)/(B/A) < F/B (The 
Second Isomorphism theorem). 


VI) For A,BeR(R), there exists an equivalence (isomorphism) 
AU B/A@&B/A OB (The First Isomorphism theorem). 


VII) If A, C A, C--- is a well ordered non decreasing sequence in 
N(R) then U A; in N(R) is the same U A; in N(B) for every normal object 
BD A. 


These conditions are fundamental for the theory developed in sections 
1, 2, 7, 8 (except Theorem 2.1 and its corollary). The second part of 
condition II was used only at Theorem 2.3. For the sake of Lemma 7. 2 
we need also: 


VIIT) If A, C A, C-- - is of the preceding type, then QM (U Ay) 
=U(QNA,) for any QeM(R); and for the sake of Theorem 8.2, we 
need (VIII) and: 


IX) If {P.} is a set of N(R), BeN(R), Ae M(B) and such that 
AD (U P;) NB for every finite set Pi,- --,P, chosen out of {P,} then 
A DPOB, where P is the union of all Py. 


In the proofs of sections 3 and 8, and of Theorem 2.1 and its corollary, 
one notices the resemblance between the problem of one sided ideals and 
subrings. These two cases are particular applications of similar problems 
which arise for the bicategories of the preceding type. The general situation 
is as follows: we assume that to each object R belongs a complete lattice of 
subobjects 3(#) (right ideals, subrings). These subobjects should satisfy 
conditions of a similar nature to the conditions satisfied by %(R). Namely: 


I’) If a: R-—>aR is a homomorphism, then « induces a lattice homo- 
morphism of onto such that «(U J;) = U (aJ;). Furthermore, 
J; —> aJ; is a homomorphism. 


II’) The lattice of normal subobjects 2(R) is a sublattice of Y(R). 


III’) IfAeRN(R), Je then AN Je R(J) and Y(A), and there 
is an isomorphism AU J/A@eJ/AN J. 


IV’) If {Ag} is a set of N(R)(J(R)) and Ag CJeJ(R), then 
AgeN(J) and U Aa in R(R)(R(R)) is the same object as U Ag in 
R(J)(S())- 
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Je Y(#), then J/A exists and belongs 
to 3(R/A). 


And for the sake of Theorem 8.2, we need the previous conditions (VIII) 
and (IX) to hold also for B, 3(#) and the fact that if D J, Q, J 
then J € Y(Q). 

Even without going as far as bicategories one readily observes that the 
whole theory can be developed without any additional efforts for groups, 
semi groups, loops, etc., or, more generally, to abstract algebras in the sense 
of Birkhoff (e.g. [2]), where ideals should be defined as kernels of homo- 
morphisms. These abstract algebras should be of a more restrictive type 
than those studied in [2], Ch. VI (also, Goldie [6]). Namely: one has to 
assume that the operations of the abstract algebras R = (R,Q) are finitary 


functions fg(21,° fgeQ and that: 
(A) These algebras contain a selected one element, denoted by zero. 
(B) The congruent relations of the algebras are uniquely determined 
by the set of elements which are congruent to zero (i. e. by the kernel = ideal). 
(C) <All congruent relations considered must be permutable (in the 


sense of [2], Ch. VI). 


By the results of [2], Ch. VI (see also [6]), one readily shows that 
these algebras satisfy all conditions imposed on the bicategories., In particular 
we remark that if 4; C A, C: - - is a well ordered sequence of ideals in such 
an algebra & then U A, is just the set-theoretic union, which proves (VIL), 
(VIIT). 

To these algebras one can extend part of the theory of radicals as 
developed by Brown-McCoy in [8] and [4]. Their whole theory can be 
extended to algebras of the type mentioned above which are loops with respect 
to one of their binary functions: f,(7,y) —=«-+y. In the next section we 
develop their theory first, partly, in the general case and then the whole 


theory for loop algebras. 


9. Brown-McCoy’s theory of F-radical. Furthermore, we assume, in 
this section, that to every algebra considered there corresponds a complete 
lattice R of subsets containing the zero. By a §-subset of R we shall mean 
a subset of R which belongs to 8. We also assume that for every homomor- 
phism # of R the R-subsets of R® are the images of the -subsets of R and 
the inverse images of the first is a subset of the latter. 
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Let a— F(a) be a mapping of elements of algebras R onto R-subsets 
of Rk. This mapping is assumed to be homomorphic invariant; that is: 
F(a®) = F*(a).° Following [3], we define: a §-subset T of is called 
maximal modular if there exists an element ae R such that F(a) CT and 
T is a maximal §-subset of R which does not contain a. An ideal A in R 
is said to be F-primitive if A is a maximal ideal contained in some maximal 
modular R-subset of R. An algebra R is F-primitive if the zero is a primitive 
ideal in R. An ideal P is called F-regular if ae F(a) for every ae A. 

It is readily seen that F-regularity is an HJI-property in the sense of 
section 1, hence it yields an upper F-radical which we denote by F(f). 


THEOREM 9.1. Jf the primitive ideals P are F-ideals, then F(R) is 
the F-radical of R and it is the intersection of all primitive ideals of R. 


Proof. Let A > B be two ideals in # such that A/B is F-regular in 
R/B. We wish to show the existence of a primitive ideal P  B such that 
ADANP. Without loss of generality we assume that B=0. Since A 
is not F-regular, az F(a) for some ae A. By Zorn’s Maximum Principle 
we deduce the existence of a maximal modular R-subset S of R containing 
F(a) and excluding a.'* Let P be a primitive ideal contained in S, then 
since q.e.d. Our theorem follows immediately as a 
consequence of Theorem I. 2. 4. 


Generally, it cannot be shown that F-primitive ideals are F-ideals. 

Furthermore, it is not certain that if A is F-primitive in R then R/A is an 
F-primitive algebra. The following are some special cases for which this 


is true. 


THEOREM 9.2. Jf the sets F(a) (the set ®) are the set of all ideals, 
then if A ts an F-primitive ideal in R, R/A is F-primitive and every primitive 


algebra is F-semi simple. 


Proof. In the present case F-primitivity coincides with maximal modu- 
larity, hence if A is F-primitive F(a) C A for some af A. In the quotient 
R/A, the image a of the element is a non zero element such that F(a) = 0. 
By the maximality of A with respect to A, one readily observes that every non 

12 Here FO(a) = xe R, = for some ye F(a)}. 

18 Meaning that R/P does not contain non zero F-regular ideals. 

*4 Here we have to assume that the union of a linearly ordered set of &-subsets is a 
&-subset which is their set-theoretic union. This holds for example if the &-subsets are 
all subalgebras. 
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~ 


zero ideal in the quotient R/A must contain @ and thus cannot be F-regular. 
This proves that R/A is primitive as well as F'-semi simple. 

The preceding two theorems present a generalization of the F’-radicals 
as developed in [4] and in example 3 of [3], section 6. The incorporation 
of the theory developed in [3] is achieved by restricting ourselves to abstract 
algebras which are loops with regard to one of their functions f(a, 6) =a-+) 
and assuming that the R-subsets are subloops. We refer to such algebras as 
loop-algebras. 

Now let a— F(a) be a homomorphic invariant mapping of the elements 
of loop-algebras onto &-subsets which satisfies the following conditions of [3]: 

P. F(a+b) C F(a) U (0),. 
where F(a) U (0), denotes the minimal loop-subalgebra of R which contains 
both F(a) and the element b. 

In a similar way to the proof of the lemma of [3, section 8], one verifies 
that if A is F-primitive in R then R/A is an F-primitive ring. Hence the 
theory developed in [3] section 8 is now a consequence of Theorem 9.1 and 


the following lemma: 
Lemma 9.1. Jf R is an F-primitive loop-algebra then R is F-semi simple. 


Indeed, let & be primitive with respect to a and the maximal modular 
R-subloop S. Let P be a non zero ideal in R. The natural, homomorphism 
of R onto R/P maps S onto the R-subloop P(S)/P of R/P. Since PCS, 
P(S) DS. Henceae P(S). Thus a=s(mod P) for some se S and, there- 
fore, a—s—peP. It follows now by condition (P) that 


F(p) C F(a) U (s); C F(a) US=S. 
Assuming that P is F-regular, we obtain a — s = pe F(p) CS which implies 
a—=p-+se8. Contradiction. 


Next we assume that the mapping a— F(a) is homomorphic invariant 
and satisfies the second condition of [3]: 


(P,) If be F(a) then F(a+b) C F(a). 


This condition does not imply Theorem 9.1, yet we are able to show that 
in this case: 


THEOREM 9.3. If a— F(a) is a homomorphic invariant and satisfies 
(P.) then the property F-regularity is an RI-property. 
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In view of Theorem 2 of [3] and of the results of section 1, this theorem 
yields the results of sections 1-3 of [3]. 


Proof. We show the validity of (C,) and (C.), and the proof of the 
theorem will follow by Theorem 1.4. The validity of (C.) is evident. To 
prove (C,) we consider the ideals A ~ B of R such that A/B and B are 
F-regular. The natural homomorphism of RF onto R/B maps F(a) onto 
BU F(a)/B. Hence for every ae A, a= (a+ B/B) e F(a) = F(a) U B/B, 
i.e, for some xe F(a). Thus,a—a2—beB. It follows now by 
(P.) that F(b) C F(a). Since B is F-regular, b =a—aeF(b) C F(a) 
and, therefore, ae F(a). This proves that A is F-regular,, q.e. d. 
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III. Applications. 


By S. A. Amirsur. 


The present paper contains applications of the theory developed in the 
first two parts ([1], [2]) to theory of rings, in particular to associative rings. 
The results of the preceding parts [1], and [2] will be quoted by the numbers 
I, or II, followed by the number of the result quoted. 

In section 1 we outline an extension of the theory of nilpotent ideals in 
associative rings to non associative (and non distributive) systems, obtaining 
a generalization of Baer’s Lowe Radical ([3]) and proving that this generalized 
radical is also the intersection of all prime ideals, a result which was 
proved by Levitzki ([8]). Continuing the study of nilpotent ideals, we 
restrict ourselves in section 2 to associative rings. In this section, some new 
results on the Lower Radical are obtained as well as some short cuts in the 
proofs and new results for the semi-nilpotent radical ([7]). In view of the 
results obtained in section 2, it seems that Baer’s Lower Radical is the best 
‘minimal ’ extension of the nilpotent radical of rings satisfying the minimum 
condition for right or left ideals. In section 3 we approach some of the 
known radicals from the point of view of the present theory. Thus we obtain 
alternative proofs for: Jacobson’s radical ([6]), the locally finite kernel 
({10]) and McCoy’s radical ([11]). In the last section we utilize the 
theory for obtaining new radicals, among which is an Fl-radical. A ring R 
is an FJ-ring if every homomorphic image FR’ of F is an I-ring ([10], i.e. 
every non nil right ideal in R’ contains an idempotent). It is shown that this 
property FJ yields a radical which satisfies all the ordinary requirements a 


radical is supposed to satisfy. 


1. Nilpotency and solvability. In the present section we consider rings 
(semi groups) Ff not necessarily associative (or distributive) and we outline 
an extension of the theory of nilpotent ideals to the non-associative case. 
The whole development here will hold also, without any changes in the proof, 
in general abstract algebras of the type considered in the last section of II, 
which are semi groups with respect to a binary operation f(a, b). 


* Received September 10, 1952; substantially revised March 7, 1953. 
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Let Q,, Q2 be two subsets of such a ring R. We denote by QQ. the set 
of all element 9:92, 9:€@Q1, In particular we put Q’=QQ and 
inductively Q™ = Note that (Q™)(™ —Q@™, An ideal P in R 
is called a prime ideal (in R) if: P,P, C P, where P; C P are ideals in R, 
implies that either P; C P or P, CP. 

Defining a ring FR to be a ¢-algebra if R’ = 0,’ we obtain an H-property 
¢ of rings. The upper ¢-radical, which we denote by ¢(R), will be called 
in this paper the Baer’s radical. It is known (Levitzki [8]) that in the 
case of an associative ring R, Baer’s radical is the intersection of all the 
prime ideals in R (McCoy’s radical [11]). It will be shown here that this 
result is true for non associative ring, semi groups and more generally this is 
true for the abstract algebras of the type mentioned above. That is: 


THEOREM 1.1. £(R) is the intersection of all the prime ideals of R. 


Proof. Observing that prime ideals are ¢-ideals, we deduce from Theorem 
I. 1.2 that the intersection XY of all the prime ideals of R contains ¢(R). 
The theorem will be proved if we show that for every af f(R) there is a 
prime ideal Q in R which does not contain the element a, since then X C ¢(R). 
Indeed, put a; = a; let A, be the minimal ideal in R containing a, and ¢(R). 
Since a, A, 0€(R). Hence Let a, be any element of 
A’, which does not belong to (RF). In a similar way we can inductively 
obtain a sequence of elements such that a;¢f(R) and ¢ A’, 
where A; is the minimal ideal in # containing €(F) and a; By Zorn’s lemma 
there exists a maximal ideal P in F containing ¢(R) and excluding the 
sequence {a;}. The ideal P is the required ideal. For if P,P,CP and 
P; > P then the maximality of P implies that a;e P, and a, P2, for some j 
and k; hence P; 1 P» contains an element a, of the sequence {aj}. But this 
implies that a),, ¢€ A’, C P,P, C P which is a contradiction.’ 

The preceding proof provides an alternative characterization of €(R). We 
call a sequence {qi} of elements of R an m-sequence if gis 1 = 1,2,°-° -, 
where Q; denotes the minimal ideal in R containing g;. An m-sequence is 
said to be a null sequence if it contains only a finite number of non zero 
elements. 

Define a property » of ideals in FR as follows: an ideal in A in F is a 
p-ideal in R if_every m-sequence of elements of A is a null sequence. 


We prove: 


? Recall that P is said to be a §f-ideal in R if the quotient R/P is {-semi simple. 
For associative rings, the notion of €-ideals coincide with the notion of radical-ideals 
of [3]. 

* This proof resembles that of the lemma of [9]. 
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THEOREM 1.2. The property p is an RI-property (in the sense of II- 
section 2) and = €(R). 


Let # be a homomorphism of F onto Rk? and Q be an ideal in PR. Let 
P be any ideal in 2, such P = Q°. First we show that for any m-sequence {bj} 
in P, there exists an m-sequence {a;} in Q such that bj = aj®,1=—1,2,---. 
Choose a,, any element of Q such that a,?=0,. Suppose that +, dy 
have been so chosen that ae Q, a® i=1,---,n and A’; where 
A, denotes the minimal ideal in # containing a; We determine ay,, as 
follows: since a,° = b, it is evident that A,® — B,. where B, is the minimal 
ideal in R which contains b,. Since (A’,)? = B’, and by, ¢ B’, we can find 
A’, such that = and evidently a,,,¢@Q. This proceduce deter- 
mines inductively the required sequence. 

Consequently, if every m-sequence of the ideal Q is a null sequence then 
the ideal P = Q° has the same property. This proves that » is an /J-property. 
That » is an RI-property will be shown by proving the validity of II. (C). 
Let A be not a p-ideal in &, then A contains a non null sequence {a;}. Con- 
sider the maximal ideal B contained in A and which does not contain the 
sequence {a;}. Evidently AB. If A > PB then the maximality of B 
implies that a;e¢ P for some 1. Hence aje P for every j=1t. Thus P/B 
contains the non null sequence 4, @i,:,- - - which proves that P/B cannot. be 
a p-ideal, and II. (C) is proved. 

To prove we note first that a ¢-ideal in is a p-ideal. 
By Corollary I.1.5 it remains, therefore, to show that f-semi simplicity 
implies y-semi simplicity. Indeed, let R be a -semi simple ring and let 
A ~0 be an ideal in R. Choose a, any non zero element of A; since A’; 40 
take a, ~ 0 any element of A’,. Inductively we obtain a non null m-sequence 
{a;} in A, which shows that FR is also p-semi simple. 


Remark 1.1. The preceding proof, with the exception of the last part, 
is readily seen to hold, with some slight changes, for the following similar 
properties of rings (semi groups, etc.). A sequence {p;} in R is a mo-sequence 
if pis: © P; where P; is the minimal sub ring of F# containing p; (alternative 
definitions: P; = pk, Pi; = Rp;, etce.); and R is a po-algeba if every mo- 
sequence is a null sequence. 

The second aim of the present section is to outline an extension of the 
theory of nilpotent ideals of associative rings to non associative rings (semi 
groups, or general multiplicative algebras). It seems that in the general 
case the property of solvability should replace the property of nilpotency. 

We call a subset Q of RP a solvable subset if Q™ = 0 for some integer n; 
and a ring R is said to be a o-ring if R is solvable. We call a ring FP a 
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so-ring (semi solvable)* if every finite subset of R is solvable. Furthermore, 
an element ae F is said to be a nil element if the set (a) containing the 
element a is solvable, and a ring R is a nil-ring (v-ring) if all its elements 
are nil elements. 

THEOREM 1.3. 1) o is an H-property which satisfies (C) and (Dg)* 
of II. 2) so and v are R-properties which satisfy II(Ds) (and hence I1(D)) ; 
and v(R) so(R) Do(Rk) 

Proof. Evidently, each of the properties are H-properties and satisfy 
II. (Fs). o satisfies II. (C,), for if A/B is solvable and B is solvable then 
A™ CB for some n and BO™ —0, and hence A“%™ =0. A similar proof 
holds for so and vy. Since these last two properties evidently satisfy II. (C2), 
it follows by Theorem IJ.1.4 that they are R-properties. The relation 
between the various radicals follows by Theorem I. 1.6 as a consequence of 
the fact that a ¢-ring is solvable, solvability implies semi solvability and, 
finitely, a semi solvable ring is a nil ring. 

There are many other aspects of extending the theory of nilpotent ideals 
of associative rings to the non associative case. Thus, for instance, nilpotency 

“may be defined as follows: a subset Q of RF is nilpotent if there exists an 
integer n such that all the possible multiples of n elements of Q are zero. 
Similarly we can define semi nilpotency and nillity. Each of these properties 
is readily seen to be an H-property. But the main disadvantage of the 
nilpotency defined in this manner is that it does not satisfy II. (C,) and, 
therefore, its allied properties: semi nilpotency and niility are not; generally, 
R-properties and do not yield a radical. The validity of II. (C,) seems the 
main reason for preferring solvability in the theory of non associative rings 
over the other generalisations of nilpotency. 

Remark 1.2. Radicals of a similar type can be defined in abstract 
algebras of the type considered in section II, but which need not he ring or 
semi groups. We assume that these algebras possess a function f(2,---,@n), 


n <0, (e.g. Lie triple systems) about which we assume that f(a,---, 2%) =0 
if one of the z;—0. As in section II ideals will be the kernels of homo- 


morphisms. An ideal will be said to be prime if 
Pn) = Pn), pie Pi} CP, 
where P; are ideals, implies that at least one P; C P. Denoting 
Ri = f(R,- +, R) = +, tn), re RB}, 
we define a ring to be a ¢-ring if R’ = 0. 


’ This is a generalisation of the notion of semi-nilpotency defined in [7]. 
“By Theorem II.8.3 it follows that o satisfies also the additivity condition 


II. (Add) of section 8. 
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2. Nilpotency and semi-nilpotency of associative rings and semi- 
groups. We note that when associativity is assumed the two main properties 
solvability and nilpotency and, therefore, their issues are equivalent. Another 
point to notice is the difference between the present definition of semi- 
solvability and the definition of semi nilpotency given in [7]. Our definition 
seems to be more restrictive by requiring the solvability of every finite subset, 
but evidently the two definitions are equivalent if associativity is assumed. 
In the present section we deal with further properties of the radicals defined 
in the preceding one. We shall consider only associative rings but we remark 
that the following proofs and results hold also (with slight modifications) 
for associative semi-groups. 

Since in the present case solvability and semi-solvability are equivalent 
to nilpotency and semi-nilpotency we shall also refer to the latter by the 
respective notation o and sv. 

In addition to the properties of so given in Theorem 1.3 we prove: 


THEOREM 2.1. so is an RZ-property® which satisfies II. (F;). 


In view of Theorem 1.3 and the validity of (2), it remains to show the 
validity of II. (F,). This will follow by proving Lemma II. 3. 1 by a method 
due to J. Levitzki (compare with [10] Lemma 7.3). Let J be a right semi- 
nilpotent ideal in R. Consider the ideal RJ. Let t,,- --,t, be any finite 
set of elements of RJ; then t; = = TinSix, Tix € R, six,eIT. Denote by [¢], [r] 


and [s] the module generated by the elements {t;}, {ri,}, and {si} respec- 
tively, then [¢] C [r][s]. Hence [¢]™** C [r]([s][r])™[s]. The module 
[s][r] € [sr] and the latter is generated by a finite number of elements of J. 
Hence ([s][r])” —0 for some m. This proves that [¢t]"**—0. Thus RJ 
is semi nilpotent. 

The known properties of the semi nilpotent radical ([7]) follow now 
immediately by the results of part II. Additional results follow by Theorem 
II. 2.2 and II. 6.3 namely: 


CoroLLaRY 2.1. 1) so(A) NA for every ideal A in the 
ring R. 2) so(R,) =s8o0(R)n. 

Next we turn to a further study of the Baer’s radical £(R) of associative 
rings. 

For associative rings, ¢-semi simplicity implies o-semi simplicity and 
hence by Corollary II. 1.6 and by Theorem 1.3 it follows that o(R) = ¢(R). 
We have already seen that o is an HZ-property which satisfies II. (C,) and 


5 For definition see section II. 6. 
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II. (Ds) (and evidently II. (D,) and II.(D)). Furthermore o satisfies 
II. (E): this follows from the fact that if P is a nilpotent ideal (a o-ideal!) 
in an ideal Q of a o-semi simple ring FR, then since (RPR)*"* C RPh, 
RPR is a nilpotent ideal in R, which implies RPR —0 and consequently 
P=0. It follows now by Theorem II. 8. 2 that o* is an RZ-property which 
satisfies II. (D,) and II. (D,). A ring R was called an L-ring* an L-ring 
if R= €(R) =o(R). 

In our notation this is equivalent to the definition of a o*-ring. Using 
the notation of L-rings we obtain, in view of the definition of o* and of 
Lemma II. 8.1: 


CoROLLARY 2.2. A ring R is an L-ring if and only if every non zero 
homomorphic image of R contain non zero nilpotent ideals. 

The validity of (D,) implies: 

CoroLLary 2.3. Hvery subring of an L-ring ts an L-ring. 

We wish now to show that the property L (—o*) satisfies IT. (F,). 
Let R be an L-semi simple ring and let J be a right ideal in R such that 
L(J) =J. The annihilator of J in J is the maximal nilpotent ideal in J. 
For, if Q is a nilpotent ideal in J, then QJ is a right nilpotent ideal in R. 
The L-semi simplicity of R readily implies that R does not contain non zero 
right nilpotent ideals and hence QJ =0. Thus Q CN, where N is the left 
annihilator of J in J. By Theorem II. 1.1 and by the definition of the upper 
radical, it is readily seen that L(J) —N. Hence J?=0. By the L-semi 
simplicity it follows now that J = 0. 

Thus, we obtain by Theorem II. 2.2 and by Theorem II. 6.3: 


CoroLLary 2.4. 1) L(A) =L(R) OA for every ideal A ina ring R. 
2) L(Rn) =L(R)». 

We conclude with a remark that the preceding results with the exception 
of the second parts of the Corollaries 2.1 and 2.4 hold also for associative 
semi groups. 


3. Various applications. We bring now some alternative proofs of some 
of the known radicals, in particular those which do not enter in the class of 
F-radicals, discussed in [4]. 


A. Jacobson’s radical. This radical and its generalizations were dealt 
with from the point of view of F-radicals by several authors. We shall 


° This is the notation used by J. Levitzki in: “A theorem on polynomial identities,” 
Proceedings of the American Mathematical Society, vol. 1 (1950), pp. 334-341. 
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deal here only with the associative case. Recall that a ring & was called a 
quasi-regular ring (QR-ring) if for every re FR there exists an se FR such 
that r-+s-+rs=—0.7 This property is readily seen to be an HZ-property 
and satisfies II. (D) and II. (D,). Furthermore, QR satisfies the condition 
of Theorem II. 6.6. Indeed, if R is not a QR-ring, let ae Rk be not a quasi 
regular element. Consider the maximal right ideal J containing the ideal 
{x + ar} and excluding a. By the proof of Theorem 17 of [6] it follows 
that R/(J:R) does not contain right QR-ideals and this proves Theorem 
II.6.6. Hence QR is an RZ-property and satisfies also II. (F,). The 
validity of II. (E) follows by Theorem IJ. 5.1. Thus most of the properties 
of the QR-radical obtained in [6] are deduced from the results obtained here. 


B. The locally finite kernel. An associative algebra R over a field F is 
said to be locally finite (LF-ring) of every finitely generated subalgebra of R 
is finite over F. Levitzki has recently shown in [10] the existence of the 
LF-radical, which was called the locally finite kernel. The present theory 
provides a short cut in Levitzki’s proofs and yields further properties of this 
radical. 

The property LF is readily seen to be an HZ-property which satisfies 
II. (C,) and IT. (D,) and evidently II. (D). The validity of II. (C,) follows 
by [10] Lemma 7. 4, and II. (F,) follows by [10] Lemma 7.3. Since LF is 
also antiisomorphism invariant, a similar condition holds also for left ideals 
(Remark 1.1). The validity of all these conditions yields the results on the 
locally finite kernel obtained in [10]. Additional results now obtained are: 
LF(A) = LF(R) 2 A for every ideal in an algebra R; and LF(R), = LF(R,). 


C. McCoy’s radical. This radical, which was defined in [11], was 
proved to be equal to Bear’s radical. It can be also approached from the 
present point of view of HJ-properties. Calling an ideal A in a ring an 
M-ideal if every m-system of A contains the zero, we obtain an H/-property 
in the sense of section II.1. The prime ideals are readily seen to be 
M-ideals, and if A > B are ideals in R such that A/B is not an M-ideal, 
one can find a prime ideal Q* in R such that AD ANQDB. It follows 
now by Theorem I, 2.4 that M is an Rl-property and that the M-radical is 
the intersection of all the prime ideals. 


7 Usually, quasi-regularity is defined in a ring but it is known that an ideal A 
(right, left or two sided) in a ring R is quasi regular in R if and only if A is a quasi- 
regular ring. 

8 The ideal Q can be chosen as a maximal ideal Q > B such that Q/B excludes a 
certain m-system of A/B. 
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4. Newradicals. A. Pl-radical. For the present radical we consider 
only associative algebras over a field F. An algebra Ff is said to be a Pl-ring 
if it satisfies a polynomial identity - -,2%,) where the coefficients 
of f belong to F. This property is readily seen to be an HZ-property which 
satisfies II. (D;) and hence also II. (D). Furthermore, PI satisfies II. (Ci) 
and IJ.(E). Indeed, if R possesses an ideal A such that R/A satisfies 
the identity g(2,°°°,%n) 0 and such that A satisfies the identity 
*;%n) = 0, then satisfies the identity 


in the nm indeterminates 2;,. This proves II. (C,). To prove II. (E), let 
R be a PI-semi simple ring and let A be an ideal in R. If Q is an ideal in 
A which satisfies an identity, then AQA CQ is a PlI-ideal in R, hence 
AQA =0. But the totality of the elements xe A such that AvA = 0 con- 
stitute a nilpotent ideal in FR, hence, by II. (Z), it must be zero; and thus 
II. (E) is proved. } 

It follows now by Theorem II. 8.3 that PJ is an additive-property and, 
therefore, PJ* is an RZ-property and satisfies II. (EK) and (D,). We recall 
that a ring # is a PJ*-ring if every non zero homomorphic image of # contains 
non zero ideals which satisfy a polynomial identity. 

The property PJ* is an example of an R-property which does not satisfy 
II. (F,). For, the ring # of all finite matrices is a simple ring and it is 
known that this ring does not satisfy a polynomial identity, yet the right 
ideal ¢,,R, i.e., the ring of all one rowed matrices satisfies the’ identity 


(2% — = 0. 


B. 7.8.h.-radicals. The methods of section II. 7 can be applied to yield 
a class of new radicals. With each property z of associative rings which 
satisfies the conditions of Theorem II.7.1 or II. 7.2 we can associate a 
7.8.h.-radical. We recall that a ring R is a z.s.h.-ring if every homomorphic 
image of R does not contain a non zero z-ideal (i.e. is r-semi simple). In 
particular, Theorem II. 7. 2 can be applied to the following properties: quasi- 
regularity, nillity, semi-nilpotency and nilpotency. 

In particular, a ring F is nilpotent-s.h. if every homomorphic image 
does not contain nilpotent ideals. Now a regular ring is evidently a ring 
of this type so that nilpotent-s.h. radical contains the maximal regular ideal 
defined in [5]. 


C. The FI-radicals. The notions of J-rings and FI-rings were intro- 
duced in [10] and the structure of such rings was determined there. An 
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associative ring R was called an J-ring if every non nil right ideal contains 
an idempotent, and R was called an FI-ring if every homomorphic image of 
R is an J-ring. Similarly we shall call a ring R an Js-ring (an J;-ring) 
if every right ideal which is not semi-nilpotent (which is not an L-ring) 
contains an idempotent. In the same way we define FJs-rings and F'J,-rings. 

We wish to show that the three properties: FI, FIg and FI, are SRZ- 
properties and satisfy the conditions II. (D,). Furthermore, the last two 
properties satisfy also II. (F,). The proof will be carried out only for the 
property FJs since the proof for the other two properties is similar. 

The property Is satisfies II.(A) and a similar proof to that of [10] 
Lemma 5.4 yields that Ig satisfies II. (C,) and II. (C.), hence by Lemma 
II. 7. 1 it follows that FJ s is an R-property. Evidently this property satisfies 
II. (Z). To prove the validity of II. (D,) we consider a right ideal J in an 
FIs-ring R. Let J/Q be a homomorphic image of J and let P/Q be a right 
ideal in J/Q. Now, PJ is a right ideal in R and RQJ is a two sided ideal. 
Since R is an FIs-ring, R/RQJ is an Is-ring, so that (PJ, RQJ)/RQJ either 
contains an idempotent or it is semi-nilpotent. Put M = PJM RQJ, then the 
last quotient is isomorphic with PJ/M. PJM is a two sided ideal in PJ, 
hence one readily verifies that PJ/PJM is also either semi-nilpotent or con- 
tains an idempotent. Since JRQJ CJQJ CQ it follows that 


PJM C PIN PIRQI CPI NG. 
Furthermore, 
(PIN Q)*C PJ(RQI PJ) = PIM. 


Hence PJ/PJ 1 Q will also be semi-nilpotent or contain an idempotent 
respectively what PJ/PJM does. Consequently, (PJ,Q)/Q, and hence 
also P/Q, will have the same property, which proves (D,). Evidently this 
implies II. (D), and by Theorem II. 5.1 it follows that FIs satisfies also 
II. (E). Consequently, FIs is an SRZ-property. . 

We now prove the validity of (F,). Let R be an FIs-semi simple ring 
and let J be a right FJs-ideal in R. Consider the ideal J* = (J, RJ). Let 
J*/Q be a homomorphic image of J* and let P/Q be a right ideal in J*/Q. 
We shall prove that either P/Q contains an idempotent or it is semi-nilpotent. 
Evidently, the same will hold if we replace Q by J*QJ*; or, equivalently, 
we may assume that @ is an ideal in R. In this case one readily verifies that 
the semi-nilpotent radical N*/Q of J*/Q isan ideal in R/Q (by Corollary 
2.1). For every je J, (jP,Q)/Q is a homomorphic image of 7P. The latter 
is a right ideal in J, hence either it is semi-nilpotent or contains an idem- 


A GENERAL THEORY OF RADICALS. III. 135 


potent. If the first case holds for every je J, then (JP,Q)/Q must be semi 
nilpotent ° and hence JPC N*. Thus 


P? C (J, RJ)P C (JP, C(N*, RN*) C N*. 


This proves that (P?,@)/Q and hence also P/Q are semi-nilpotent. If this 
is not the case, (jP,Q)/Q contains an idempotent e for some jeJ, i.e. 
e=jp(Q), jeJ and peP. By replacing p by pe we may assume that 
pe=p(Q). Since e?+0(Q), pj 40(Q) and the latter belongs to P and 
satisfies (pj)? = p(jp)j = pe] = pj(Q). This proves that P/Q contains an 
idempotent. Thus J* is an FJ s-ideal in R; hence the FJs-semi simplicity of 
R yields J* = 0 and therefore J = 0. 

A radical similar to those discussed above is the following: call a ring 
R an I,-ring if every non zero right ideal in R:contains an idempotent, and 
call R an FI,-ring if every homomorphic images of F are I)-rings. As in the 
preceding case FJ, can be shown to be an R-property, but this property does 
not satisfy the conditions II. (D,) and II. (F,). Yet we can prove that FI, 
satisfies II. (D) and the two conditions II. (G,) and II. (G.). Thus FI, and 
the preceding properties FJ, FIs and FI;, satisfy the main results on the 
radical, in particular Theorem II. 2.2. Furthermore, FJ, satisfies Theorem 
II. 4. 1 and the last two properties satisfy Theorem II. 6. 3. 

To prove that FJ, satisfies II. (D) we consider an FI,-ring R and an 
ideal A in R. If A is not an FI,-ring then A contains an ideal Q such that 
A/Q is not an J)-ring and, evidently, A/AQA cannot be an Jo-ring. Hence 
we may assume that Q is an ideal in R. If J/Q is a right ideal in A/Q 
which does not contain an idempotent, then since R/Q is an Jp-ring, JA C Q. 
The set {x | xe CQ} is an ideal T in R. If TAOmod Q then it 
contains an idempotent emod Q but then e?¢e¢A CQ which is impossible. 
This proves that J =0modQ; that is, the validity of II.(D). Before 
proceeding with the proof of the conditions II. (G,) and II.(G_) -we show 
that the ring R, is an J,-ring if and only if R is such. Indeed if R, is an 
I,-ring, one readily verifies that R is also an Jy-ring. Let R be an J,-ring 
and let J be a right ideal in R,. Since R cannot have a left annihilator we 
have JR, 0. Hence, J(cuh) ~0 for some 7. We can also see that the 
coefficients of c;; of the matrices of J(cf) constitute a non zero right ideal 
in R. This ideal contains an idempotent e and one readily observes that 
the matrix of Jc; which is of the form ec; + -- - is an idempotent of the 


ideal J, q.e.d. We turn now to the proof of II. (G,). Let R be an FI,-ring. 


® This proof holds only for the properties FJ, and FI, and it is not known whether 
it is valid for FI. 
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Let Q be an ideal in Ry. Since R,QR, = 7,, where T is an ideal in R we 
have R,/R,QR, = (R/T)n. The latter is an Jo-ring since R/T is such, 
hence R,,/P?,QR, and therefore, also R,/Q is an Io-ring which proves that R, 
is an FJ,-ring. To prove II. (G.) let R be an FJo-semi simple ring. If R, 
is not FJ,-semi simple then it contains an ideal Q which is an FJ)-ring. Now 
Q/R,QR, can be an [o-ring only if = R,QYR,. Let T be an ideal in such 
that R,QR, = T,; then for every ideal P in T we have Q/P, = (T/P)n. 
Since Q is an FJ,-ring, T7/P must be an J)-ring. This proves that 7 is an 
FI,-ideal in R hence T =0. Therefore Q = 0 which proves II. (G2). 

The properties FJ (FIs, FI, and FJ,) are symmetric, in the sense that 
if a ring # is J-ring then also every non nil left ideal in R contains idem- 
potency ([10]), and conversely. 

We conclude with the remark that the following relation exists between 
these radicals: FI(R) FIs(R) FI, for every ring and 
all these radicals contain the maximal regular ideal of the ring A which was 
defined in [5]. The last follows from the fact that a regular ring is also 


an FJ -ring. 
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PAIRS OF QUADRATIC EQUATIONS IN A FINITE FIELD.* 
By L. CaRuitz. 


1. Introduction. Let Q(x) = Q(a,°-+,2,) denote a quadratic form 
with coefficients in the finite field GF'(q), g odd. If 80 is the discriminant 
of Q, then it is well-known that the number of solution of the equation 
Q(&,°°°,&) =a (&e¢GF(q)) is given by 


(1. 1) + 1)%a) (r= 2s + 1); 
(1.2) (r= 28), 
where =0, + 1, —1 according as square or non-square of 
GF(q), and =q—1, k(a) =—1 for «0. 


If Q, and Qs denote two arbitrary quadratic forms then the question of 
the number of solutions of the system Qe(é1,: &) =B 
seems to be much more difficult. In the present paper we obtain a number 
of results concerning such systems. In particular we consider the following 
problem. Let - -,@Q, denote r nonsingular quadratic forms with coeffi- 
cients in GF(q) in nonoverlapping sets of unknowns; let fi,- --, 8, be 
distinct and a, 8 arbitrary numbers of GF'(q). Let N = N(a, B) denote the 
number of solutions of the system 


whre &® stands for (£,,- - +, &,“). Thus the total number of unknowns is 
(1. 4) 
The case r= 1 is trivial. Also the case r= 2 may be reduced to 

(Bs = —B, (Bi — = — 
so that the number of solutions of the system may be found by means of 
(1.1) and (1.2). We may accordingly suppose r= 3. 


Fairly simple results for N(a,8) are obtained if certain additional 
hypotheses are introduced. For example if all the s; are even then 


(1.5) 8) 
= + 1) — E(B — afi) 
+ + + #(8)}, 


* Received April 13, 1953. 


19 
n 4 
8, 

137 


138 L. CARLITZ. 


where 8 is the discriminant of Q; and s—[J[]¥y(&), while &(«) has the 
same meaning as in (1.2). For the general case see Theorems 3-6 below. 
We find that in certain cases the number of solutions is expressed in terms 
of the sum 


(1. 6) y((A— *(A—8B:)), Ae GF(q), 


where it is assumed that s,,- - -, 5; are odd while the remaining s; are even. 
Now in some cases the sum (1.6) can be evaluated explicitly. For example 
Jacobsthal [5] has discussed the sum }y(A(A?—1)) in the case q=—p. 
It is easy to extend his result to GF'(q) and thus get simple explicit formulas 
for N(a,) in certain cases (see Theorem 7 and 8 below). It may be 
remarked that (1.6) can be evaluated in some additional cases. For a 
systematic discussion of Jacobsthal sums see Whiteman [8], [9]; we shall 
however not apply these results in the present paper. 

In the next place we discuss a conjecture of André Weil [7] for the 
homogeneous system (1.3) with a=—=8B=—0. Let N™ denote the number 
of solutions in the enlarged field GF(q”); let N,», denote the number of 
“rational points,” where (¢q—1)Nmn=—N™—1, then we verify that 
> Nnu" = dlog Z(u)/du, (m21), where Z(u) is a certain rational 
function. 

Finally we discuss briefly the case g = 2". We now suppose 


(1. 7) Q(u) (a; GF'(2") ) 


and use some results of Dickson [3], §199, on normal forms for (1.7). 
In particular if Q(w) is of even rank then it possesses a certain invariant 6. 
We find that if each Q; is of even rank s; then the number of solutions of 
(1.3) satisfies 


(1.8) B) — + — 1) (8 — af) 
+ k(a)k(B) + +&(6)}, 


where = 1, 1(8) =—1 for 840, note that (1.8) and 
(1.5) are very similar. If some of the s; are odd the final results are some- 
what simpler (see (9.19), (9. 20)). 


2. Preliminaries. If qg=—p", put 


(2.1) e(a) == e?mit(o)/p, t(a) =atart---+ ge 


Then it is familiar that 
(2. 2) 2 ¢(a8) =q («=0), > =0 (40), 
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where the summation is over all Be GF(q). It follows from (2.2) that the 
function k(«) occurring in (1. 2) satisfies 


(2. 3) k(a) = e(aB). 
We next define the Gauss sum [1], § 3, 


(2.4) G(a) (%540); so that (2.5) G(«) =y(«)G(1), 


where y(a) has the same meaning as in (1.1) and (1.2); also 
(2.6) G1) =¥(—1)49; (2. 7) 2 ¥(B)e (a8) = G(a), 


valid for all « (including «—0). If Q(&,---,&) is a quadratic form 
we put 


(2.8) G(Q)= e(Q(E,° hence (2.9) (60), 


if § is the discriminant of Q. Making use of (2.2) it is evident that the 
number of solutions of (1.3) satisfies 


where A, » independently range over the numbers of GF'(q) and S¢ indicates 
a summation over the s numbers 9 (j =1,-- -,s;). Using (2.9) we may 
rewrite the right member of (2.10) as 


(2.11) e(—Aa + wf) — 


In order to simplify this expression we break the sum into several parts. 
In the first place the term 4 contributes g*. Next if for 
some fixed 1 and » +0 we get 


e( (8 — TT’ (Bi — 
= ¥( (8 —B))*)), 


where §; is the discriminant of Q; and the prime refers to the exclusion of 
j=t. Thus the total contribution of these terms is 
r 


(2. 12) (8 — 


4=1 


| 
r 
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where 71. The remaining terms in (2.11) give 


(2.14) G*(1) + IL ¥((A— where 27 


Substituting from (2.12) and (2.14) it is clear that (2.10) becomes 


+ G*(1) + #8) IT 5 


the summation may now be extended over all A, » since the additional terms 
vanish. Further simplification of (2.15) depends on the introduction of 
certain additional hypotheses concerning a, 8 or the Qj. 


3. The case s;even. If all the s; are even then, first of all, (2.13) 
becomes 


j= 
As for (2.15) it evidently reduces to 


+ 7rG8(1) e(—Aa+ 


where 7, 7; are defined by (3.1) and the last sum is over all X, » such that 
A ~ wf; for any 7. This sum evidently is 


(F(a) +1) (K(B) +1) — —1, 


where we have used (2.3). Thus (3.2) becomes 


+ 749(1) (k(a)k(B) + k(a) +&(B)). 
Finally by means of (2.6) we can eliminate G(1) and (3.3) takes on the 
following explicit form. 


q?N(a, B) = + rH((— 1)**)q** {y((— 1)#*3;) — 1}&(8 — a Bi) 


(3. 3) 


(3. 4) 
+ + + &(B)). 


We may now state 


r r 
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THEOREM 1. /f all the s; are even then the number of solutions of the 
system (1.3) is given by (3.4), where 3; is the discriminant of Qi and 
I y (8;) 


This result may be compared with (1.2). The special case a= B=—0O 
is of some intereste. We state 


THEOREM 2. If all the s; are even and in addition «= 8 =0 then the 
number of solutions of (1.3) satisfies 


q°N(0, 0) = + ry((— 1)?*)q#*(q — 1) 
(3. 5) ‘ 
xX { (— + ¢ 


The following special case of Theorem 1 may be mentioned. The number 


of solutions of the system éy, +: é 4, Biéim +: +--+ =B 
is equal to 


gr? + g'-*(q—1) — af.) + + k(a) + k(8)). 


As a partial check of (3.4) and (3.5) we remark that for r= 2 the 
two formulas are in agreement with (1.2). 
It may also be worth noting that for «—0, B~0, (3.4) becomes 


(3.6) 8) = + (— (BA 0). 


4, The general case. If one or more of the s; is odd, the situation is 
rather different. The final results depend on whether the number of odd s; is 


itself odd or even. Let us assume the s; so numbered that s,,- - -, s; are odd, 
1Stsr, while s;.:,- - -,s, are even. In the double sum in (2.15) the 
terms for which » = 0 contribute : 

(4.1) 2 (r= IT ¥(%)). 


To evaluate this sum we use (2.3) or (2.7) according as ¢ is even or odd. 
It follows that (4.1) is equal to 


(4. 2) — k(a)r (teven) or y(—«)G@(1)r (¢ odd). 


The remaining terms in the double sum in (2.15) give (on replacing 
by An) 
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Again using (2.3) and (2.7) we find that (4.3) reduces to 
(4.4) + A)T(A) even) or Ey(—Aa + (¢ 048), 
where 
(4.5) T(x) =I Bo) 
Turning next to the sum 
(4. 6) — 
occurring in (2.15), we remark that s=¢ (mod 2), so that (4.6) reduces to 


(4.7). k(B—2aB;) (i St for odd, i > for even) or — 
(otherwise). 


Let K; denote the sum > in (2.15); it follows from (4.7) that 
i 


Ky = + api) (t odd) 


4=1 


K, = af) + af.) even), 
where by (2. 13) 
the prime indicates j 4 i and as before 


If we now collect the terms corresponding to (4.2), (4.4), (4.8) we see 
that (2.15) reduces to 


+ Ke + {k(a) + Aa even) 
(4. 11) 
+ Ke + 4) + da + odd). 


This proves 


THEOREM 3. If +, are odd while +, 8, are even then the 
number of solution of (1.3) satisfies (4.11), where Ky, T(A) are defined by 
(4.8) and (4.5), respectively, and + 1s defined by (4.10). 


If t = 0, (4.11) should coincide with (3.4). Since by (4.5) and (4. 8) 


(4. 8) 
. 
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and k(a) + k(—Aa-+ £) is if aA0 and k(a) + k(B)q if « it 

follows that the first half of (4.11) does indeed reduce to (3.4); this 

furnishes a partial check for (4.11). 


In the second place if t—=r = 2, 
Ks = — G(B — + Y((B2 — G(B — aBe). 
Also k(a) + ~ k(— ra + B)y((A— fi) (A— 2) ) 

1— w((A— Bi) (A— B2)) + ((B — (B — ) 


= qy( (8 — «B:) (8B — ) 


for « 40, while for «0 we get g—1—k(8) = qy(f?). Thus the right 
member of (4.11) becomes 


+ ghey ( (— 1) 8-8, — B) (B2 — B1) } 
{82 ( (— 1) 88, (a8, — B) (Bi — Be) },. 


in agreement with (1. 2). 


5. Some special cases. If «0 we may replace the second equation 


in (1.3) by +: (B— Qr(E) =0; in other 
words there is no loss in generality in assuming that B=0O. With this 
normalization, (4.11) becomes 


q’N(a, 0) = q* + Kila, 0) + {k(a) + Aa) T(A)}7G4(1) 
(5.1) (t even), 
q’N(a, 0) = + Ki(a, 0) + + y(—Aa@)T(A)} 
(¢ odd), 
where 7'(A) is defined by (4.5), 


t r ; 
(t odd), 
t r 
i=t+1 
‘ (¢ even), 
and 7; is defined by (4.9). The formulas (5.1), (5.2) are valid for all a. 
This proves 


THEOREM 4. +, are odd while s1,;,: -,8, are even and B =0 
then the number of solutions N(a,0) of (1.3) is determined by (5.1) and 
(5.2) together with (4.5) and (4.9). 


| 
| 
4q 
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As already pointed out if «40 then Theorem 4 involves no specializa- 
tion. However, the case « = 0 is not covered (see Theorem 6 below). 
In particular if « = 0 it is easily verified that (5.1) and (5. 2) reduce to 


(5. 3) q°N(0, 0) = g* + K,(0, 0) + (¢g—1){1 + 2 T(A)}7G8(1) 
or g°N(0, 0) = g* + K,(0, 0) 


according as ¢ is even or odd, where, in the respective cases, 


(5. 4) K,(0, 0) = — 1) 


t 
or K,(0, 0) = 1) 
(7 is defined by (4.9)). This proves 


THEOREM 5. If s,,---+,8; are odd while St.1,°°-,S, are even and 
a—=B=0, then the number of solutions N(0,0) of the system (1.3) 1 
determined by (5.3) and (5.4). : 


Returning to (4.11) let us take «0; since the case 0 is 
covered by the last theorem we may assume B~0. We find that (4. 11) 
reduces to 


qN(0, B) = + K.(0, B) + {g—1— T(A)}7G8(1)_ (¢ even) 


(5. 5) 
q?N(0, 8) = 9° + K.(0, 8) + ~ 
where 
K,(0, B) = 2 (¢ odd) 
(5. 6) 


t 


K,(0, B) — nghGen(1) even) 


4=t+ 


and 7; and r are determined by (4.9) and (4.10). We state 


THEOREM 6. If a0, B0, the number of solutions N(0, B) of the 
system (1.3) 1s determined by (5.5) and (5.6). 


When ¢ =r the results of this section simplify and it may be worth 
while stating the specialized results explicitly. In the first place we now have 


r 


Le 


ve 
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In the next place, according as ¢ is even or odd, 


t t 
4=1 4=1 
then N(a,0) is determined by (5.1) together with (5.7) and (5.8). 
If «0, B~0, then, according as ¢ is even or odd, 


t 


(5. 9) K,(0, 8) is righ or — 


4=1 = 


and N(0, 8) is determined by (5.7) and (5.9). 


6. The sum }7(A). Further simplification of (5.1) or (5. 5) depends 
upon the evaluation of the sums 


(6. 1) )TA), TLE), 


where 7'(A) is defined by (4.5). We begin with the last of the sums (6.1). 
Put : 


then 7 
(6.3) ZT) +E 


Now it is clear to begin with that 
(6. 4) J, (6.5) 
In the case t = 3 the general sum can be reduced to one of the type 
(6.6) = ZyAQ—1)(A—B)) 
with 80 or 1. Consider 
EPA) = ZY AQ—1) 
= 
= (¢—2)q— — — 1)) = — 2q—1, 
by (6..5). Since f(0) = = 
and similarly ——4(— 1), it follows that 
(6. 7) = — 2q—3. 
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In the next place, generalizing [5] slightly, we put 


(6. 8) $(8) = + B)) (BA0); 
then 

(6. 9) $(aB) = Y(a)$(8) (aA 0), 
and therefore 

(6. 10) 1) (61) + 8°). 

where y is any number of GF(q) such that y(y) =—1. On the other hand 
exactly as in the proof of (6.7) we have 

(6.11) 


Comparison with (6.10) leads at once to 


(6. 12) ¢°(1) + =2(1+4(—1))¢. 

It is evident from (6.12) that 

(6. 13) $(8) =0 (q =3(mod 4)) ; 
(6. 14) (36(1))? + (q=1(mod 4)). 


It is clear from (6.8) that (8) is even for all 8; also since 


~ (1— y(a)) 1 — —wWa—1)) = ¢—3 — 1) = 0 
(mod 8), 


it follows from (6.9) that 


(mod 4). 


(6. 15) 


We may restate (6.14) and (6.15) in the following way. Put a= 4¢(1), 
b= 3¢(y), then 


(6. 16) a’? + b? a==—1 (mod 4). 


Moreover (6.8) implies 


— ( — DY — 

o(8) == > ( — q—1) 

and therefore using a known property of binomial coefficients 
1(m—1)\" 


(6. 18) 2a 6(1) = — P)s 
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where ¥4(— B) = B*@. It is clear that a is uniquely determined by means 
of (6.16) and the first of (6.18). We have thus generalized Jacobsthal’s 


theorem. 
In the next place we have the easily verified relation [4], p. 227, 


where the f’s are distinct, 7 —(8:— Bs)/(8:— Bs) and 

(6.20) p= (B:B2, BsBs) = (B:1 — Bs) — Bs) (B2 — Bs) /(B2 — Bs); 

the cross-ratio of ,, Be, Bs. Bs In particular if p = —1, then (6. 19) becomes 
Bi)  A— Bs) = Hn) — D) 


by (6.8) and (6.9). Then when fi, Bo, Bs, Bs form a harmonic range the 
sum J, (defined in (6. 2)) is expressed in terms of $(1). 


(6. 21) 


Returning to (6.1) it is apparent that each of the first two sums can 
be expressed in terms of the third. Thus 


(6.22) P(A) + (0) (240) 
and 


We also remark that by a theorem of Weil [6] | 


(6. 24) T(A) = (q>0,t=1). 
For a systematic treatment of Jacobsthal sums, see [8], [9]. 


7. Applications. In view of the results of § 6, some of the formulas 
of § 5 can be made more explicit. For example if t= 4, (6.3) becomes 


where 


We assume that the cross-ratio 


(7. 3) (B:B2, =— 1. 
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It follows from (6.21) that 
(7. 4) Js = 2a(— y(n) (q=1 (mod 4)), 


where a is determined by means of (6.16), while 


(7. 5) J,=0 (q=3 (mod 4)). 


On the other hand if ¢ 3 and 
(7. 6) 2B: = B2 + Bs, 
then we have 
Ts W((A— Bi)(A — B2)(A — Bs)) = — — 1)), 
so that 


Js = 2a(— (Bi — Bz) (q=1 (mod 4)) ; 
(7. 7) 
J;=0 (q=3 (mod 4)). 


Let us now return to (5.1). We have 
(7. 8) ~ k(— Aa)T (A) = k(a) (A) + (q—1—k(a))T(0). 


Then if ¢= 4 and (7.3) is satisfied, the sum in the right member of (7. 8) 
is evaluated by means of (7.1) and (7.4) or (7.5). If ¢=3 we may 
assume «540; then 


Since y(A)Ti(A) = ¥(A(A— (A— B2) (A—Bs)), the condition (7.3) 
reduces to 


(7. 9) (8: — Bs)/(B2 — Bs) = — Bi/Be 


(provided £:8.8; 0). Then the sum }y¥(A)7(A) is determined by means 
of (7.4) or (7.5). If Bs —0 we get 


= Br) (A— B2)) 
If tr the above results aré particularly compact. We state 
THEOREM 7. If r—=t—4 and (7.3) holds then N(a,0) satisfies 
(7.10) q?N(a,0) = q* + K,(a, 0) 
+ {k(a) (Je + 1) + (Bi B2BsBs) }rG*(1), 
where J, is determined by (7.4) or (7.5). 


in 
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If r=t=83 and (7.9) holds then N(a,0) satisfies 
(7.11) (a, 0) = 9° + Ks(4, 0) + (Js + 
but tf then 
(7. 12) (a, 0) = + Ks(a, 0) —y(— (1). 
In all cases Ky, Kz are determined by (5. 2). 


Turning next to Theorem 6 the situation is similar. We again assume 
t= and state the following 


THEOREM 8. Jf r—t—4 and (7.3) holds then 


(7.13) 8) + K4(0, 8) + 
If r=t=83 and (7.6) holds then 


(7. 14) (0, 8) = 4° + K3(0, 
Here K,4, Ks are determined by (5.2), Js, Js by (7.4), (7.5), (7.7). 


8. Conjecture of Weil. Let NV“ denote the number of solutions é 
in the extended field GF'(q”) of the system 


and let N,, denote the number of “ rational points,” so that 
(8.1) N™ =1-+ (q™—1)Nn. 


Consider first the case covered by Theorem 2 with all s; even. For brevity 
we put 


Then (3.5) gives 
Mim) 4. 1)( + + 1—1)) 
and by (8.1) this implies 
Nm = — 1)/(q™ — 1) + + 4-1 


Next following Weil [7], p. 507, we put 


(8. 3) dlog Z(u) /du. 


m=1 
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Now a straightforward computation yields 


22-3 


m=1 


+ — — (r — 1)éqg??/(1 — *u) ; 
thus (8.3) implies 


r 
(1 — éq**u) II (1 qiu) I (1 
i=0 


(8. 4) Z(u) = 


While the right member is of the form conjectured by Weil, Z(w) apparently 
does not satisfy the conjectured functional equation. However it should be 
observed that the variety defined by the pair of quadratics has singular points 
and is accordingly not covered by Weil’s conjecture. 

In the next place we consider (5.3) with {=r. We suppose first that 
tis even. Then it follows from (5.3) and (5.4) that 

heGF(g™) 

where 7‘ (A) indicates that the T-function is defined over GF(qg™). Using 
(8.1) we get 


(8.5) Nm = —1)/(qm™—1) + (14+ T (A) }o™G2™2(1) 


(s = 2z). 
Now by (2. 6) 
(8. 6) y"((— 1)*8; 5,)q™* == é"q™. 
Also Davenport has proved [2], that 
(8. 7) > T™ (Xd) +- 
\eGF(g™) 


where the w’s are the zeros of a certain L-function. Hence using (8.6) and 
(8.7); (8.5) becomes 


-> gi/(1 g‘u) + éq??/(1 éq*?u) 


22-1 


— 
Therefore by (8.3) we get 


(8.8) Z(u) = (1 /((1 — TT (1 


which is of the conjectured form. 


ty 


W 
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Finally in the case tr, ¢ odd, it follows from (5.3) and (5.4) that 


t 
(8. 9) — (s-*) + (q” > y( 1) 
Now put s= 22+ 1, = 24+1, &=y((—1)**)x; then (8.9) implies 


t 
Nun (quer) 1)/(q™ 1) + 


22-2 t 


m=1 


Hence by (8. 3) 
22-2 


which again is of the conjectured form. 


9. The case q=— 2". The typical quadratic form is now taken to be 
(9. 1) Q(u) = (a;e GF(2")). 
1SiSjsr 
Assume that Q is not equivalent to a form in fewer than r indeterminates. 
Then it is known ([3], § 199) that Q can be reduced to 


or to 
where § = 0 or is such that the form u? + wv + Br? is irreducible over GF(q). 
We may call 8 the invariant and r the rank of Q. 

We next define 


exactly as in (2.8). Since 
(9. 5) = > 0, 


it follows at once that, according as r= 2s + 1 or r= 2s. 
(9.6) G(Q)=0 or (9.7) = + & + 


where 8 has the same meaning as in (9.3). When 60, the sum on the 
right is evaluated by the first of (9.5). When 60, it is readily seen that 
the equation 


(9. 8) 24 + (a 0) 
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has precisely g + 1 solutions. Indeed it is clear that the number of solutions 
of (9.8) is equal to the number of solutions of € + &) + 8)? = af? («4B ~0). 
But since every non-zero number of GF(2") can be expressed in the form 
a8. the assertion about (9.8) follows at once. In other words we have 


(9. 9) N(& + & + 89? =a) =k(a) $2 (80), 
where k(a) has the same meaning as in (1.2). Thus (9.7) becomes 
(9.10) G(Q)—=—q* (80), G(Q)—q? (8=0) (r= 2s). 


It now is easy to show that the number of solutions of Q(&,---,&-) = 
is given by 


(9.11) (r=—2s-+1), N = + (r= 2s), 


where 1(0) = 1, 1(8) =—1 for 80. The first half of (9.11) is imme- 
diate ; to prove the second half we have 


gN e(—Aa)G(AQ) +B e(— Aa) G (AQ). 


The stated result now follows on using (9.10). 
In the next place we set up a problem like (1.3). Let N(«, 8) denote 
the number of solutions of 


(9.12) Qi(EM) OEM) =, BiQiE™) + = B, 
where now the Q; are of the form (9.1). As before Q; is of rank s; so that 
the total number of unknowns is again s=s,-+----+s, Clearly (2.11) 
remains valid. We break up the sum as before. Thus the term A =p» = 0 
again contributes g*. Secondly if A = wf; for fixed 1 and » 0 we get 


The remaining terms of (2.11) yield 


(9.14) I G{(A— Q3} (A 


Suppose now that all s; are even and let 8; denote the invariant of Qj. 
Applying (9.10) to (9.14) we get 


(9. 15) o(—Aa + (A 5 


similarly (9.13) yields 


(9. 16) e( (8 —aBi)u) TY (j4i). 


r 
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Hence we obtain 
(a, B) — + (gil(8:) — 1)k(8 — afi) 
+ Igi#(k(a)k(B) + k(a) + &(B)), 
where 1—=[[1(8). In particular for «= (9.17) becomes 


i=1 


(9.18) q?N(0, 0) = + — +q+1—r}. 


(9.17) 


As for the general case odd, *,S, even, it is clear 
from (9.6) that (9.14) vanishes for ¢ = 1; indeed the same is true of (9. 13) 
for This evidently implies 


(9. 19) N(a,B) = (t= 2). 


In the remaining case (t = 1) we get 


(9.20) g?N(a, 8) = + — IT (¢=1). 
j= 
We may now state . 


THEOREM 9 (q =2"). Let s,,- 8, be odd, even. Then 
the number of solutions of (9.12) is given (9.17) for t=0, (9.19) for 
t= 2, (9.20) for ¢—1. 


It may be remarked that it is easy to give a direct proof of (9.19) and 
(9.20) using only (9.2) and (9.11). For to begin with the number of 
solutions of (9.12) is clearly the same as the number of solutions of the 
system 

(9. 21) 
(Bz — Bi) Q2(E) + + (Br—B1)Qr(€") =B— - 


Now the number of solutions of the second of (9.21) can be obtained readily 
from (9.11). Finally we observe that for fixed é@,- - -, €”, the number of 
solutions of the first of (9.21) is g#@-. Formulas (9.19) and (9.20) now 
follow. 
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TOPOLOGICAL PROPERTIES OF HOMOMORPHISMS BETWEEN 
BANACH ALGEBRAS.* * 


By Bertram Yoon. 


1. Introduction. Let B and B, be two (real or complex) Banach 
algebras. A number of known results [3], [6], [12], [13] give conditions on 
the algebras under which certain classes of homomorphisms of B into B, 
automatically possess topological properties. These results are useful in 
representation theory, in the uniqueness of norm problem [12] and are of 
intrinsic interest. In §§ 3, 4 we give a unified treatment of results of this 
type and related matters where we have tried to reduce the assumptions on 
the algebras to a minimum. Here, in algebras with an involution, the involu- 
tion can be either an isomorphism or an anti-isomorphism.? The basic type 
of Banach algebra studied is the p*-algebra where all that is assumed is that 
it is possible to find an involution and a non-negative sub-additive function 
|a| defined on B, vanishing only for «0 and majorized by the spectral 
radius for self-adjoint and skew elements. This is a generalization of the 
A*-algebras of Rickart [12]. § 3 is devoted to showing that all of his results 
for these algebras carry over to the p*-algebra case. In particular every p*- 
algebra has the uniqueness of norm property. 

For the bi-continuity result (§ 4) the domain algebra B is a p*-algebra 
where |x—y| is a metric making B a space of type (Ff) and where self- 
adjoint elements have real spectra. This is a generalization of B*-algebras 
and the *-algebras of Arens. Any *-isomorphism of B into a p*-algebra is 
bi-continuous. 

Every p*-algebra is semi-simple. In § 5 we pass to the study of algebras 
with non-trivial radical in the case of a complex commutative regular Banach 
algebra B. We obtain results on homomorphisms (Theorems 5.3 and 5. 4) 
which, when B is semi-simple or is a B*-algebra, in the main reduce to 
recent results of Rickart [13] and Kaplansky [6]. 


* Received November 28, 1952; substantially revised, August 24, 1953. 

1This research was supported in part by the United States Air Force under 
Contract No. AF 18(600)-685 monitored by the Office of Scientific Research. 

The author is indebted to the referee for his helpful suggestions. 

2 This is possible, in part, because the present treatment unlike its predecessors 
does not use the combination va* if # is not normal. Also note that v”* = 0 is possible 
without «= 0. 
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This paper is a continuation and extension of the work of Rickart and 
Kaplansky [6], [12], [13]; our scientific debt to these authors is cheerfully 
acknowledged. 


2. Notation. Let B be a real Banach algebra. For the notion of spec- 
trum see [6], [12]. a(x) denotes the spectrum of «eB which is defined 
to be the usual complex algebra spectrum of (2,0) in the complexification 
of B. For our purposes we may omit 0 from «(z). If F is a subalgebra of 
B we use a(x| #) for the spectrum of x as an element of EF. p(x) is the 
spectral radius of x (~sup|A|,Aea(x)). If B admits complex scalars 
then a(x) can differ from the usual spectrum o(z) of x Some of our 
hypotheses involve the spectrum as a point-set, but our results are also valid 
if B is a complex Banach algebra and the term spectrum refers instead to 
a(x) (even though the involution and homomorphisms need only be real 
linear). Essentially the same arguments prevail. By an involution on B 
is meant a mapping «— x* which is additive, real-homogeneous, of period 
two and either (ry)* = y*a* for every z,yeB or (xy)* = a*y* for every 
x,y. Let H(K) be the set of self-adjoint (skew) elements of B. Then, as 
usual, B is the direct sum H @ K. B is a p*-algebra if there exists an 
involution and a real function |x| on B, |x| 20, |x| —0 if and only if 
and Any such func- 
tion | z| is called an auziliary function for B. If |x| can be chosen so that 
| «—y | is a metric in which B is a complete space of type (F); ([2], p. 35), 
B is a p,*-algebra. B is Hermitian real if the spectrum of every z ¢ H is real 
and C-spectral if, in addition, the spectrum of every ze K is pure-imaginary.° 


3. @*-algebras. By an A*-algebra [12] is meant a Banach algebra in 
which there is an (anti-isomorphic) involution —>2* and a norm | 2 | in 
which it is a normed algebra and a number k > 0 such that | ¢ |? Sk | x*z | 
for allz. By the arguments of [12], p. 626, every A*-algebra is a p*-algebra. 
The converse is an open question but appears to us to be highly unlikely, 
there being no evidence in its favor. Note also that the definition of a p*- 
algebra permits involutions not allowed in the A*-case. A difficulty in 
trying to exhibit a p*-algebra which is not an A*-algebra is that somehow 
one must examine all possible anti-isomorphic involutions and with each all 
possible normed algebra norms. We feel that the notion of a p*-algebra is 


* The C-spectral property does not follow from the Hermitian-real property for real 
Banach algebras. Consider, for example, the algebra of all real continuous functions on 
[0,1] where #*(t) = a(1—t). 


es 
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simpler and more natural than that of an A*-algebra. In any case we shall 
develope the theory of p*-algebras up to the point reached by A*-algebras. 


3.1. Lemma.* Let B be a Banach algebra with an involution. Then 
the following statements are equivalent. 


(1) The involution is continuous. 
(2) H and K are closed. 


(3) Suppose that forx—u+v where weH, ve K we define a norm 
+e). In this norm B is a Banach algebra. 


Assume (1). Let 7 be the involution and J the identity on B. Since 
I+ T is continuous, (2) follows. 

To show that the norm || z ||, defined in (3) makes B a normed algebra 
it is sufficient to verify that | zy | Let c—=u+n, 
y = Us + v2 where each u,zeH, K. Then 


LY = + + + Ville) = 
say, and 
W* = Ugly + V2, — (Voli + 


The decomposition of w according to B=H @ K gives 
w= (w+ w*) + 3(w—w*) 


and 


Now assume (2). Let || and write %—=Un + with 
Une H, ne K. Then || wm — Un || > 0 and || vm— vn || > 0. Consider u, ve B 
where U,—>U, Vn > Thenu,—reH 
and || u,—r—s||~>0. By (2), s=0 and thus we H. Similarly, ve K. 
Hence for = u-+ || 2 ||, 0 and B is complete in the norm of (3). 

Assume (3). Since || x], || z ||, by [2], p. 41, the two norms give 


the same topology. Since || 2* ||, = || 2 ||,, the involution is continuous in 
the original norm. 


3.2. Lemma. Let B be a Banach algebra with an involution. If 
H(K) is closed-then, for yin HN K’ (KN QB’), 


Clearly that H(K) be closed is equivalent to KM H’ (HO K’) being 
the zero element. Let H(K) be closed, t,eK(H), try, ye H(K). 


‘The modification of the proof to cover the case where the involution is an iso- 
morphism is clear. 
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Let ve K. Then z,=v2, + aveH(K) andz,->vy+yveK(H). Thus, 
by the closure assumption, vy+yv=0. Then vy+ yv—0 also for all 
ve K’ and we obtain y? = 0. 


3.3. Lemma. Let B be a Banach algebra with an involution where 
there is a non-negative sub-additive function | x | defined on H, vanishing only 
for such that |x| S p(x), ce H. Then y?=0 for each ye KN H’. 


Suppose that z, eH, y, ye K. Now we may suppose that | 
= for we could use (|x| -+|—2|)/2. We have 
(1) (2, yeH). 


Let {wn} be a Cauchy sequence in H. By (1), {| wa |} is a Cauchy sequence 
of real numbers. Define f(w) on the closure of H by the rule that if 
w,eH, then f(w) =lim|w,|. If alco w=limy, y,€ H, 
then, by (1), lim|w,|—lim|y,| so that f(w) is well-defined. Clearly 
f(w) is sub-additive on its domain of definition. A theorem of Hille ([5], 
Theorem 22.9.1) asserts that if G is any open set in the complex plane 
containing a(w) and 0, then there exists « >0 such that a(w,) C @ for 
all n where || w,—w || <« (see also [10]). From this we obtain 


(2) f(w) =lim | w» | Slim inf p(wn) p(w) w 
(3) f(w) p(w) =p(w*) | w* 


Since H is closed under the operation of squaring, so is its closure and thus 
(y + 2»)* are in the domain of definition of f. Also y?eH. By (2) and (3), 


OS | /2 | (y + + f(y 
< [| 30. 


Also since || — 2," || then |(y?—2,?)/2|—0. Since 


|(y? + an?) /2 | + | (y? — an?) /2 | 


we have | y?| 0 and thus y* = 0. 


| 


3.4. THEorEM. Let B be as in Lemma 3.3.. If =0 implies  =0 
for re (HM K’) U|K 1B’) then the involution in B is continuous. The 
involution in a p*-algebra is continuous. 


By Lemma 3.3 and hypothesis, K  H’ reduces to 0 and Hi is closed. 
With the aid of Lemma 3. 2, our hypotheses now show also that H N K’ =0 
and K is closed. By Lemma 3. 1, the involution in B is continuous. 
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If B is a p*-algebra, then xe HU K, 2? =0 implies that p(x) =|2| 
=0 andz—0. 


3.5. Lemma. Let A be a real algebra with an involution where no 
element x in HU K, x40 is quasi-nilpotent. Then A 1s semi-simple. 


By ([{5], Lemma 22. 15.1), either A = R, the radical of A, or R is the 
intersection of the maximal regular left (right) ideals of A. Consider the 
latter case. If the involution x—>z* is an anti-isomorphism then it takes 
a maximal regular left (right) ideal of A onto a maximal regular right 
(left) ideal of A and each maximal regular left (right) ideal is the image of 
such a right (left) ideal under the involution. Then in all situations the 
involution takes R onto R. Hence for any ye R, ya y*eR. Since these 
are quasi-nilpotent, our hypotheses give y = 0. 

Clearly any *-subalgebra (closed or not) of a p*-algebra satisfies the 
hypotheses of this lemma. 


3.6. Lemma. Let B be a p*-algebra and let B, denote a Banach algebra 
which is a *-subalgebra of B. Then there exists a constant C such that 
for xe B,, where || are the norms in B, B, 
respectively. 


Since p(x | Bi) = p(x |B) for in B, it is clear that B, is a p*-algebra 
with the same auxiliary function || as B. By Lemma 3.2 the involution 
is continuous in each of B and B;,. Let ce B, r—u+r, ued, vek, 
Then +p(v) Since, by Lemma 
3.1, + ||) is a complete norm we obtain for some >0, 
|x|], ce B. Similarly for some c.>0, | ve Bi. 


Now || is a sub-additive function on B vanishing only for c—0. As in 
the proof of [12], Theorem 5.4, we obtain the conclusion. 


3.7. THEOREM. Every p*-algebra has the uniqueness of norm property. 
Let T be a homomorphism of a Banach algebra B into a p*-algebra B, where 
T(B) =[T(B)]*. Then T is continuous. 


With the above results in hand, the proof is the same as that used by 
Rickart ([12], p. 627). 

In this connection we note that the uniqueness of norm property for a 
Banach algebra B implies that every possible involution on B is continuous. 
This follows by an argument of Rickart ([11], Theorem 3. 1). 


3.8. Corottary. Let T be a homomorphism of a Banach algebra B 
mto a Banach algebra B, where there exists an auxiliary function | x| with 
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|x| Sp(x), ce B,. Then T is continuous. In particular, this holds if B, 
is a BQ*-algebra. 


The continuity of 7 follows directly from Theorem 3.7 since it is 
permissable to use the identity as the involution in B, even in the non- 
commutative case. If B, is a BQ*-algebra [1] then results of Arens shows 
that p(x) =||z||, ce B,. This corollary is an extension of [12], Cor. 6, 
which gives the continuity of ZT where B, is a semi-simple commutative 
Banach algebra. 


4. Isomorphisms and @.*-algebras. This section deals with the prop- 
erties of an algebraic isomorphism 7 defined on a Hermitian-real p,*-algebra. 
Conditions are given which establish the continuity of J and the bi-con- 
tinuity of T. 

Consider the notion of C-symmetry [6]. Since this refers to the 
behavior of self-adjoint commutative subalgebras the notion makes sense in 
our extended meaning of the term involution. 


4.1. THEOREM. 


(1) <A Banach algebra B is a p,*-algebra if and only if there exists an 
involution and k > 0 where Skp(x), ve HU K. 


(2) The class of all commutative p,*-algebras coincides with the class 
of all commutative Banach algebras for which p(x) 1s a complete norm. 


(3) If Bis a C-spectral p,*-algebra then B is C-symmetric. 


(4) If B of (3) admits complex scalars and is commutative then it ts 
a B*-algebra under an equivalent norm. 


(1). Let B be a p,*-algebra with auxiliary function |v|. By Lemma 
3.1 and Theorem 3.4 there exists M >0 such that = ceB. 
Hence, by [2], p. 41, if |2,|—0 then || || Suppose that the 
inequality in (1) is invalid for each & >0. Then there exists a sequence 
{tn}, me HU K, =1, 1 > np(an). Thus | |S <n and 
|z,|—>0. This is a contradiction. 


(2). Let B be a commutative p,*-algebra. By commutativity and (1), 
SI (@+2*)/2 | + (e—2*)/2 | 
<k[p(e + 2*) + p(a—a*)]/2 S <2 | 


If B has p(x) as a complete norm then clearly B is a p,*-algebra in 
every involution including the identity. 


| 
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(3). Let # be any closed self-adjoint commutative subalgebra. Then 
E is contained in a maximal self-adjoint commutative subalgebra A which is 
closed. C-symmetry for / follows if we show it for A. Let A, be the 
complexification of A ([6], [12]) and M the locally compact space of 
maximal regular ideals of A,. A can be considered as imbedded in A;. Now 
for xe A, a(x | Ai) A) If is self-adjoint (skew) in 
A, its spectrum there is real (pure imaginary). If e—u-+v,ueH, ve kK, 
then, under the Gelfand representation of 4,, = u(M) — = 
MeM. Since + yy*) = p(xa*) = p(x?) = p(x) p(a*), for 2, ye A, 
A is C-symmetric by [6], Theorem 4. 6, and (2). 

(4). By (8) and [6], Theorem 4. 5, the involution is conjugate-linear. 
(4) follows from the formulas of (3). 


4.2. Turorem. Let B be a Hermitian-real Banach algebra where each 
closed subalgebra E generated by a self-adjoint element is a Banach space in 
the spectral radius norm p(y|H). Let T be an algebraic isomorphism of B 
into a Banach algebra. Then p(T (x)) =p(r). ce HU K. 


By Theorem 4.1 the hypothesis on # is automatically satisfied if B is 
a p-*-algebra. 

Let Fy be the smallest subalgebra generated by the self-adjoint element u 
and let # be its closure. For we Ey the reality of «(2|B) implies that 
FE) =a(x|B) (see the arguments of [6], Theorem 4.2). However 


a(x 


for any xe L, a(x | EF) is real (as can be seen from the Gelfand representation 
of the complexification of and again Thus 
p(x | =p(x), By hypothesis there exists & > 0° such that 
|v || Skp(x) Sk on The reality of the spectra implies that if M 
is any maximal regular ideal in HL, H/M is isomorphic to the real number 
system. The Gelfand representation «—>2x(M) takes EF into C,(M), the 
algebra of all real continuous functions on 9¢ which vanish at infinity, where 
Mt is the space of maximal regular ideals of H. By the Stone-Weierstrass 
theorem, F' is isomorphic, as a Banach algebra, to C(t). Therefore F is 
semi-simple and regular and, by [13], Theorem 1, Cor. 1, p(w) =p(T(u)). 
Let ve K. Then v?e¢H and 


[o(v) = p(v*) =p(T(v)) = p([T(v)]*) = [p(T (w) ) 


4.3. TuroreM. Let B be a Hermitian-real p,*-algebra. Let T be an 
algebraic isomorphism of B into a Banach algebra where for some c> 0, 
|. T(a*)|| Sel T(x), ce B. Then is continuous on T(B). 
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By Theorem 4. 1 there exists k > 0 such that k || z || Sp(v), ce HU 
Theorem 4. 2 gives 


(a) | T(z) || zl, 


Let B, = 7(B). In T(B) we define the operation [T(2) |’ = T(a*). 
Clearly this provides a continuous involution on 7(B). This involution may 
be extended by continuity to a continuous mapping V defined on B,. It is 
easy to verify that V is an involution on B;. Let || y |], be the equivalent 
norm for B, given by Lemma 3.1. By (a), forz=u-+v, weH, ve K, 


For the case where B is a complex B*-algebra this result is due to 
Rickart ([13], Theorem 3, Cor.). 

The following example shows that the Hermitian-real hypothesis cannot 
be dropped entirely even in the commutative case. Let B be the Banach 
algebra of all complex-valued functions holomorphic for | z| <1 and con- 
tinuous on | z | <1 in the complex plane with || f || = sup | f(z)|, |z| 1. 
Let E be the set | z| = 1/2 and B, be the Banach algebra of all continuous 
complex-valued functions on #. Let the identity be the involution on B and 
B, and let T be the *-isomorphism which sends f(z) ¢ B into f(z) e Bi, where 
z is restricted to H. T is obviously continuous and all the hypotheses for the 
continuity of 7-* are satisfied except the Hermitian-real requirement for B. 
But if we take f,(z) =z" we see that || fn || = 1 while || T(f,)|| 0. 

The next result is the analogue of a well-known theorem on B*-algebras 


(see [6], Theorem 6.4). In our context we must be content with bi-continuity 
instead of isometry. 


4.4. Corotiary. Let B be a Hermitian-real p,*-algebra and B, be a 
p*-algebra. Let T be an isomorphism of B into By, T(B) =[T(B) ]*. 
Then T is bi-continuous if and only if there exists M>0O such that 
| T(*)| SM | T(2)|,ceB. If T ts a *-isomorphism, T is bi-continuous. 


The continuity of 7 is given by Theorem 3.7. If the inequality holds 
we have bi-continuity by Theorem 4. 3, if T is bi-continuous then the inequality 
follows from Theorem 3. 4. 


4.5. Coronary.’ Let T be a homomorphism of a B*-algebra B into 
a p*-algebra B, where [T(B)]* =T(B). Then T(H) and T(RK) are closed 
in By. 


5 Here the involution is an anti-isomorphism. 
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By Theorem 3. 7, 7 is continuous and 7-1(0) is closed. By [6], Theorem 
7. 2, the proposition is reduced to the case where T' is one-to-one. Theorem 4. 2 
implies that 7-1 is continuous on T(H) and T(K). If || T(x*)|| SC | T(2)|, 
ze B, then T(B) is closed by Theorem 4. 3. 


4.6. Turorem.® Let B be a complex symmetric p*-algebra (p,*-algebra) 


| with an identity. Then there exists a C*-algebra B, and a continuous (bi- 


continuous) *-isomorphism T of B into (onto) By. 


By [6], p. 402, B is C-spectral and the involution is continuous by 
Theorem 3.4. Hence we may, by Lemma 3.1, use an equivalent norm in 
which || 2* || = || x ||. We now use the results of Gelfand and Neumark as 
given in [8], [9]. As B is semi-simple (by 3.5) it follows from [9], p. 87, 
that B coincides with its reduced ring in the sense of [9], p. 49. That 
z*z = 0 implies c= 0 in B follows from [9], p. 87. By [9], Theorem 1, 
p. 49, there exists a norm || z ||, for B in which it is a normed algebra and 
such that its completion B’ in this norm is a C*-algebra where the involution 
in B’ is an extension of that of B. Thus the identity mapping T of B into 
B’ is an algebraic *-isomorphism. The conclusions now follow by appealing 
to [9], p. 33 and Corollary 4. 4. 


5. On regular Banach algebras. For the remainder of this paper we 
adopt the following notation. B(B,) is a complex commutative Banach 
algebra with radical R(R,) and whose space of maximal regular ideals is 
MM). Let Mi’ (M’) be the one-point compactification of M(M) by adjoining 
B(B,). Wet T be an algebraic (complex-homogeneous) homomorphism of B 
into B,. Each N’e 9’ determines by the following rule a unique element 
T*(N’) of where 
(1) af T*(N’)| = T(z) (N’), B, 


(see [3], p. 202; [7], p. 76). If T(B) C N’ then T*(N’) —B, otherwise 
T*(N’) is the unique member M’ of MY such that T(M’) CN’. T* is a 
continuous mapping of J’ into MY’ in the Gelfand topologies for 2’ and Mv’ 
so that T*(’) is closed in Mt’. In the special case where 7'(B) is dense in 
B,, T* is a homeomorphism of 9(JM’), onto a closed set in Nt(M’). 
Following the usage in [7] and elsewhere, for a set § in M (or M’) 
we define the kernel of %, k(%) =O M, Me §¥ and, for an ideal J in B, the 
hull of in M(M’) is h(T) = {Me M(M’ ce M’)| M(M’) DT}. If T(B) is 
dense in B, and if Mt and M are both given the Stone topology (hull-kernel 


* Here the involution is a conjugate-linear anti-isomorphism. 
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topology), then it is easy to see that 7* is again continuous on M. B is 
called regular ({14]; [7], p. 82) if the Stone and Gelfand topologies for 
agree. 

5.1. Lemma. Jf B is regular, then B, —T(B) is also regular. 

We use (S) or (G@) when it is desirable to specify which of the Stone or 
Gelfand topologies is used. By the above, the mappings 7* : 2(S) > T*(2)(S8) 
= T*(N)(G) and T**: T*(M)(G) > N(G) are continuous. Hence the inden- 
tity mapping 7: N(S) > N(G@) is continuous. Since the Gelfand topology is 
stronger than the Stone topology, J is bi-continuous and B, is regular. 

We next give some elementary properties of 7*. 


5.2. LEMMA. 
(i) &(T*(M)) 
(ii) implies R= T*(R,). 
(iii) Jf Bis regular the converse holds. 
(iv) Let 2’, be the compactified space of maximal regular ideals of 
T(B). If T(B) is regular then T*(N’,) = T*(N). 


(i). Note that e(.V/’) = 0 for all W’ e T*(MN) if and only if T(x) (N’) 
= 0 by formula (1). This implies (i) and (ii). 

(iii). Suppose that T*(2’) A WM’. Then by regularity and the closure 
of T*(M’) there exists Myo eM, Mo T*(MN) and we B such.that =1 
and 2(7T*(2)) =0. Then ce T*(R,) which is a contradiction. 

(iv). Let J be the identity mapping of T(B) into B,. Since R, N T(B) 
is the radical of T(B), then [*(2’) by (iii). Clearly both T*(2’) 
and T*(2’,) contain B. Suppose that T*(N’,) = MW’ AB, Let 
I*(N’) =N’,. Then T(M’) C N’ and T(B) CN’ (as otherwise T(B) C N’ 
and [*(N’) =T(B)). Thus T*(N’) = M’. Suppose T*(N’) = M’ +B, 
N’e RN. Then T(M’) CN’NT(B). But T(B) AT(B) and is a 


maximal regular ideal of 7(B). 


5.3. THeoreM. Let B be regular. Then the following formulas hold 
for T*(N’). 

(a) T*(’) —h(T>(R,)). 

(b) —h(T>[T(B)]). 

As previously noted, T*(9’) is closed in MV’. Thus 7T*(M’) with the 
point B removed is closed in Mt, hence in the Stone topology of M by regu- 
larity. Then, by Lemma 5. 2, =hkT* (MN) =h[T(R,) ]. 
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8 The proof for formula (b) is largely modeled after [13], Theorem 1. 
r For real Banach algebras Rickart’s theorem gives (b) if B is semi-simple 


and T is 1-1. Set J—T=[T(R)] and 3 =—h(J). Incidentally J is an 
ideal in B. Since T(R) C R, and R, is closed, T-*(R,) D J and then, by 
(a), & D T*(MN’). Suppose that there exists 3, Mo¢T*(MN’). Then 
, M,~B and, since T*(’) is compact in M’, there exists an open set G 
) containing I, whose closure is disjoint with 7*(Q’) and an open set @, in 
‘f M’ such that M,e@, and G, C G. As in the reference cited, a result of 
. Silov ([14], p. 37), gives the existence of u,veB, such that u(M’) —1, 
McG, u(M’) =0, W’¢G, v(M’) =1, v(M’) =0, Mc T*(N) 
and uv(M’) = u( WM’), MWe WM’. Then? wov(M’) = v(M’) and 
where ze R. Since T(v) (N’) =0, T(v) lies in the radical of T(B). Hence 
there exists we T(B) such that T(v) = 0. Let wp = T(vn), Un By Wn 
Then 
T(u) =T(u) o[T(v) ow] = [T(ucv)]ow=[T(v+2z)] ow 
=T(v)ow+ T(z) —T(z)w =T(z) —T(z)w = lim T(z — 2v,). 


But z—zv,eR. Hence J(u) ¢ T(R) which implies that w is in every element 
of 3. However u(J/,) = 1, % which is a contradiction. 


| An example shows that the requirement that B be regular cannot be 

dropped. Consider the example utilized at the end of Theorem 4.3. In the 
notation used there B is not regular since its space of maximal ideals is not 
a Hausdorff space in the Stone topology [4]. B and B, are semi-simple and 


h(T-*(R,)) =M. However, by the fact [4] that every maximal ideal of 
B is of the form {fe B | f(z.) = 0}, it can be seen that is the set of 
maximal ideals corresponding to the complex numbers | z | S 1/2. 

We make the following definitions. Let % be any subset of Mt’ and 
C'(%) be the algebra of all bounded complex continuous functions on § with 
the usual norm. For we B let x(M’| be the function restricted 
| to %. If the set of functions 2(M’ | §), xe B is closed in C(%) we say that 
B is functionally closed with respect to %. If this holds for § = WV we call 
B functionally closed. 


5.4. THerorem. Let J = T-'(0) and let W(W,) be the canonical homo- 
morphism of B onto B/I (B/I onto B/kh(I)). Let T, be the algebraic 
isomorphism given by T=T,W. Suppose that 

(i) Bis regular and functionally closed with respect to h(1), and either 

(ii) T(R) is closed in T(B) or 

(ii)’ I is closed and T(kh(1)) is closed in T(B). 


7 The notation vuov = u + v — uv is employed. 
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Then 


(a) W,T, is continuous,® 
(b) T(B) =T(B) +R, where R, is the radical of T(B). 


Assume (i) and (ii). By (ii), R+I—=T=[T(R)] so that, by 
Theorem 5.3, T*(2’) =h(I). Let y be the mapping r—>2(M’ | 
of B into C(h(Z)) and let 7: y—> y(N’) be the Gelfand representation of B,. 
By formula (1) the correspondence V: Vy(x) = (x) is a norm-preserving 
isomorphism of y(B) onto 77(B). By (i) y(B) is a Banach algebra. 
Then z7'(B) is closed in 7(B,). Now there is no loss of generality in taking 
B,—T(B). This choice gives 7(B) D D T(B). Inasmuch as 
a is continuous it follows that B, =a '*x7T(B) from which (b) is evident. 

The kernel of y is kh(I). Hence, ([2], p. 41), the Banach algebra 
B/kh(1) is equivalent to y(B). Thus there is a number c > 0 such that 


| T(x) (N’)| = | y(z)| Bell e+ 


for allze B. Since W,T?(T(xz)) =x+ kh(1), (a) follows. 

Now assume (i) and (ii)’. In view of Lemma 5.1, B/I is a regular 
Banach algebra. As a purely algebraic matter, ([7], p. 63), only maximal 
regular ideals in B which contain J do have images under W contained in 
maximal regular ideals of B/I. Let 9t, be the compactified space of maximal 
regular ideals of B/J. Then W*(9,’) =h(Z). By (i) and formula (1) 
B/I is functionally closed. The radical of B/I is Wkh(I), ((%], p. 63), 
and by (ii)’, T:(WkhI) is closed in T(B) = T,(B/1). 

Therefore with B/I, T, and 0 (the zero of B/I) playing the roles of 
B, T, and I respectively (i) and (ii) are fulfilled. The above immediately 
gives (b) and the continuity of W.7,"? where W, is the canonical homo- 
morphism of B/I onto (B/I)(kh(0)) = (B/I)/(Wkh(I)). The latter is 
' algebraically isomorphic to B/kh(Z) by standard -algebraic arguments where, 
moreover, the isomorphism preserves the quotient algebra norms. This 
yields (a). 


5.5. Corotuary. In the notation of Theorem 5.4, suppose that (i) 
holds and (ii)” I[=kh(I). Then T(B)=T(B) @R, and T," 3s 
continuous. 

From (ii)” we obtain (ii)’ immediately. Also W, is in this case the | 
identity. To complete the proof we need only show that T(B) MN R, =0. 


*B/kh(I) is given the usual quotient algebra norm which is possible since kh (I) 
is closed in B. 
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As a purely algebraic matter (ii)” gives semi-simplicity for B/I. Hence by 
Theorem 5.3 and Lemmas 5.1, 5.2,0—=T7,7(R,). Since T(B) = T:(B/I) 
the proof is complete. 

If B is a B*-algebra and T-1(0) is closed, then well-known properties of 
B give the hypotheses of Corollary 5.5. The conclusion in this case is given 
in [6]. So far as we know the conclusion that 7, is continuous is an open 
question if B is a non-commutative B*-algebra. 
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LINEAR FUNCTIONALS AND CONTENT.* 


By L. H. Loomis. 


1. Introduction. The Riesz representation theorem for linear func- 
tionals, in a general form, may be stated as follows: To every non-negative 
linear functional J on the algebra A of all continuous real-valued functions 
on a compact Hausdorff space S, there corresponds a unique finite Baire 
(or Radon) measure » such that J(f) = fd» for every fe A. 

if A and J are as above but S is replaced by a dense subset J, there is 
still a uniquely determined measure » (on J/) with respect to which J is 
integration, but now p» is not in general countably additive, but only finitely 
additive. Such a relationship was studied by Markoff [5], who took A to be 
the algebra of all bounded continuous functions on a normal space .\/. 
However, whenever J is defined like this on a uniformly closed algebra of 
bounded functions, the underlying space M can always be compactified and 
the existence of » can be inferred from the Riesz theorem. The situation is 
different if J is defined only on a proper subset of A. Compactification pro- 
cedures—in fact, the use of toplogical arguments of any kind—now appear 
less natural than certain direct methods, and it seems best to drop all 
assumptions pertaining to a topology on S and continuity of the functions 
of A. An early example of such a study is the well known paper of Daniell 
[2], which treats a non-negative linear functional whose domain is a vector 
lattice L of bounded functions on a set S, with no topological assumptions 
whatever. Daniell was interested in the extension of J to a functional 
behaving like a Lebesgue integral, and added a convergence postulate in order 
to get the Lebesgue convergence theorems. He did not try to find an asso- 
ciated measure on S. However, in a similar situation Stone [6] showed that 
the additional postulate that f M 1eL whenever fe L is sufficient to imply the 
existence of a unique completely additive measure on S with respect to which 
I is integration. In this context, the role of the Daniell convergence axiom 
is to prove that the finitely additive » is actually countably additive, and 
hence that the Riemann type integral I(f) = f{fdu is actually a Lebesgue 
integral. As far as concerns the existence of » and the relationship 
I(f) = ffdp, it would seem best to leave the convergence axiom out, as did 


* Received April 9, 1953. 
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Markoff in the paper cited above. This is in keeping with a recent tendency 
to study the relationship J(f) = ffd» in very general settings (see [1,4]). 
The present paper is written in this spirit, but from a point of view which 
does not seem to have been adopted before. 

We begin our considerations with a linear space LE of real-valued func- 
tions on a set S and a non-negative linear functional J on L. If the functions 
of L are bounded and include the constant functions, then it is reasonable 
to call J an integral even in this general situation. For, as is well known, 
a modified Hahn-Banach theorem enables us to extend J in many ways to a 
non-negative linear functional on the space of all bounded functions on 8. 
If m is the finitely additive measure defined on the subsets of S by 
m(A) =I(x4), where x4 is the characteristic function of A, then it is easy 
to see that J(f) is in fact the integral ffdm. 

Such measures are of no particular interest in their relationship to the 
original functional 7. There are many of them, and no one can be singled 
out as having an especially intimate connection with J. However, there may 
exist sets A C S on which all of the measures m agree. That is, there may 
exist sets A whose measures are uniquely determined by the original func- 
tional 7. The investigation of this uniquely determined measure p and its 
relation to J is the program of the present paper, the main problem being the 
determination of conditions under which » is adequate for the integration 
of the functions of L. 

An examination of the functional extension process mentioned above 
shows that the measure of A is uniquely determined by J if and only if for 
every positive « there exist functions g and heL such that gSy4Sh 
and I(h) —I(g) S«, in which case »(A) must be the common extreme 
lLu.b.[(g) =g. lb. [(h). Such a set will be said to have content. (It is 
difficult to resist the temptation to call A a contented set.) 

This consideration suggests a preliminary extension of the domain of J 
by the classical upper and lower integrals. For this extension it is not 
necessary that the functions of LZ be bounded or that le LZ. If an arbi- 
trary function f is bounded above by a function h in L, we define the 
upper integral /(f) as the greatest lower bound of Z(h) for all such h. 
Similarly, if there exists geL such that g =f, the lower integral I(f) is 
defined as l.u.b.7(g) for all such g. The difference 1(f) —I(f) will be 
called the deficiency of f, and denoted def (f). A function f with deficiency 
zero has only one possible value for its integral, 7(f) =I(f) =I(f), and if 
the domain of J is extended to the set R of all such f, then it is clear that R 
is a vector space and that the extended J is non-negative on R. We shall call 
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R the two-sided completion of L (with respect to 7), and we observe that the 
two-sided completion of R is again R; that is, R is two-sidedly complete. 
The definition of content can now be rephrased: A set A has content if and 
only if x4 eR, in which case »(A) is defined as I(x4). 

Let E be the family of linear combinations of characteristic functions 
of sets having content. A function f will be said to be Riemann summable 
if it belongs to the two-sided completion of E. That is, f is Riemann 
summable if, given «, there exist step functions g and h in E such that 
g=fsh and I(h) —I(g) <«. It follows that such a function f is 
bounded and belongs to R, and that J(f) is at once the least upper bound 
of all such numbers J(g) and the greatest lower bound of all such numbers 
I(h). Since g is of the form g = > cixa,, where the sets A; have content, 
and therefore [(g) = cil (xa,) = ce (Ai) = fgdp, we have I(f) = ffdp 
for a Riemann summable function in exactly the sense of the classical 
Riemann integral. 

Supposing that the functions of L (and hence R) are bounded, the 
optimum situation would be that in which the family F of sets having 
content is a Boolean algebra, or at least a Boolean ring, and is large enough 
so that all the functions of L (and hence R) are Riemann summable. 

Now it is clear that if F is a Boolean ring then E is a vector lattice, 
which in turn easily implies that if f and g are Riemann summable, then so 
are f U g and ff g, where f U g = max (f,g) andfNg—min (f,g). Thus 
if F is to be a Boolean ring and R is to be precisely the set of Riemann 
summable functions then R must be a vector lattice. Another obviously 
necessary condition is the property used by Stone, namely, that if fe R then 
fM1eR. These well known necessary conditions are not sufficient however, 
and we shall be largely concerned with the implications which they do have, 
and with the extra hypotheses which we must add to ensure the summability 
of the functions of R. 


2. Measurability and Riemann summability. We begin our systematic 
discussion with some definitions. If f = 0 the spectral family of f is the one 
parameter family of sets of the form F, = {p:f(p) >A}, A>0. Alter- 
natively, we could have used “=” in this definition. However, the other 
two one parameter families, having “<” and “=” in their definitions 
instead of “ >,” and consisting therefore of the sets complementary to those 
in the above two families, would not in general do for our purpose because 
these complementary sets may be “ too large” to have content. 

A non-negative function f is measurable if all of its spectral sets F 
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have content, except perhaps for a countable number of exceptional values 
of A. The allowance for this exceptional set seems to be the correct modifica- 
tion of the usual definition of measurability when the measure in question 
is not completely additive. A function having both signs is said to be 
measurable if its positive and negative parts are measurable. 


Lemma 1. Let L be a vector space of real-valued functions on a set 8 
and let I be a non-negative linear functional on L. If fn 1eL whenever 
feL then every fe L is measurable. 


Proof. Let 0<rA< AL << Ami be any finite sequence of 
positive numbers, and let g,, be the function [(f An) —(f Ans) ]/(An — Ans): 
Then 0S g, = 1, gn = 0 on the set where f = A,_, and gn, —1 on the set F, 


where f >A,. Thus g, is simultaneously a lower approximant for ¢),, (where 
¢, is the characteristic function of F)) and an upper approximant for ),. 


Thus def (¢),) S1(gn) —I(Gnii) and » def (¢,,) SJ(g:). We now fix 
1 


A, =a, and, given e, we choose the remaining A; from among the points d at 
which the functions ¢) have deficiencies greater than «. Then me =I(Q,) 
as above; that is, the number of points beyond a at which ¢) has deficiency 
=e is bounded above by J(g,)/e. Varying a and e we see that the set of 
positive A at which the deficiency of ¢) is positive is at most countable. The 
sets F, for 4 < 0 are accounted for by applying the above result to —f. 


Remark. This proof holds as well for the other decreasing spectral 
family, composed of the sets B, = {p:f(p) =A} whenA>0. 


Remark. Actually, it is only the two-sided completion R which we need 
to be closed under intersection with 1. While this property for R will most 
easily follow from the same property for L, we shall later on consider one 
situation in which it evolves differently. 


The hypothesis that fM1eL for every feL almost implies by itself 
that L is a vector lattice. In fact, if 1 eZ then the two conditions are equi- 
valent, as can be seen from the relations fM0=((f+1)N1)—1 and 
fNg=g9g+ ((f—g)N 0). If 1 is not in L, but f M 1 is always in L, then 
F is a Boolean ring and E is a vector lattice, for the spectral sets of x4 + xB 
include AM B uncountably often, so that, by Lemma 1, 4M B must have 
content if A and B have content. Moreover, if in this case L satisfies any of 
our later conditions for improper Riemann summability, then it requires 
very little more to imply that R itself will be a vector lattice. For instance, 
suppose that lim I(g Me) S0 for every geL and suppose also that every 
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feL is dominated by a non-negative meL. Then g [fU0O=h, where 
g=f—fneandh=(f—fne)+mne. Given 8, we choose ¢ so that 
I(mNe) <4, and then have —I(g) =I(mNe) <8, proving that 
fU0eR, and hence that R is a lattice. 

If f N 1 is always in L our discussion to date gives us a finitely additive 
measure p defined on quite a large Boolean ring of sets, but it does not yet 
follow that » has any useful relationship to J, as the following examples 
show (see [4]). 


Example 1. L is the vector lattice of continuous functions f on [0, 1] 
such that f(0) —0 and 7’(0) exists, and 7(f) =/f’(0). J is then a non- 
negative linear functional. On the other hand, a subset A of [0,1] has 
content if and only if it is bounded away from 0, its content being always 0. 


Vrample 2. L is the vector lattice of continuous functions on [0, 0 ) 
having derivatives at o (f’(0%) =limf(x)/x), and I(f) =f’(«). Here 


a set A has content if and only if it is bounded, its content being always zero. 


The failure of J to be reasonably related to » in these two cases has 
different causes. In the second ease it is due to the fact that the functions 
of L are in general unbounded, while in the first case the functions of L are 
all bounded but the constant functions are missing. However, if neither of 
these defects is present we shall see that every fe R is Riemann summable. 

We notice first that the presence of 1 in L (or in R)-has the very 
strong implication that J is a bounded linear functional in the uniform norm, 
with norm /(1) (J(f) ll.) =| f ]27(1)), and hence that R is 
uniformly closed. Also, F is now closed under complementation, for if A’ is 
the complement of A then x4, —=1—y,4. The assumption that f=0 can 
now be dropped in the definition of the spectral family: the set where f > —a 
is the complement of the set where —f =a, and hence has content for all 
except a countable number of values of a. 


LemMa 2. Let L be a vector space of real-valued functions on a set 8, 
and let I be a non-negative linear functional on L. If 1eL then every 
bounded measurable function f 1s Riemann summable. More generally, if f 
is a bounded measurable function such that the set where f ~0 is included 
in a set A having content, then f is Riemann summable. 


Proof. Adding a suitable scalar multiple of x4 to f, we can suppose 
that f= 0 and that F, — A for some positive A. Taking 6=1.u.b. f, let 
<A be a subdivision of [0,0] of norm Se such 
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that has content for every 1 and F,,—4A. Setting or, 
j= > Magi and h=> Agi, we have g,hek, gSfsh, and 
1 


proving the lemma. 


It is clear from the above proof that the hypothesis of measurability can 
be weakened to the assumption that the spectral sets Ff) have content for a 
dense set of X. 


1. Jf f is measurable, and 0<a<b then fnNb—fNa 
is Riemann summable. 


Proof. The function fMb—fNMa is bounded by b and the set where 
it is different from zero is the spectral set Fy. 


CoroLuaRy 2. Jf L is closed under intersection with 1 and f is measur- 
able and non-negative, then f Ng 1s Riemann summable for every Riemann 
summable g. 


Proof. The hypothesis that fM 1eL implies that F is a Boolean ring, 
as we saw in the discussion following Lemma 1. The result then follows 
from Lemma 2. 


Remark. The expression of ffd» as the limit of sums > A(u(F),) 
b 

—p(F),,)) is equivalent to ffdy= Ada(A), where is the monotone 
a 


decreasing function defined for all positive X (except the values of A in the 
countable exceptional set) by =yp(F)). 


As already noted, Lemmas 1 and 2 imply the following theorem. 


THEoREM 1. Jf L is a vector-lattice of real-valued functions which 
contains the constant functions, and if I is a non-negative linear functional 
on L, then the subsets of S having content form a Boolean algebra on which 
the content p is finitely additive, the class of functions which are Riemann 
summable with respect to p 1s precisely the set of bounded functions in R, 
and I(f) = ffdp in the Riemann sense, for every such f. 


Corottary. In the same situation, if f is a bounded function of R and 
if 0(x) is a monotone continuous function of the real variable x defined on a 
closed interval containing the range of f, then 6(f) eR. 


Proof. The spectral sets of @(f) are the same as those of f (with a 
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monotone change of parameter) and hence all except a countable number of 
them have content by Lemma 1. Lemma 2 then implies that 6(f) eR. 


Remark. It follows from the corollary that if feR is bounded 
then f?¢R. Hence if f and g are bounded functions of R then 
fg +9)*?— (f—g)*]/4eR, 


so that the bounded functions of R form a uniformly closed algebra. 


It now follows that if f:,- - -,f, are bounded functions of R, then so is 
*;fn), where is any continuous function of n variables. For, by 
the Weierstrass theorem, @(f1,---,fn) is a uniform limit of polynomials 
in fi," fn 


3. Improper integration. We turn now to the problem of the inte- 
gration of unbounded functions or of bounded functions when the constants 
are not present. As the two examples above suggest, the difficulties here are 
manifestations of two aspects of the same phenomenon: We cannot now 
obtain two-sided approximations by step functions, in the one case because 
the function in question is unbounded, and in the other case because the 
total content of S may be unbounded. Our theory therefore must of necessity 
be a theory of one-sided approximation, that is, of improper Riemann inte- 
gration. As usual, we take as our point of departure a vector space L of 
real-valued functions which is closed under intersection with 1, and a non- 
negative linear functional J on L. 

We say that a non-negative function f is improperly Riemann summable 
if it is measurable and if there is a constant & such that I(g) < k whenever 
g is a Riemann summable function such that g =f. The improper Riemann 
integral f fd» is then defined as the least upper bound of such integrals J(q¢) 
(= fgdp). If L* is the set of non-negative functions of L, then it follows 
from Lemma 1 and the non-negativity of J that any feL* is improperly 
Riemann summable, with f fd» =J(f), and one of our problems is the dis- 
covery of conditions which will imply the equality. We first set down a few 
rather obvious equivalent formulations of the definition of ffdp. 


Lemma 3. Jf f is non-negative and improperly summable then the 
function g occurring in the definition of f{fdu can be restricted to functions 
of any one of the following forms: 


(1) the step functions of E which are =f; 
(2) {1M bya, where A has content and b>0; 
(3) [1 dda, where oy 1s the characteristic function of the spectral set F). 
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Proof. (1) is obvious. (2) follows from the fact that f/M bya is 
Riemann summable by Lemma 2, and that any Riemann summable function 
is bounded and is zero except on a set A having content. To prove (3) we 
choose any step function he E such that OShSf. If a is the minimum 
positive value of h and } is the maximum value of h, then h S bq, and (3) 
follows from (1) and the Riemann summability of f M Dd. 


We obtain two other slightly stronger conditions from the following 
lemma. 


Lemma 4. Jf f is non-negative and improperly summable, then 
ap(Fa) as a—0 and as 


Proof. Given «, choose he E so thatO Sh Sf and ffdu—I(h) <€/2. 
If A is the set where h ~0 and a is a positive number such that ay, SA, 
then A C Fy. Since a(¢a—yxa) +h Sf, we have < €/2, and 
it is only necessary to take a so that au(A) < «/2 to get au(Fr2) <e. 


The proof is similar as If 6b=2maxh, then (6/2) Sf 
and hence (b/2)u(Fo) < «/2. 


Corotuary 1. In the definition of the improper Riemann integral of f, 
g may be restricted to functions of either of the following forms: 


(4) fnb—fna, where0<a<b; 
(5) where0<a< b. 


Proof. Since fNb—fNa=(fN bda) —adba and fN b(¢da— do) 
= (f M bd.) — bhp, the corollary follows from the lemma and (8) of 
Lemma 3. 


CoroLLARY 2. A non-negative measurable function f is improperly 


Riemann summable if and only if the integral f, Ada(A) is convergent in 
0 
the usual sense, in which case ffdqu——f Ada(r). 
0 
Proof. This follows from (5) above and the fact that I(f M b(¢a— $») ) 
f ada (A), where a(A) =p(Fh). 


Each of the conditions (1.)-(5) translates into a necessary and sufficient 
condition for the equation /(f) = ffd» to hold, and others can easily be 
found. We collect a few in the following lemmas, where, for simplicity, 
the two kinds of improper behavior are separated. 
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Lemma 5. If f is a bounded non-negative improperly summable fune- 
tion then the equality I(f) = ffdp will hold: 


(a) if and only if I(f Na) asa—>0; 

(b) if and only if for every ¢ there exists ge L* and a set A having 
content such that I(g) << and g=f on A’; 

(c) if the set of numbers {I(nf 1 1)} is bounded above; 

(d) if there exists he L* such that f/h as f > 0. 


Proof. (a) is the direct translation of the condition (4) above, since 
fb=f for suitable b. Then we can take g =f Ma and A = F;, to prove 
that (b) is necessary. Conversely, if (b) holds, and if 6 =1.u.b. f(p), then 
fM bx, = f —g, so that the sufficiency of (b) follows from (2) of Lemma 3. 
The assumption in (c) implies that I(f M 1/n) +0, and (a) can be applied. 
Finally, if h exists as in (d), then given « there exists 6 such that f < eh 
when f=8. We can then take g in (b) as ch and A as F%. 


Lemma 6. If 1eL and f is a non-negative improperly summable func- 
tion, then the equality I(f) = ffdp will hold: 


(a) and only if I(f Nb) as 
(b) «tf and only if for every « there exists g e L* and b such that I(g) <« 
and g =f on F,; 


(c) if there exists he L* such that f/h 30 as f 30. 


Proof. The proof is similar to that of the above lemma, and will be 
omitted. 


The last conditions in the above two lemmas are in a sense necessary as 
well as sufficient, but this cannot be seen until another completion process 
for L has been developed, and the discussion will be postponed until Section 5. 


4. Integrals on algebras of functions. 


THEOREM 2. If A is an algebra of bounded functions on a set 8S, if 
le A, and if I is a non-negative linear functional on A, then the two-sided 
completion R is a lattice as well as an algebra and all the conclusions of 
Theorem 1 follow. 


Proof. This follows, as in the proof of the Stone-Weierstrass theorem, 
from the fact that fUg=—(f+g+|f—g]|)/2, that y—|z] can be 
uniformly approximated by a polynomial in z? over any given finite interval, 
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and that because 1¢A the two-sided completion R includes the uniform 


closure of A. 


If the algebra A does not include the constant functions we have an 
improper integral situation as before. R cannot now be proved to be a lattice, 
but, if every fe A is dominated by a function ge A*, then we can show that . 
{1 is always in R. 


Lemma %. Let A be an algebra of bounded real-valued functions on a 
set S such that every function in A is a difference of non-negative functions 
of A, and let I be a non-negative linear functional on A. Then f 1 belongs 
to the two-sidede completion R for every f ¢ A. 


Proof. By hypothesis there exists me A such that | f| =m. Let [a, 6] 
include the range of f, and let 6 be the real-valued function on [a, b| defined 
by and =1/rifx>1. Given «, the classical Weier- 
strass theorem furnishes a polynomial P such that | P(x) —@(r)| <e« on 
[a,b]. Since fo(f) =fN1,fP(f) eA and |f| Sm, wehavegSfnish, 
where g = fP(f) — em, h = fP(f) + em, and I(h) —I(g) S 2eI(m). Letting 
e— U, we see that fN 1eR. 


Thus every f ¢ A is measurable, and we can apply our results on improper 
summability. In particular, we have in more general form the main theorem 
of Arens [1]. 


THEOREM 3. Let A be an algebra of bounded real-valued functions on 
a set S such that every fe A is a difference of non-negative functions in A. 
Let I be a non-negative linear functional on A and suppose that for every 
fe At, I(f) =l.u.b.[(fu) where u ranges over the functions of A such that 
0SuS1. Then I(f) = ffdp for every fe A. 


Proof. Given fe A*, we have to establish any one of the conditions of 
Lemma 5. But, given «, we can choose we A such that 0S u1 and 
I(f) —I(uf) <«/2. Let A be the set on which u > 8, 8 being chosen so 
that A has content and so that 82(f) < «/2, and set g =f(1+6—vu). Then 
g=f on A’ and =/1(f) —I(fu) + 81(f) <«, so that (b) of Lemma 5 
holds, q. e. d. 


The condition J(f) =1.u.b.I(fw) of Theorem 3 has been studied in 
great detail in a different context by Dieudonné [3]. 

We can also consider algebras of unbounded functions, but, if we are to 
avoid the seemingly impossible difficulties inherent in such a generalization 


12 


| 
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of the determined moment problem, we shall have to assume that A is a 
lattice. The condition (c) of Lemma 6 is automatically met here, from the 
simple fact that fe A implies f? © A, and, supposing for the moment that 
1¢A, it follows that I(f) = ffdp» for every fe A. If the constants are not 
present, then one of the conditions of Lemma 5 must of course be assumed. 


5. One-sided completion. We turn now to a deeper study of improper 
integrability, in which we consider the problem in the light of a second 
natural extension of J, this time by means of a one-sided completion process. 
The values of J for the new functions thus added are no longer uniquely 
determined in the sense of being forced by the old values, but they are 
uniquely determined in the sense of being explicitly and naturally defined. 

We start with a vector-lattice R of real-valued functions which is two- 
sidedly complete with respect to a non-negative linear functional 7. The 
non-negative part U* of the extended class U is defined as follows: f e U* if 
and only if f=0,fMgeR for every ge R, andl.u.b. ((fNg):geR} <a. 
I(f) is defined as the above least upper bound, and it is clear that U* is a 
lattice and is closed under addition and multiplication by non-negative con- 
stants. Then U is the vector space of differences f; — f. of functions in U* 
and I(f,—fs) is defined as I(f,;) —J(f-). Straightforward calculation 
shows I to be uniquely defined, linear and non-negative on U. 

Our first goal is to show that U is both one-sidedly and two-sidedly 
complete. 


Lemma 8. If f,geU* and fg then 


Proof. We have to show that (g —f) Nhe R for every he R. Given «, 
choose ke R such that I(g) —I(gNk) <« Nowgnk—fnkSg—f. 
Also, the function [g N (kh + k)] (f Nk) is equal to g—(fNk) 2=g—f 
ifgSh+k, and Thus 


hn (gNk—fNk) Shr (g—f) 


where the extreme members belong to R (by the definition of U*) and the 
difference of the extremes is bounded by gN (h+hk)—gNk._ Since 
I(gn (h+k)) —I(g Nk) <e by the definition of k, we have proved that 
h (g —f) lies in the two-sided completion of R, which is of course R itself. 
This is what we set out to prove. 


THEOREM 4. U is a vector lattice including R which is both two-sidedly 
and one-sidedely complete. 
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Proof. We have already observed that U is a vector space including R 
and that J is a non-negative linear functional on U. We continue with the 
fact that if fe U then ft —f U 0eU*, for if fg —A, where g and heU-, 
then ft =g Uh—heU* by the preceding lemma. Thus U* is indeed the 
fleUand 


Consider, now, any function f in the two-sided completion of U; we may 
as well suppose f to be non-negative. Given «, there exist by hypothesis 
functions g,h e U* such that g Sf Sh and J(h) —I(g) <«. If keR, it 
follows that gN kS=fNkShOk and that [(hNk)—I(gNk) <e. 
Since g 1k and hn k belong to R, it follows that fM & belongs to the two- 
sided completion of R, which is R itself, and hence that f belongs to U*. 
Thus U is two-sidedly complete. 

Finally, let f be any non-negative function in the one-sided completion 
of U. That is, if ge Ut then ge U* andl.u.bl(fNg) Sl<o. But 
then, if geR, fNg=—(fNg)Ng=—hNg, where heU, so that fNg 
=hMgeR and hence fe U. Thus U is one-sidedly complete. This com- 
pletes the proof of the theorem. 

Since we have a larger two-sidedly complete space than before, we must 
consider the possibility of there being new sets having content. Suppose, 
then, that y4eU*. Since y4NgeR whenever geR, and, in particular, 
whenever g = xz where B ¢ F, we see that A has the property that AN Be F 
whenever BeF. If there exists Be F such that A C B, it follows that 
A=AfBeF and A is not a new set. Thus the only new sets having 
content are sets A which are included in no sets of F, but are such that 
AN BeF for every BeF and l.u.b.p(49B) <o. 

If 1e R, so that SeF, it follows that there can be no new sets having 


non-negative part of U. Thus 


content. 

If 1¢R, but R is closed under intersection with 1, it may happen that 
lu.b. {1(f N1):feR} <o, so that 1eU*. This is clearly equivalent to I 
being bounded in the uniform norm, and implies that R and U are both 
uniformly closed. The converse also follows. 


Lemma 9. If R is uniformly closed and closed under intersection with 1, 
then I is bounded and 1eU. 


Proof. If I is not bounded there exists a sequence A, of disjoint sets 
having content such that }4(A,) = ©. Let c, be a sequence of constants 
such that c,)0 and S¢,u(An) = «. The function f = > cay,, is then in 
the uniform closure of R, since it is approximated uniformly by its partial 
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sums, but cannot belong to R (or U), since its integral is infinite. Therefore 
I is bounded. 


We denote the enlarged F by B, and suppose from now on that R is 
closed under intersection with 1, so that the functions of R and U are at 
least improperly summable (with respect to F and B respectively). We then 
have the theorem: 


TuHeEorEM 5. If the equality ffdu—TI(f) holds for every feR then 
it holds for every fe U, and U is precisely the set of improperly summable 
functions. 


Proof. The persistence of the identity ffdu—=TI(f) follows at once 
from the definitions of f fd» and U, and we have only to prove that if f is a 
non-negative function which is improperly summable, then feU*. By 
definition, f is measurable and has the property that l. u.b. {1(g)} =l<o 
for all Riemann summable functions g =f. Let h be any function of 
and, given «, choose an F-Riemann summable function g=h such that 
I(h) —I(g) <«. Since fNh—fNgSh—g, we have fNgSfnh 
= (h—g) +fNg. Nowh—geR, and is an F-Riemann summable 
function and so belongs to R. The above inequality thus shows that fNh 
belongs to the two-sided completion of R, and hence to R. The same 
inequality yields 7(fN h) SIT(h—g)+1(fNg) Se+l. ThusfNnheR 
and I(f Nh) S/1 for every he R, so that fe U* by definition. 


We can now prove, in terms of U, that the sufficient condition of Lemma 
5, (d), and Lemma 6, (c) is also necessary. 


THEOREM 6. Let U be a vector lattice of functions which is both 
two-sidedly and one-sidedly complete with respect to a non-negative linear 
functional I, and which is closed under intersection with 1. Then I(f) = f fdp 
for every feU if and only tf for every fe U* there exists ge U* such that 
f/g 20 as or ow. 


Proof. We have already seen that the existence of such a function g 
guarantees the identification of ffdy» with I(f). Conversely, if I(f) = ffdu 
for every feU then, by the above theorem, it is sufficient to exhibit 
a g with the desired properties which is improperly summable. Now 
I(f) =— dda(a), where a(A) =yp(F,). Let be any strictly 


increasing continuous functions such that 4/@(A) >0 as A—0 or o and 
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such that f "6(A)da(A) converges. The function g = 6(f) is measurable, 
0 
since it has the same spectral sets as f. If {G¢} is its spectral family and 


=a(Gs), then fgdu—— f =— f “8(A)da(a) 


that g is improperly summable by Corollary 2 of Lemma 4. Finally, the 
definition of 6 ensures that f/g—>0 as f—0 or o, and this finishes the 
proof of the theorem. 


6. We conclude with some remarks about the role of the assumption 
that fM1eL for all fe L when L is already assumed to be a vector lattice. 
In a sense, this assumption excludes irregularities which can occur when 
there aren’t enough points in S. For, supposing that the functions of L 
are all bounded, standard weak compactification arguments allow us to enlarge 
S and to extend the functions of LZ uniquely so as to become a vector lattice 
of continuous functions on a compact Hausdorff space. With S thus enlarged 
(to S, say) the hypothesis that fM 1e¢L can be greatly weakened. 

First of all we may as well assume that LZ separates the points of 8, 
since unseparated points can always be identified. Secondly, we must avoid 
the situation where L is a one dimensional space consisting of the scalar 
multiples of a single function; it is obvious that no notion of content is 
available here. Nor can we allow this situation to exist on any subset of 8S. 
That is, distinct points must be treated differently by different functions 
of L; more exactly, if p~q then there must exist f,heL such that 
f(p)h(q) Af(q)h(p). 

The interesting fact is that the above weak necessary condition actually 
implies that f M 1 belongs to the two-sided completion of L for every fe L. 
We can see this as follows. Given a non-negative function fe L, let L(f) be 
the sublattice of functions of L which are dominated by multiples of f. The 
quotients g/f, ge L(f), form a vector lattice L’ of bounded functions on the 
locally compact space S, obtained by excluding from S the closed set where 
f=0, and 1eL’. Moreover our separation hypothesis above implies that L’ 
separates points in Sy. We define J on L’ by J(g/f) =I1(g); J is clearly a 
non-negative linear functional. Recompactifying, we can suppose that L’ is a 
vector lattice of bounded continuous functions on a compact set So, separating 
the points of this space and containing the constant functions. The two- 
sided extension with respect to J includes the uniform closure of L’, and the 
Stone-Weierstrass theorem implies that the uniform closure of L’ is the 
algebra of all continuous functions on So. Going back, (f1)/f is a con- 
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tinuous function on S, which is identically 1 in a neighborhood of oo, and 
therefore remains continuous on the compactification So. It therefore belongs 
to the two-sided extension with respect to J. That is, fM1 belongs to the 
two-sided extension with respect to I, q.e. d. 


HARVARD UNIVERSITY. 
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ON A THEOREM OF BOCHER IN THE THEORY OF ORDINARY 
LINEAR DIFFERENTIAL EQUATIONS.* 


By AUREL WINTNER. 


Let A be an n by n matrix, defined as a continuous function A = A(t) 
for large positive ¢, and consider the linear differential equation 


(1) = A(t)e 


for the n-rowed vector x. If there belongs to every constant vector c a solu- 
tion vector z(t) of (1) satisfying r(t) ~c as too (that is, if the “ initial 
condition ” 2( 0) can be assigned arbitrarily), then A(t) or (1) will be said 
to be of type (*) or to have (*)-character. It is clear from the principle of 
superposition (and from the existence of n linearly independent solution 
vectors) that, if A(t) is of type (*), then there belongs to every ¢ exactly 
one x(t), and —lima(t) must exist for every solution a(t). 

For the same reasons, A(t) is of type (*) if and only if there exists a 
(unique) fundamental matrix X(t) satisfying 


where # is the n-rowed unit matrix. In fact, a fundamental matrix Y is 
defined as any n by n matrix the columns of which are linearly independent 
solution vectors x of (1); so that 


(3) X’ A(t)X 


and det X(t) #0. The latter inequality holds for every ¢ << if it holds 
for a single ¢; in which case it holds for oo also, provided that A(t) is 
of type (*). It is also clear that if A(t) is said to possess (**)-character 
when the limit X(0o) exists for some (hence for every) fundamental 
matrix X(t), then the (**)-character of A(¢) and the additional condition 
det XY (co) 0 (for some, hence for every, fundamental matrix) together are 
necessary and sufficient for the (*)-character of A(t). 


* Received June 20, 1953. 
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If f(t), where const. = t <, is a continuous (scalar, vector or matrix) 


function for which the improper integral 


7 
f fat is convergent, i. e., lim f f(t)dt exists 
e T > 


(as a finite limit), then f(t) will be called of class R. Thus f(t) e FP is 
necessary but not sufficient for f(t) e LZ, if L denotes the case p= 1 of the 
class L?, that is, of the class consisting of those (continuous) matrix functions 
f(t) all elements of which have absolutely integrable p-th powers on the half- 
line const. St 

Suppose that 


(4) A(t)eR 
holds for the coefficient matrix of (1) (i.e., that the matrix function 
(5) A°(t) = A(s)ds, where = lim 
e e T> © e 
t t 


exists, hence 
(6) A®(t) =0(1) 


as t->»o«). Then, in order that (1) has (*)-character, any of the following 
three conditions is sufficient: 


(I) A(t)eL 

(a condition under which the additional requirement (4)-(5) is superfluous) ; 
(11) A(t) e L® and A°(t) e L4 

for some p > 1 and the corresponding g = p/(p—1) >1; finally, 
(IIT) A(t) =O(1) and A°(t) e L, 


where the O-symbol refers to t 

The (*)-criterion (1) goes back to Bécher, appearing in the literature 
for over fifty years (for references, cf. [1], p. 486, footnotes 57 and 58). 
Since (I) will be used below, a simple proof of its sufficiency for the (*)- 
character of A(t) will first be given. 

It is easy to prove (see [3]) that there belongs to every solution vector 
a(t) 40 of (1) a positive number é satisfying | x(t)| > é, as t +00, when- 
ever A(t) satisfies the pair of conditions A(t) e R, w(t) e R, where A denotes 
the least, and p» the greatest, eigenvalue of the Hermitian matrix 4(4-+ A®*). 
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On the other hand, it is clear that A(t) eZ is sufficient for A(t) e L (D RB) 
and p(t)eL(D RFR). Hence A(t) e L implies that, if x(t) is any solution 
vector, then 7(t) = O(1), hence A(t)a(t) eZ. In view of (1), this means 
that 2’(t) L. In particular a’(t) e R, i.e., the limit exists. Finally, 
a(coo) ~0 unless x(t) =0, since € is 0 only in the latter case. This 
proves (I). 

(II) and (III) were proved in [2]. 

It is clear that none of the three assumptions (I), (II), (III) on A(¢) 
is contained in the other two, and that the two conditions of (II) belonging 
to distinct values of p are all distinct; so that all these sufficient criteria for 
the (*)-character of (1) are independent. It will however be shown that, 
by an adaptation of the formal device by means of which the function (5) 
was introduced in [2], it is possible to deduce from the first of the criteria 
(1)-(1II) one criterion which contains all three (and more). The criterion 
in question is condition (i) in the following assertion : 

In order that (1) be of type (*), it is sufficient that 


(i) A°(t)A(t) eL 


(where (4), i. e., the existence of the function (5), is part of the assumption). 
Since (I) is sufficient for (4)-(6), it is sufficient for (i). Similarly, 
(i) follows from (III) also. Finally, (i) follows from (II) by Hélder’s 
inequality. Thus the (*)-criterion (i) contains all of the preceding ones. 
Since the factors of the matrix product occurring in (i) are not com- 
mutable (in general), it is by no means clear that (i) can be replaced by 


(ii) A(t) A°(t) eL 


as a sufficient condition for the (*)-character of (1). That (ii) proves to be 
sufficient, will be due to the fact that if X, — X,(¢) is a fundamental matrix 
belonging to a coefficient matrix A,(t), then the inverse X,* of X, is a 
fundamental matrix Y,—X.(t) belonging to the coefficient matrix A(t), 
where A, = — X,714,X,. This fact can be verified from the rule 


(7) (F-1)’ = — 
which is valid for every differentiable matrix function F = F(t) of-non- 
vanishing determinant. 

The proof of the sufficiency of (i) proceeds as follows: Since (5) implies 
that 1 = (—A°)’, the equation (3) can be written in the form 
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by (3). Since (8) can be written in the form Y’ = A°AX, where 


(9) Y=(H+A°)X, 
it follows that Y’ = CY, where 
(10) C= 


Accordingly, (3) is equivalent to 
(11) Y’ = C(t)Y 


by virtue of the definitions (9)-(10), provided that ¢ is large enough. The 
latter proviso is needed in order that (6) assures the existence of the 
reciprocal which is the third factor in (10). 

Suppose that A (7) satisfies condition (i). Then (10) and (6) imply that 
O(t)eL. Hence, if the (*)-criterion (1) for (3) is applied to (11), it 
follows that C(t) is of type (*). Corresponding to the formulation (2) of 
the (*)-property of (3), this means that (11) has a solution Y = Y(t) 
satisfying Y(t) > FE as too. If this Y and (6) are substituted into (9), 
then (9) defines, for large ¢, an X = X(t) satisfying (2), which means that 
(3) has (*)-character. This proves the sufficiency of (i). 

In order to prove the sufficiency of (ii), put Z2—X-'. Then it is 
readily seen from (7) that (8) can be transformed into 


(8 bis) Z’ = (ZA°)’+ ZAA?®. 
This latter equation can be written in the form W’ = ZAA°, where 
(12) W=Z(E—A’*). 


But (12) and (6) show that, if ¢ is large enough, W’ = ZAA°® is equivalent to 


(13) W’ = W(E—A°)"*AA®. 

Finally, it is seen from (7) that (13) can be written in the form (11) by 
placing Y = and 

(14) C 


Suppose that A(t) satisfies condition (ii). Then, by repeating the 
argument applied above to the case (10) of (11), it follows that the case 
(14) of (11) has a solution Y satisfying Y(t) as to. 
Starting with this Y(¢) and applying the three definitions W = Y~, (12), 
X =Z- successively (the second of them in conjunction with (6)), one 
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obtains an X(t) satisfying (2). This proves that (ii) is sufficient for the 
(*)-character of A(t). 

It is clear from the preceding proofs that the arguments leading to the 
sufficiency of (i) or (ii) can be repeated so as to assume either of the 


conditions 
(i bis) B°(t)B(t) e L, (ii bis) B(t)B°(t) e L 
(along with the existence of B°(¢), i.e., with B(t)) for either of the matrices 
B=A°A, B=AA®*. In addition, these processes can be combined and 
repeated indefinitely. Thus there results an infinite scale of criteria each 
of which is sufficient for the (*)-character of A(¢). 

All of these criteria assume (4), since otherwise A°(¢) does not exist. 
It is therefore natural to ask whether the existence of A°(¢) is a necessary 
condition for the (*)-character of A(t). The answer to this question is in 
the negative: A(t) can be of type (*) when A(t) ¢R. This will be proved 
by an example of an n-rowed A(t), with n= 2. (Such an example cannot 
exist in the scalar case, n = 1, of (3), since 


t 
(15) det X(t) = const. exp f trace A(s)ds, where const. ~ 0, 


to 
holds for every fundamental matrix X(t) for every n = 1.) 

In order to prove the last italicized statement, let A(¢) be unspecified, 
suppose that (3) has a solution X(t) satisfying (2) and put Y(t) = F + V(t). 
Then 
(16) V(t) =o(1), 
as £—>0oo, and (3) shows that A= X’X" = + V)?=—V’S (— if 

n=0 
t is large enough; cf. (16). Thus 
(17) A(t) = V’(t) —V’(¢) V(t) + U(t), where U = & (—1)"V'V". 
Since the formulation (16) of the (*)-character of (3) implies that V’(t) e R, 
it follows from (17) that A(t) if 
(18) V’(t)V(t) 


but U(t) eR. On the other hand, it is clear from the definition of U in 
(17) that U(t) e L (D R) if, as to, 


(19) V’(t) =O(1) and V(t) = O(t*). 
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Since the second of the conditions (19) implies (16), it follows that, in order 
to prove the existence of an A(t) ¢ R possessing (*)-character, it is sufficient 
to assure the existence of some (continuously differentiable) square matrix 
V = V(t) satisfying the conditions of (16), (18), (19). In fact, any such 
V(t) defines, by (17), an A(t) possessing the desired properties. 

In view of the remark made before (15), such a V(t) cannot exist in 
the scalar case (m1). On the other hand, the binary matrix 
(20) V(t) where e(t) = exp(— 
is such a V(t) (for 1S t<o). In fact, it is readily verified from (20) 
that if V’V = || fag || then = 4t/t, which implies that (18) holds, 
while (19) is clear from (20). 

The interest of this example lies in the circumstance that not even the 
first step in the process of the successive approximations to (2)-(3) can be 
made if (4) is violated. In fact, this process consists of the assignments 


(21) Xeu’(t) = A(t) X(t), Xen = EB, where X,(t) =E. 


But (5) shows that (21) leads for k —0 to X¥,(¢) = H—A°(t) or to no 
X,(t) according as (4) is or is not satisfied. 

Actually, not only the implicit assumption (4) of the criteria (i), (ii) 
but also the latter themselves can be expressed in terms of the case k — 0 
of (21). In order to see this, suppose (4) and put 


(22) Y(t) =H—A*(t), Z(t) = 


so that Y = X,, Z =X," (for large ¢; cf. (6)). These matrices are solu- 
tions of the respecitve differential equations 


(23) Y’= B(t)Y, Z’ =C(t)Z, 
where B = Y’Y-! = (H — A°)’(E — or, by (5), 
(24) C=— (H—A°)1A 


(the second of the relations (24) is a consequence of the first in view of 
Z = Y~- and of the rule mentioned before (7)). On the other hand, (6) 
and (22) show that (2) is satisfied by Y = Y and XY = Z. In view of (23)- 
(24), this means that B(t) and C(t) are of type (*) under the assumption 
(4) alone. 
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Under the respective additional assumptions (ii), (1), it is easily con- 
cluded from (6) and (24) that 


(25) (A(t) —B(t))eL, (A(t) + C(t) eL. 


Accordingly, the rdle of the assumptions of the (*)-criteria (ii), (i) for 
A(t) is that of assuring the Z-character of the “small perturbations ” of the 
coefficient matrix A(t) or of the corresponding coefficient matrix — A(t) 
which results if the solution matrix XY is replaced by Y =X, or Z=X,"; 
cf. (23). Correspondingly, since B(t) and C(t) in (24) are of type (*), 
the sufficiency of (ii) or (i) for the (*)-character of A(t) could also be 
proved by applying to (23) the method of the variation of constants and 
using the fact that, according to the (*)-criterion (1), the (*)-character of 
A(t) — B(t), A(t) + C(Z) is assured by (25). 

(It does not matter that, by this approach to (ii), (i), the coefficient 
matrix A(t) of (3) becomes replaced by — A(t) in the second case of (25). 
In fact, it is clear from (4)-(5) and from the definitions of the function 
classes R, LZ that either of the conditions (i), (ii) is satisfied by A(t) if and 
only if it is satisfied by — A(t). On the other hand, since the knowledge of 
a solution X(t) of (3) contributes no information whatever to a solution 
W(t) of W’ =— A(t) W, there is nothing to indicate that — A(t) must be 
of type (*) when A(t?) is of type (*) for some reason.) 

It will finally be shown that the existence of X,(¢) is not only unneces- 
sary but insufficient as well: A(t) need not have: (*)-character if A(t) € R. 
In order to prove this converse of the negation proved above with the aid of 
the matrix V(t), an A(¢) is needed which, as before, is binary (n = 2), at 
least. In fact, it is clear from (15) that condition (4) is not only necessary 
but sufficient as well for the (*)-character of A(t) in the scalar case (n = 1). 

Let A(t) be the binary matrix in which the first and second rows are 
a(t),0 and b( t),— a(t), respectively, where 


(26) a(t)eR, db(t)eR. 


Then (4) is satisfied and (1) can be written in the form of the system 
u’ =a(t)u, v’ =b(t)w—a(t)v, where u, v denote the components of the 
vector x. Since, as readily verified, a solution of this system is given by 


t 
(27) u(t) —exp—f a(s)ds, = cf b(s)u?(s) ds} /u(t), 
t 
the last italicized assertion will be proved if it is shown that the functions 
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(27) need not tend to limits, as {->0, if only (26) is assumed. But it is 
clear from (27) that u(t) 1 and that v(¢) will therefore fail to tend to a 
limit if b(t)u*(t) ¢ R, that is, if 


(28) b(t) exp —2a(s)ds¢ 
t 


Consequently, it is sufficient to ascertain that (26) is compatible with (28). 
To this end, let b(t) be any (continuous) function satisfying b(t) eR 
and b(t) ¢Z (for instance, let b(t) = (sint)/t). Then it is possible to 
choose a (continuously differentiable) function f(t) which tends, as to, 
to 1 and is such that b(¢)f(t) ¢R. Clearly, (26) and (28) become satisfied 
if 2a(t) is chosen to be the logarithmic derivative of such an f(t) = f,(t). 


THE JOHNS HOPKINS UNIVERSITY. 


REFERENCES. 


[1] E. Hilb, Article ITB5 (1914) in the Encyclopddie der mathematischen Wissen- 
schaften, vol. II,. 
[2] A. Wintner, “On linear asymptotic equilibria,” American Journal of Mathematics, 
vol. 71 (1949), pp. 853-858. 
, “On free vibrations with amplitudinal limits,” Quarterly of Applied Mathe- 
matics, vol. 8 (1950), pp. 102-104. 


[3] 


1 
I 
1 
] 
] 
t 
f 
‘ 
Si 
t 
t: 
0 
ti 
al 
le 
al 
it 
(| 
in 
al 


COMMUTATORS OF OPERATORS, II.* 


By R. Hatmos. 


1. Introduction. Suppose that A and B are linear transformations on 
a complex Hilbert space H and let C be their commutator, C = AB — BA. 
Problems concerning C can be studied on three levels of generality. On the 
lowest level, it is assumed that H is finite-dimensional; on the intermediate 
level, infinite-dimensional spaces are allowed, but only bounded linear trans- 
formations (here called operators) are considered; and, on the highest level, 
both the space and the transformations remain completely unrestricted. Since, 
roughly speaking, on the highest level anything can happen, whereas on the 
lowest level nothing can happen, the mathematically interesting and difficult 
problems are all likely to be on the intermediate level. Thus, for example, 
the identity operator is a commutator of unbounded transformations, but not 
of operators, and the identity operator is the sum of two commutators of 
operators, whereas the identity matrix (of finite size) is not the sum of any 
finite number of commutators of matrices. 

The purpose of this note is to obtain some positive and some negative 
results concerning commutators on the middle level, and to call attention to 
some unsolved problems. The results are of three types. Section 2 studies 
the role which 0 plays in the spectrum of a commutator. Section 3 concen- 
trates on the problem of approximation by commutators, and, in particular, 
on how near a commutator can be to the identity. In Section 4 a factoriza- 
tion theorem, valid for every operator, is proved and applied to the problem 
of studying the spectrum of a commutator. 

The three levels of operatorial generality have their analogs in pure 
algebra ; the lowest level corresponds to finite-dimensional algebras, the middle 
level to normed algebras, and the highest level to completely unrestricted 
algebras. At times the relation becomes more than an analogy; in particular, 
it will be seen that several of the proofs below can be transferred from 
(bounded) operators to (normed) algebras without any change. Nevertheless, 
in order to avoid unnecessary complications, the more special terminology 
and notation of operator theory will be preferred throughout. 

It is occasionally convenient to use the differential notation for commu- 
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tators. Thus, to study AB— BA, the operator B will be fixed once and for 
all, and the commutator, regarded as a function of A, will be denoted by A’. 
The mapping A — A’ is a derivation; i.e., it is a linear transformation from 
operators to operators that has many of the properties of ordinary differen- 
tiation. Thus 


(A,A2)’ A,A.B BA,A.z 
(48 — + 4’ A,, 


and therefore the non-commutative versions of all the usual Leibniz formulas 
are valid. The product formula applied to A* (k=1,2,---) yields 
(A*)’ = AA’ + (A*1)’A; by an obvious induction argument it follows 
that 


k-1 
= 
4=0 


2. Divisors of zero. In the algebraic study of commutators an impor- 
tant role is played by Jacobson’s lemma ([4], Lemma 2). That lemma 
asserts that, in an algebraic algebra over a field of characteristic zero, if A’ 
commutes with A, then A’ is nilpotent. Kaplansky conjectured that an 
analog of this result is valid for normed algebras, i. e., that if A’ commutes 
with A, then A’ is quasi-nilpotent. (Recall that, by definition, A’ is quasi- 
nilpotent if ||(A’)” ||'/"—> 0 as n 00, or, equivalently, if the spectrum of A’ 
consists of 0 alone. Still another formulation says that r(A’) = 0, where r, 
here and below, denotes the spectral radius. For a discussion of such 
concepts and their role in normed algebras see [3], Chapter V.) Kaplansky’s 
conjecture is not settled yet; the following results, however, are corroborative 
evidence. 


THEOREM 1. If A’ commutes with A, then A’ is a (two-sided) generalized 
divisor of zero. 


Proof. The equation for the “derivative” of A” implies that under 
the present commutativity hypothesis (A")’ =nA*"A’. It follows that if 
A is nilpotent (in the usual algebraic sense), then A’ is a divisor of zero 
(also in the algebraic sense). If A is not nilpotent, then the relation 


n || || = || A"B— BA" || S2] 4 


implies that || A"*A’ ||/|| || 0 as i-e., that A’ annihilates the 
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sequence {A”-!/|| A”- ||} of operators of norm 1. Since this is the definition 
of a generalized divisor of zero, the proof is complete.’ 

The above proof is valid in any normed algebra. The method was first 
used by Wielandt in [6] to prove that A’ cannot be equal to 1; for an earlier 
and different proof, see Wintner [7]. That result, of course, is an immediate 
consequence of Theorem 1. Equally immediate is the following corollary, 
which is being put on record here for convenience of reference. 


Corottary. If A’ commutes with A, then A’ is not invertible. 


The conclusion of Theorem 1 is expressed in normed-algebraic terms ; 
it is sometimes convenient to be able to use operatorial language. This 
desideratum is achieved by the following slight weakening of Theorem 1. 


THEOREM 2. If A’ commutes with A, then 0 belongs to the approximate 


point spectrum of A’. 
(For related results. see [5], especially Theorem II, p. 359.) 


Proof. It is sufficient to prove that an operator C is a generalized left 
divisor of zero if and only if 0 belongs to its approximate point spectrum. 
Suppose therefore that CD, => 0, with || D, || = 1, and let x, be a vector such 
that || Data || 24] an Tf yn = Datn/|| Darn ||, then || yn || =1, and 


| Cyn || = | CDratn Duan || S CDn || = 2 |] CDn || > 0. 


If, conversely, Cz, —>0 as with || 2, || =1, write = (2, 
It follows that D, is the projection on the one-dimensional space spanned 
by 2, so that || D, || =1. The relation 


| CDyx || = || C (a, en) an || S || Can | | 


valid for all n and x, implies that || CD, || S || Cz, || > 0 as no. 


3. Density. The results of the preceding section are essentially nega- 
tive; since they say that commutators must satisfy certain conditions, their 
chief use is in proving that certain operators are not commutators. In this 
section and the next the emphasis will be positive; it will be shown that on 
an infinite-dimensional Hilbert space there are many commutators. 


Lemma 1. Jf A is an operator on a Hilbert space H, and if M is a 


1T should like to express my appreciation to Giinter Lumer and Juan J. Schiiffer 
for many interesting conversations on the subject of commutators. Theorem 1, in 
particular, was obtained in collaboration with Schiffer. 
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subspace of H such that dim MS dim M1, then there exists a commutator 
C on H such that Cr = Ax whenever xe M. 


Proof. Let Q be a partial isometry with initial space M and final space 
N C M4. In other words, Q sends M isometrically onto N and annihilates 
M+, whereas Q* sends N isometrically onto WM and annihilates N+ (and 
therefore annihilates M). If P= AQ*, then the desired commutator is 
C= PQ— QP; indeed, if ze M, then 


Cz = PQ« — = AQ*Q2r — QAQ*a = Ar. 


THEOREM 3. If H is infinite-dimensional, then the set of commutators 
is dense, with respect to the strong topology, in the set of all operators on H. 


Proof. For any operator A and vectors 7;,° + -+,2,, let M be the space 
spanned by the z’s, and apply Lemma 1 to obtain a commutator C such that 
Cr; = Azj, j= 1,:--,n. The existence of such a implies that every 
strong neighborhood of A contains a commutator. 


The preceding result makes it natural to ask whether or not the set of 
commutators is dense with respect to the uniform (norm) topology as well. 
The answer to that question is not known. In view of this ignorance, and in 
view of the quantum-mechanical history of the study of commutators, it 
becomes pertinent to ask: how well can the identity operator be approximated 
by commutators? In finite-dimensional spaces the answer is easy. 


TueEoreM 4. If H is finite-dimensional, then r(1— A’) = 1 for all A. 

Proof. For any operator A on H, let t(A) denote the normalized trace 
of A, i.e., the average of the proper values of A. Since ¢(A’) =0 for 
all A, it follows that not all the proper values of A’ can lie in the circle 
{A:|1—A|< 1}. 

CorotLary. If H is finite-dimensional, then || 1— A’ || 21 for all A. 


Proof. This follows immediately from Theorem 4, together with the 
fact that the normalized trace is dominated by the norm. 


Note that, that since 0 is a commutator, the result of the corollary is 
best possible. 

The main purpose of the next result is purely negative; it shows that 
the hypothesis of finite-dimensionality is indispensable in the corollary to 
Theorem 4. 


~ 


THEOREM 5. If H is infinite-dimensional, then there exists a commu- 
tator C on H such that || 1 
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Proof. Let U and V be isometries on H with respective final spaces M 
and N, where MN. (The existence of U and V is ensured by the fact 
that H is infinite-dimensional.) In other words, U*U — UU* =1—UU* 
is the projection on M+, and V*V — VV* =~ 1— VYV* is the projection on 
NL = M; it follows that U*U —UU* + V*V —VV* =1. If P and Q are 


defined by 
= (U + U*)+ (V+ V*), 239 = (U — U*) — — V*), 


then a straightforward computation shows that C= PQ—QP=—1-+ 1k, 
where FR is the Hermitian operator (U*V* — V*U*) —(UV—VU). It 
follows that || R || = 4. (In my earlier paper [1] I showed that every 
operator, and in particular the identity, can be the Hermitian part of a 
commutator. Up to this point the present proof succeeded only in re-estab- 
lishing that result, in a somewhat simpler and more explicit form than before, 
and in furnishing an estimate for the size of || R ||.) 

Every element of the spectrum of C is of the form 1-+ ia, where is 
a real number such that |«|<||R|| <4. If 8 is a positive number, and 
if A belongs to the spectrum of 8C, then A = 8 + 2a, so that 


| |? (1 — 8)? + 


It follows that if 6.1, then |1—A.|?=.97. Three comments complete 
the proof. First: since the Hermitian part of C is 1, and therefore commu- 
tative with its skew-Hermitian part, the operator C is normal (and so also 
is 8C). Second: since, for a normal operator, the spectral radius and the 
norm coincide, || 1 — 8C ||? = .97. Third: since C is a commutator, so is 8C. 

In general the infimum of || 1—C ||, as C varies over all commutators, 
is not known. There are, however, two special situations in which the 
inequality of the. corollary to Theorem 4 remains valid. 


THEOREM 6. If A’ commutes with A, then || 1—A’ | 21. 


Proof. If || 1—A’ || < 1, then A’ is invertible (see [3], Chapter V) ; 
the desired result follows from the corollary to Theorem 1. 


THEOREM 7. If the operator A is such that 


n-1 
lim sup || |] - | At Av | S 1, 


i=0 


then || 1—A’ |] = 1. 
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Remark. Since 
i=0 i=0 
the hypothesis trivially implies that the limit exists and is equal to 1. 


Proof. The Wielandt trick can be applied again. Since 


n-1 n-1 
(A*)’ An-1-t4’At nA™! > | A’) A‘, 


it follows that 
n-1 
n || | = || — BA" | + | A"™*4(1— A’)At | 
i=0 


n-1 
Division by || A”? || yields 


It follows that the hypothesis of the theorem is not consistent with the 
relation || 1— A’ || < 1, and the proof is complete. 

The hypothesis of Theorem 7 looks weird; its principal application is to 
subnormal (and a fortiori to normal) operators. An operator is subnormal 
if it has a normal extension (to a larger Hilbert space). The point is that 
if A is subnormal, then || A* || = || A ||* for every positive integer k. (For 
normal operators this is well known; for subnormal operators, it follows 
from the results of [2].) In view of these comments, and of the fact that 
an isometry is easily seen to be subnormal, the following assertion is an 


n 


immediate consequence of Theorem 7. 


Corotuary. If A is subnormal (and, in particular, if A is an isometry, 
or if A is normal), then || 1— A’ || = 1. 


The considerations of this section have been guided mainly by the 
question: can a commutator be near to the identity in the sense of norm? 
A related question is: can a commutator C be near to the identity in the 
purely algebraic sense that 1—C is nilpotent? In the finite-dimensional 
case the answer is obviously no; the trace of a nilpotent operator is 0, 
whereas the (normalized) trace of 1—C is 1. In the general case this 
problem takes its place with the many other problems concerning commu- 
tators of operators that still remain unsolved. 
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4, Factorization. The main result (Theorem 8) of this final section 
is somewhat paradoxical; it asserts that (if H is infinite-dimensional) every 
operator behaves in some respects like a commutator. 

If K is the direct sum of two copies of H, then every operator on K can 
be considered as a two-rowed matrix whose entries are operators on H. The 
following auxiliary result considers such matrices and proves that certain 
special ones among them are commutators. Since, in the infinite-dimensional 
case, H and K are isomorphic, the result could also be formulated in terms 
of H alone, but only at a considerable cost in clarity. 


LemMa 2. If H is infinite-dimensional, then to every operator A on H 
there correspond two matrices 


where P, Q, R, and S are operators on H. 


Proof. If K is the direct sum of a countably infinite number of copies 
of H, then every operator on K can be written as an infinite matrix whose 
entries are operators on H. Just as in the case of numerical (infinite) 
matrices, not every such matrix corresponds to an operator. If, however, 
X is the matrix whose entries in the diagonal just above the main diagonal 
are all equal to A, and if Y is the matrix whose entries in the diagonal just 
below the main diagonal are all equal to 1, all other entries in both cases 
being 0, then XY and Y do correspond to operators on K. It is easy to verify 
that the commutator YY — YX has A in the (1,1) position and 0 every- 
where else. Since the first “axis” of K is isomorphic to the direct sum of 
all the others, i. e., since K may also be considered as the direct sum of two 
copies of H, the result just obtained can also be expressed in terms of two- 
rowed matrices. The effect of expressing it in such terms is exactly the 
desired conclusion. 


THEOREM 8. Lvery operator on an infinite-dimensional space can be 
written as the product of two operators whose product in the reverse order 
is a commutator. 


Proof. An application of Lemma 2 to the prescribed operator A shows 
the existence of operators P, Q, R, and S such that 


It follows that A = PQ and QP = RS — SR. 
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Corottary. Every operator on an infinite-dimensional space is the sum 


of two commutators. 
Proof. If A=PQ with QP=RS—SR, then A—=PQ—QP+ RS—SR. 


This corollary is an improvement of one of the results of [1]; the 
methods of [1] yielded the same statement with four in place of two. 

What can the spectrum of a commutator look like? Can it be an 
arbitrary compact subset of the complex plane? Here again the answer is 
not known. Easy finite-dimensional examples show that 0 need not belong 
to the spectrum of a commutator. If, however, 0 does belong to the spectrum, 
then the rest of the structure of the spectrum can be prescribed arbitrarily. 
The precise formulation of this somewhat vague statement is the following 


application of Lemma 2. 


THEOREM 9. Jf A is a compact subset of the complex plane, with Oe A, 
then there exists a commutator C whose spectrum ts A. 


Proof. Let A be an operator whose spectrum is A; the existence of an 
operator, and even of a normal operator, with prescribed spectrum is a 


standard exercise in elementary Hilbert space theory. If C -(4 oP then, 


since 0e A, the spectrum of C is also A; the desired result follows from 
Lemma 2. 
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LINEAR DIFFERENTIAL EQUATIONS WITH COMPLETELY 
MONOTONE SOLUTIONS.* 


By Puitie HARTMAN and AUREL WINTNER. 


1. Let the coefficient f(a) in a linear differential equation of the form 
(1) y” —f(x)y=0 


be completely monotone on the half-line (0,0), that is, let all derivatives 
f(x) of f(x) exist and satisfy 


(2) (— = 0 (k= 0,1,- 


on (0,0). It was shown in [5] that (1) must then possess a solution 
y(x) +£0 which is completely monotone on (0,0). In view of the Hausdorff- 
Bernstein theorem, this means that, if the conditions (2) (which in general 
can readily be checked for the coefficient function of (1)) are satisfied, then 
(1) must possess a solution which is representable on (0,00) in the form 


(3) y(z) = f e“tdu(t), where 0 = dp(t) 0. 


0 


Thus the existence of a Laplace solution (in fact, of one having the 
particular structure specified by (3)) follows without any use of the classical 
technique of contour integrations, which can be applied (and is then laborious 
indeed) only if f is given explicitly. Correspondingly, the theorem, being a 
mere existence statement, can supply only the existence, but no indication 
leading to an explicit determination, of the function » occurring in (3). 

(The theorem does have some “explicit” use, but one in the reverse 
direction, namely, as follows: If f satisfies (2) on (0,00) and if it is known 
that (1) possesses a solution y(x) which remains bounded as t-—>o and is 
positive at some 7 = 2x > 0, then (—1)* times the k-th derivative of y(x) 
must be non-negative for every k= 0 and for every x >0. For a simple 
proof, cf. [6], where an application is given to Whittaker’s function Winn.) 


* Received August 18, 1953. 
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2. The existence theorem, mentioned at the beginning of Section 1, was 
extended in [1] from (1) to 
(4) h(x)y” + —f(2)y=0 


under the following assumptions: On (0,0). 


(5*) h>0 
and 
(5) h’,g,f are completely monotone. 


In other words, (5*) and (5), where h’ = dh/dz, assure for the homogeneous, 
linear differential equation (4) the existence of a solution of the form (3). 
Needless to say, if (5,), (5,), (5;) denote the respective assumptions in (5), 
then (5;) is identical with (2) and (5*), (5,), (5y) are satisfied in the 
particular case (1) of (4). 

If a >0 in Kummer’s form of the confluent hypergeometric equation, 
(6) ay” + (y—2)y’ — ay = 0, 
or in the closely related differential equation of the parabolic cylinder, 


(7) y”’ — xy’ —ay = 0, 


then the well-known integrals 


(6 bis) = e-vtta-1(] + (0 

or 

(7 bis) fee exp(— dt (—n<2 <0), 
0 


which are of the form (3), are solutions (the existence of solutions (3) in 
the limiting cases a0 is trivial, since y(z)=1, which belongs to 
p(t) =sgnt in (3), is then a solution). But the existence of these classical 
solutions is not accounted for by the criterion (5*)-(5). In fact, the cireum- 
stance that the coefficients, g(x) = y—z and g(x) = —g@, of the cases (6), 
(7) of (4) become negative (for large x) prevent the satisfaction of (5*)-(5), 


ifa=0. 


3. While these cases of (4) violate (5,), they satisfy (5*) and are such 


that 
(8) h’, — 9’, f are completely monotone. 
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Hence, what is at stake is whether (5,) can be generalized to (8,) in the 
sufficient criterion (5*)-(5) for the existence of a solution (3) of (4) (that 
(8,) is a generalization of (5,) follows from the fact that the negative of 
the first derivative of a completely monotone function is completely monotone, 
while the converse is not true). 

It will be proved that the answer to this question is affirmative. In 
addition, condition (8,) can be replaced by the assumption that —h” is 
completely monotone. Actually, since (5*) is retained and since 


(9) h>0O and h”=0 imply that h’= 0, if 0< r<o, 


this is no further generalization of (5). In other words, (8) is equivalent 


to the assumptions that 
(10) —h”,— f are completely monotone, (09<4<o), 


if (5*) is satisfied. Accordingly, the theorem to be proved can be formulated 
as follows: 

(*) If the coefficients of (4) satisfy the assumptions (10) and (5*) 
on (0,00), then (4) must possess a solution y(x) which is representable on 
(0,00) in the form (3). 

Needless to say, if y(x) is such a solution, then so is any positive constant 
multiple of y(z). Except for this trivial indeterminacy, the solution (3) 
is necessarily unique if the coefficients of (4) violate certain conditions. The 
latter follow in an explicit form from a more general result of [2], p. 382 
(a result which assumes much less than (10) and (5*)). 


4. The theorem announced in Section 3 is contained in the following 
theorem (**). If n= 2, then (**) reduces to (*). For an arbitrary n, 
that particular case of (**) which corresponds to the particular case (5) of 
(8) is known; cf. (ibis) in [3], pp. 731-732. 


(**) Let the half-line (0,0) be the range of x in the differential 


equation 

(11) fo(2)Dry + 3 0, 

where D=d/dz and n= 2, and suppose that 

(12) — D*f,, — Df, and f2,- +, fn, are completely monotone 
and 


(13) fo>0. 
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Then (11) possesses a solution y(a) which is representable on (0,00) in the 
form (3). 


5. The proof of (**) will depend on the following 
Lemma. In (11), let n > 2, and let (13) and 
(14) f(x) 20 (j = 2,3,° 


hold on0 <x <0 (while f, 20). Then, if a solution y = y(x) 40 of (11) 
satisfies 
(15,) (—1)*D'y = 0 (0<4<ow) 


for k =0,1,- - -,n—2, it must satisfy (15;) for k =n—1 also. 


In order to verify this lemma, note that (11) and (15;,), where 
k=0,1,- - -,n—2, imply that (—1)"(foD"y + f,D"*y) = 0; hence 
D{(—1)"*(D™y)exp (fa/fo)ds} S0 
x 
for O0< XSa<o. If (15,,) fails to hold for some z-value, say for 
a=X>0, then for Consequently, 
(—1)""D"*y is decreasing for so that (—1)"°D"*y = 0 is 
increasing for XY [x<oo. In particular 


(16) (—1)"*D"*y = const. > 0 for large a. 


On the other hand, n = 3 and the inequalities (15,3), (15,-.) imply that 
D»-*y tends to a (finite) limit as r—>oo. But a quadrature shows that this 
is incompatible with (16), that is, with the assumption that (15,-,) can 
fail to hold for some value of r= X > 0. 


6. The proof of (**) proceeds as follows: 


The assumptions of (**) imply (13) and (14). Hence, assertion (i) of 
[3] is applicable and assures the existence of a solution y = y(t) 0 of (11) 
satisfying (15,), where k = 0,1,---,n—41. It will be shown, by induction, 
that this solution satisfies (15;) fork —0,1,---. 

Assume that (15;) holds for k —0,1,---,n-+ j—2 for some j= 1. 
It will be shown that it also holds for k =n-+j—1. To this end, let (11) 
be differentiated j times. The resulting equation, 
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Cim{ (Dfo) +3 (— 1)**(Dmfx) — 0, 
where = 7!/m!(j — m) !, can be written in the form 
(17) +3 (—1)**F D™*y = 0, 
where F;, = Fj,(a) is defined by 
(18) +3 (—1)"CimD" firm 
when 7 = 1,2,--- and k—1,2,---, 7, and by 


j 


m=0 
when j= 1,2,--:- and +n. 
The assumptions (12) imply that 


(20) (—1)** => 0, (—1)*D™f, 20 and (—1)'D*f;=0 
for k =0,1,--- and j=2,:--,n. Hence, (13) and 20) show that 
(21) F,(z) 20 for k = 2,3,---,n+4, 


since neither Df, nor f,; occurs in the formulae for F;, when k 2 2. By the 
induction hypothesis, the solution y=y(x) of (17) satisfies (15,) for 
k=0,1,---:,n-+j—2. Hence, the Lemma of Section 5 shows that (13) 
and (21) imply that y = y(z) satisfies (15,) for k = n + j —1 (note that the 
condition n > 2 of the Lemma is satisfied by the present n + 7 22+ 7> 2). 
This completes the induction and the proof of (**). 


7. Both in the assumptions and the assertion of (**), the z-range was 
the open half-line (0,00). It is however clear that the proof applies on the 
closed half-line [0,0 ) also. The resulting variant of (**) can be combined 
with the following remark (which has nothing to do with differential 
equations) : 


If a function y(x) is completely monotone on [0,0), then the same is 
true of the functions y;(x), where 11, 2,-- - and 


= {y(2) y® (0)a4/k1}/(— a)? 


In order to see this, it is sufficient to observe that, according to Taylor’s 
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formula with integral remainder, the expression {- - -} occurring in the 
definition of is 


(—2)! (— 1)'y(at)(1 — t)dt/(I—1) !. 


Appendix. 


Theorem (i) of [3], p. 731, on which the proof of (**) was based above, 
states that the conditions (13)-(14), where 0 << x <<, assure the existence 
of a solution y= y(x) satisfying (15,) for k =0,1,---,n—1. There is 
no condition on f, (= 0). Correspondingly, if f, = 0 is assumed, then (15;) 
holds for k =n also (cf. loc. cit.). 

It is natural to ask whether or not the assumptions (14) can be omitted 
for some values of j on the range 2 = j = n if, in the corresponding assertion, 
some values on the range 0 = k = n —1 are omitted for the index k in (15,). 
A partial answer is given by (+) below. 


If Yo,* * Ym are m functions, each having m derivatives, let W, = 1, 
W, = y, (x) and, if k = 2,- - -,m, let Wy, = Wa = Wi 41, Yo, 
denote the (k-rowed) Wronskian determinant of the functions - yx. 


(t) In the differential equation (11), 
let 
(22) 20 fork =m-+1,---,n 


and let the differential equation 


(23) fo(2)Dmy + 3 0 

possess a complete set of solutions y,(x),° *,Ym(x) satisfying 

(24) Wi Wi(x) = Ye) > 0 for k= 0,1,- +--+, m. 
Then (11) has a solution y = y(2) satisfying (15;) fork =0,1,---,n—m. 


This theorem (+) will be deduced from Theorem (I) in [3], pp. 733-734, 
which deals with a linear, homogeneous system of first order differential 


equations. 

First, if denote the positive functions a) = W/W, 
dy, = where k =1,---,m—1, and = Wm/Wm-1, then, 
according to Frobenius, the expression in the right of (23) can be “ fac- 
tored ” as 


Da,Daoy, 


1 
0 
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where, for example, Da,Dayy = D(a,(D(aoy))) (for a simple proof, see [4], 
pp. 815-316). If y is replaced by D"™y, then the expression in the last 
formula line becomes identical with the first m + 1 terms of (11). Hence, 
(11) can be written as 


k=m+1 
Define n functions -,2n by the equations 
(26,) 2, == (—1)*" if k= 1,---,n—m, 


(262) Zn—-msk = (— 1)"-m+k-1q)_ Da,Dr-"y if k 


Then the n-th order differential equation (11) and/or (25) is equivalent to 
the following system of n first order equations: 


(27;) = — If 1,-- -,n—m—1, 
(272) = Zn—m+k/Ak-1 if k= » 
P n 
k=m+1 
Since do,* * *,@m are. positive, it follows from (22) that the coefficients of 
—%,° * *,—2%, on the right side of (27,)-(273) are non-negative (in fact, 


the coefficients are 0,1, 1/do,° +, 1/Qm-15 fms1//fodms fm+2/fodms* * > fn/fodm)- 
Hence Theorem (I) of [3], pp. 733-734, is applicable and implies that the 


system (27,)-(27;) has a non-trivial solution = 2,(%),° +, 2n = 2n(Z) 
satisfying = 0, where k—1,---,n and0<a<o. The definition 
(26,) of z, for k =1,- - -,n—m and the definition of Zpm4, in (262) now 


show that (11) has a solution y= y(z) $0 satisfying (15,) for k = 0,1, 
*++,nm—m. This proves (f). 

It may be mentioned that while Theorem (i) of [3] can readily be 
deduced from (+), the converse deduction is impossible. This is shown by 
the example 


(28) + y¥=0, 
which has the general solution 
(29) y = + bed” cos (34x/2) + ced sin (3427/2). 


In fact, the conditions imposed by (i) of [3] on (23) are satisfied by the 
case (28) of (11), where m = 3, but it is readily verified that for no choice 
of the integration constants a, b, c will there result m = 3 solutions 4, Yo, Ys 
satisfying the conditions (24). 


THE JOHNS HOPKINS UNIVERSITY. 


206 PHILIP HARTMAN AND AUREL WINTNER. 


REFERENCES. 


[1] P. Hartman and A. Wintner, “On the Laplace-Fourier transcendents,” American 
Journal of Mathematics, vol. 71 (1949), pp. 367-372. 


[2] and A. Wintner, “On the classical transcendents of mathematical physics,” 
ibid., vol. 73 (1951), pp. 381-389. 
[3] and A. Wintner, “ Linear differential and difference equations with mono- 


tone solutions,” ibid., vol. 75 (1953), pp. 731-743. 

[4] G. Pélya, “ On the mean-value theorem corresponding to a given linear homogeneous 
differential equation,” Transactions of the American Mathematical Society, 
vol. 24 (1922), pp. 312-324. 

[5] A. Wintner, “On the Laplace-Fourier transcendents occurring in mathematical 

physics,” American Journal of Mathematics, vol. 69 (1947), pp. 5-13. 
» “On the Whittaker functions W,,(x),” Journal of the London Mathe- 
matical Society, vol. 25 (1950), pp. 352-353. 


[6] 


( 
1 
( 
Si 
( 
( 


ON NON-OSCILLATORY LINEAR DIFFERENTIAL EQUATIONS 
WITH MONOTONE COEFFICIENTS.* 


By Puitie HarTMAN and AUREL WINTNER. 


1. Introduction. Let the coefficient function in the differential equation 
(1) y” + q(t)y = 9, 
where 0 Xt <., be continuous and monotone (both cases 
(2,) dq = 0, (2_) dg=0 


will be considered). Along every solution y= y(t) #0 of (1), define 
h=h(t) by 


(3) 

and k = k(t) by 

(4) k=y?+y"/q if 

Then, in the sense of Stieltjes integrations (in ¢), 

(5) dh = y*dq 

and 

(6) dk = — y*dq/q? (qA~9); 


ef. e.g., [10], pp. 384-385 and [5], pp. 530-531, respectively. Hence the 
monotony of g implies that of both functions (3), (4) of ¢; in fact, 


(7,) dh = 0, dk = 0, (7_) dh =0,dk=0 
in the respective cases (2,), 


On any interval on which y(t) 40, the logarithmic derivative 


(8) r(t) =y"(t)/y (4) 
satisfies the Riccati equation 
(9) rtrtq=0 


(even if g is not monotone). 
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This paper will deal with the behavior of the functions (3), (4) and 
(8) when qg(¢) is monotone and 
(10) (1) is non-oscillatory. 


By (10) is meant that some (hence, according to Sturm, every) solution 
y(t) #0 of (1) has only a finite number of zeros on the half line 0 =¢ <a. 
For other results on the case of (1) for which (2_), (10) and g > 0 hold, 
ef. [5]. 


2. Positive q(t). The first assertions will deal with the case in which 
q is monotone, positive and satisfies (10). Then (2,) is impossible, hence 
holds. In addition, 
(11) 0<q-0 as 
and, by (10) and g > 0, 


(12) f q(t)dt 
must hold (cf. [11], p. 97), which implies, by (2_), that 
(13) tq(t) ~0 as 
The first theorem, in contrast to most of the other theorems in this paper, 
will not assume that q is monotone. 


(*) If (1) ts non-oscillatory and q(t) 20, and tf y=y(t) ts any 
solution (s£0) of (1), then, from a certain t, onward, the derivative of (8) 
cannot become positive, 


(14) r(t) S0 forti<ct<o, 


and, tf t is large enough, 


(15) f q(s)ds Sr(t) S1/(t— ty) (t > to). 
t 
The inequalities (15) refine the consequences, 


(16) f r?(t)dt << and r(t) > 0 as t> 0, 


of (10) and (12) (ef. [13], p. 375, where it is not assumed that g = 0 but 
only that (10) holds and that the integral in (12) exists as an improper 


= 


h 


it 
or 
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integral; cf. also [4], p. 391). The first inequality in (15) is stated in [13], 
p. 377. 

Proof of (*). Since (9) holds for large ¢, the assertion of (*) con- 
cerning (14) follows from (9) and gq=0. It also follows from (9) that 
”“+q<0. Hence, a quadrature and the second of the relations (16) lead 
to the first of the inequalities (15). Finally, (9) implies that 7” + 7r?S0. 
If r(¢) = 0 for some large ¢, then g =0 and r= 0, and so it can be supposed 
that r(t) >0 for large Hence r?=0 implies that 1+ S0, 
and so a quadrature gives — 7-1 + ¢ < const., which proves the second of the 
inequalities (15). 


In order to obtain further results under the same assumption (10) as 
in (*), recourse must be had to the following theorem (cf. [2], p. 703, and 
for the following formulation, [6], p. 633): Even if just (10) (without (2,)- 
(2.) or sgn g= const.) is assumed, (1) possesses a solution y = y(t) #0 
which will be referred to as the principal solution of (1), is unique to a 
constant factor and is characterized by the property that 


(17) yo(t) =o0(| y(t)|) as E00, 


where y(t) is any solution of (1) linearly independent of y)(¢t). An equi- 
valent characterization of yo(¢) is that, when the lower limit of integration 
is large enough (so large as to exceed the last zero of y(t)), 


(18) (y(t) )-*dt if and only if y const. yo, 


where y = y(t) denotes any solution (5s<0) of the non-oscillatory differential 
equation (1); cf. loc. cit. If y=y(t) #0 is not a principal solution, then 


yo(t) 
t 


is defined for large ¢ and is a principal solution. 


(i) Let (1) be non-oscillatory and let q(t) be positive and monotone 
(so that (11) and (2_) hold). Then the function (3) belonging to any 
solution of (1) ts such as to satisfy h(t) = 0 and dh(t) = 0, which implies 
that h(o) exists (<0) and h(wo)=0. There exists some solution 
y(t) 40 for which the sign of equality holds in h(o) =0; in fact, the 
principal solution yo(t) is such a y(t). In order that h(c) =0 holds for 
every solution y(t), it is necessary and sufficient that 


14 


y 
) 
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(19) tq(tdt 


(in the latter regard, cf. (12)-(13)). 


Proof of (i). Since (7_), (11), (12) hold, only the assertions dealing 
with h(co) 0 need to be proved. The first of these assertions is that 


(20) =0, 


where the subscript of h refers to the choice y(t) of y(t) in (3) (the corre- 
sponding notation, 7, will be used in connection with (8)). Suppose that 
(20) is false. Then ho(t) >c holds for sufficiently large ¢ and for some 
c=const. >0. Since (3) and (8) imply that (9) can be written in the 
form 


(21) r+ h/y? =0, 


it follows that — 1,’ = c/yo? (for large ¢), and so, by a quadrature, 


t 
—ry(t) + Const. = c (Yo(s) )-?ds. 


But the expression on the right of this inequality cannot be bounded as ¢ >. 
by (18), whereas the function on the left of the inequality is bounded, by 
(15). This contradication proves (20). 

In order to prove the remaining part of (i), note that (19) fails to hold 
if and only if (1) has a solution y(t) =40 which is asymptotically propor- 
tional to t as {—>0o (for references, cf. [8]). On the other hand, if y(t) 
is chosen to be positive for large ¢, then y(t) is asymptotically proportional 
to ¢ if and only if the (non-increasing) function y’(t) has a positive limit 
y (0). Hence the statement of (i) concerning (19) is equivalent to the 
statement that h(t) = y” + qy? = Const. > 0 and y > 0 hold (for large f) 
if and only if y’(t) = const. > 0 does (here y(t) is a non-principal solution ; 
ef. (18)). 

Since the first part of the latter statement is clear, it is sufficient to 
show that if h = Const. > 0, y > 0 hold for large ¢, then y’ = const. > 0 does. 
But h(t) = Const. and (21) imply that — 1’ = Const./y?, hence a quadrature 
and (15), (18) show that 


r(t) > Const. f (y(s) )-*ds. 
t 


If this inequality is multiplied by y(t), then the expression on the right is 


«= 
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a principal solution y.(¢) (cf. the formula line following (18)), and so 
the resulting inequality can be written in the form y’(t) = y(t). Since 
yo(t) > 0 for large ¢, it follows, for reasons of convexity, that y’(t) 2 0 
and yo(t) = const. > 0 for large Hence, y’(¢) = const. > 0 for large ¢. 
This completes the proof of (i). 


3. Negative q(t). While (*) and (i) have dealt with g(t) > 0, it will 
now be assumed that g(t) < 0; so that, if g(t) is monotone, q(o) = 0. 
If —«o< q(«) <0, the asymptotic behavior of every solution y(t) and of 
its derivative is supplied by [12], pp. 60-61; cf. (39) below. 

The assumption g < 0 alone (without (2,)-(2_)) implies that the prin- 
cipal solution y)(¢) (to a constant factor) satisfies the inequalities 


(22) y(t) >0, y(t)<0, >0 (0<t <a) 


and that, for large ¢, every non-principal solution y(t) (again to a constant 
factor) is subject to the inequalities 


(23) y(t) >0, y(t) >0, > 0; 


(Kneser; cf. [1], pp. 48-53, and, for other proofs, [2], p. 702, and [7]). 
There obviously exist solutions y = y(t) satisfying (23) on the entire range 
0St<o. Under the additional assumption that g is monotone (and that 
the above-mentioned case —o< qg(o) <0, which can be integrated by 
asymptotic formulae, is excluded), theorems (ii), (iii) below will describe 
the behavior of the functions (3) and (8). 


(11) Let q(t) be negative and monotone and q(«) =0; so that (2,) 
holds and 
(24) 0>q->0 as 


Then, for the principal solutions (22), 

(25) ro(t) <0, >0 
and (what is more) 

(26) ho(t) <0, dhy(t) 20 (0=t<o), 
whereas for the non-principal solutions (23), and if t is large enough, 


(27) r(t) >0, r(t) <0 
and (what is more) 
(28) h(t) >0, dh(t) = 0. 
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In addition, r(%) =0 holds for every solution and = 0 for the 
principal solution. Finally, h(o) = holds for the non-principal solutions 
(23) if and only if 


(29) f — tq(t)di (¢q<0). 


The assertion r(o) 0, made after (28), follows from Perron’s refine- 
ment of Poincaré’s theorem; cf. [9]. 

In contrast to (22) and (26), the inequalities (28) need not hold on 
0=t<o (but only for large ¢) even if (23) holds for all ¢= 0. 


Proof of (ii). Since (9) is equivalent to (21), it is clear that (22) 
and (26) imply (25) and that (23) and (28) imply (29). Note that 
dh = 0, by 

Suppose, if possible, that ho(t) = 0 for some 20. Then = 0 
and, by (21), ro’(#) S0 for ¢=t). Since (22) implies that ro(¢) < 0, it 
follows that ro(¢) = —const. < 0 for t= t), which contradicts = 0. 
This proves (26), hence (25). 

In order to prove (27) for large ¢, it is sufficient to prove that h(t) >0 
for some ¢t. Since r(t) >0 and r(o) —0, it is clear that r’(t) < 0 for 
some t, say for t= ¢). Then (21) implies that h(t) >0. This proves (28), 
hence (27). 

Concerning the assertion ho(o) —0 of (ii), note that, by (21), yo() 
and yo’( 0) exist (as finite limits). In addition, yo’(0%) = 0, since y)’(t) (< 0) 
has a finite integral over 0St<o. Hence ho(o) —0 follows from 
Yo =0, =O and the boundedness of y(t). 

Finally, the proof of the assertion of (ii) concerning (29) and h(o) = « 
is about the same as the proof of the assertion of (i) concerning (19) and 
h(o) 0, and will therefore be omitted. 

The following theorem (iii) describes the situation in the second of 
the cases 
(30) =0, =—, where < 0. 


in the same way as (ii) in the first of the cases (30). 


(iii) Let g(t) be negative and monotone and q(«) =—«; so that 
(2_) holds and 
(31) 0> q(t) ~—o as too. 


Then, for the principal solutions (22), 


(32) <4, (t) <0 (0 St<o) 
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and (what is more) 


(33) ho(t) > 90, dhy(t) S0 (0OSt<o), 
whereas for the non-principal solutions (23), and if t is large enough, 
(34) r(t)>0, (t) >0 

and (what is more) 

(35) h(t) <0, dh(t) =0. 

In addition, 

(36) (0) =0 or r(o) =0o,h(o) 


according as the solution is principal or non-principal. 


The first and third of the four assertions (36) are known; in fact, the 
classical proof (cf. [1], p. 54) for the case —o < g(o) <0 is valid for 
the case g(0) also. 

Proof of (iii). The proof of (33) (hence (32)) and of (35) (hence 34) ) 
in (ili) is about the same as the proof of (26) (hence (25)) and of (28) 
(hence (27)) in (ii), and will therefore be omitted. What remain to be 
proved are the second and the fourth of the relations (36). 


Since (22) implies that yo dt where yo <0, and that y’ is 
monotone, it is clear that yo’(0) = 0. On the other hand, since gq < 0, it is 
seen from (3) and (33) that 0 < ho(t) < yo?(t). Hence the second of the 
assertions (36) follows from = 0. 

Finally, since (23) implies that y(oo) oo, hence y?(¢) > const. > 0 
(for large ¢), the fourth of the assertions (36) follows from (2,), (5) and (31). 

The following theorem deals with the case of a monotone, negative q(t) 
with —o< < 0, say 


(37) where 0<w<o. 

It is known that if g(t) ~—o? as t->o, then the principal and non- 
principal solutions, y = y(t) and y= y(t), satisfy 

(38) (t) and r(t) as t>0 


(cf., e.g., [1], p. 54), and that (37) and the monotony of q (in fact, the 
boundedness of the variation of g on 0 =¢t <oo) imply that 


t t 
(39) y(t) ~ const. exp — (— q(s))ids, y(t) ~ Const. exp (— q(s))ds 


as t—>oo (cf. [12], pp. 60-61). 
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(iv) Let q(t) be negative and monotone and satisfy (37). Then, 
according as (2,) or (2_) holds, the principal solutions (22) satisfy, for 
0S=t<o, 


(40,) <—o, 1’ > 0 or (40_) > —a, T) <0 
and (what ts more) 
(41,) hyo < 0,dhy 2 0 or (41_) ho > 0,dhyo = 0; 


in addition, hy(o) =0. On the other hand, 


t 


(42) + {exp2 (—a(s) )lds}dq(t) 

is necessary and sufficient in order that the non-principal solutions satisfy 
(43,) h( co) or (43.) h(co) 
in which case, for large t, 

(44,) r>w,r <0 or (44_) r<or”’>0 
and (what is more) 

(45,) h>0,dh20 or (45_) -h<0,dhS0. 
If (42) fails to hold, then (43)-(45) must be replaced by (7) and 

(46) h( co) & 


If (42) does not hold, then h(oo) can be made an arbitrary number 
+0) by a suitable choice of the non-principal solution y= y(t). For 


example, if g(t) ——1, so that (1) becomes y” — y = 0, then, for the choice 
y=e't-+ ce of the solution, the function (3) is h(t) =—4c (and sof 
h(o) =—4c). 


Proof of (iv). The proofs of the assertions concerning 7) and ho are 
similar to the proofs of the corresponding assertions in (ii) and will be 
omitted. The proof of the statements concerning (43) (hence (44) and 
(45)) and (46) follows from (5) and the last part of (39), leading to 


(42bis) A(t) 4 const. ~ Const. {exp 2 f (— )idu}dq(s). 
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4. The case where 


(47) f q(t)dt converges cf = lim 


e 


ossibly just conditionally). As mentioned after (15), the inequality 
P J y 
f q(s)ds Sr(t) and the relations (16) do not depend on the assumption 


t 
q= 0 of (*) but only on (10) and (47). 

In contrast to the inequality (15;), which involves the remainder of 
(47), the assertions of (ii)-(iii), where g <0 and (2,)-(2_) are assumed, 
lead to inequalities (again for r) which involve the square root of —q. In 
fact, if (2,) and (24) hold, then, if ¢ is large enough, rp <0 and r>0. 
Hence, by (26) and (28), 


(48) 0>m>—gq* and r> where g* = (—q)!>0. 
Similarly, if (2_) and (31) hold, then (33) and (35) imply that 


(49) ‘and where g* = (— q)4 > 0, 


if ¢ is large enough. 


Since (48)-(49) are inequalities for the logarithmic derivative (8) of 
a principal or a non-principal solution, yo or y, quadratures lead to inequal- 
ities for y. and y themselves. There is a corresponding remark for (15,), 
which holds for principal and non-principal solutions. In fact, a quadrature 
of (15,), where (10) and (47) are assumed, shows that there belongs to 
every solution y(¢) £0 of (1) a const. >0 satisfying, for large t, the 
inequality 

t 


(50) y(t) > const. exp f Q(s)ds, 
where 
(51) Qt) =f a(s)as 


(cf. (47)). But the inequality (50) can be improved if the assumptions 
(10) and (47) are strengthened: 


(t) Let q(t), where0 St <o, be any continuous function satisfying 


215 
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(47) and having the property that, if Q(t) is defined by (51), the differential 
equation 
(52) + 4Q0°(t)a =0 


is non-oscillatory. Then 
(a) the differential equation (1) is non-oscillatory ; 


(8) corresponding to every solution x(t) #0 of (52), there exist two 
solutions of (1), say y= yo(t) and y= y(t), satisfying 


(53) 0 <yo(t) S| f Q(s)ds y(t) 


for large t. 


The assumptions that (47) holds and that (52) is non-oscillatory imply 
the inequalities (53) for certain solutions of (1). If the assumption that 
(52) is non-oscillatory is strengthened to 


f tQ2(t)dt 


there result asymptotic formulae for the solutions of (1); ef. [18], theorem 
(i), p. 719. 

Assertion (a) of (f) was proved in [3], p. 959. Assertion (8) of (+) 
will follow by a modification of the proof of (a). j 


Proof of (+). If an arbitrary solution 7(¢) £0 of (52) is replaced, 
when necessary, by —2(t), then, since (52) is supposed to be non-oscillatory, 
it can be assumed that z(t) > 0 for large ¢. In terms of such an z(t), define 
(for large ¢) a function by placing 


t 
(54) 2(t) {x(t)}4exp Q(s)ds > 0. 
0 
Logarithmic differentiation of (54) gives 
(55) ha’ +Q, 
whence another differentiation leads to 
(56) 2!” — {$p’ —q + (4p + Q)?}2 = 0, 


where p denotes 2’/x. In view of the latter definition, p belongs to (52) in 
the same way as r in (8) belongs to (1). Hence, what belongs to (52) in the 
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same way as (9) belongs to (1) is the Riccati equation p’ + p? + 49? = 0. 
It is seen that (56) can be written in the form 


(57) 2’ + {q+ (4p— @)*}2 where p= 2’ /z. 


Since the solution (54) of (57) does not vanish, the differential equation (57) 
is non-oscillatory. But the coefficient function q of (1) is majorized by the 
coefficient function {- - -} of (57). Hence (+) follows from the comparison 
theorem (iv) in [6], pp. 635-636, and its proof. 


5. A remark on the case g=O0. As mentioned at the beginning of 
Section 2, condition (47) is necessary for (10) if g=0. Hence it is natural 
to ask whether or not the converse of assertion (a) of (+) is true in the case 
q = 0, that is, whether or not (52) must be oscillatory if (1) is and if g= 0. 
The answer to this question remains undecided even under the restriction 
that g tends monotonously to 0 as to. 

What is easy to see is that, even without the restriction just mentioned, 
the answer is in the affirmative if (52) is weakened to 


(58) x” + —0; 
in addition, the assumption g= 0 can be relaxed to 9 = 0, if (47) holds: 


(1) If (1) ts non-oscillatory and (47) holds, and if the function (51) 
is non-negative, then (58) is non-oscillatory. 


In fact, if y= y(t) #0 is a solution of (1), then (16) is a consequence 
of (47), and so a quadrature of (9) leads to 


+R++Q=0, where R(t) r*(s)ds 
t 

(for large ¢). This can be written in the form {Rk} + 2RQ —0, if {BR} 
denotes R’ + R? + Q*. Consequently, {R} = 0 (for large ¢), since 
and R= 0. Hence the statement concerning (58) follows from the criterion 
of [13], p. 368. In fact, the definition of {R} shows that the limiting case, 
{R} = 0, of the inequality {R} = 0, is the Riccati equation which belongs 
to (58) in the same way as (9) belongs to (1). 

Under the assumptions of (I), assertion (8) of (+) has the following 
analogue: 


l 
0 
t 
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(II) If the conditions of (I) are satisfied, then, corresponding to every 
solution x== x(t) 5£0 of (58), there exists a solution y=y(t) of (1) 
satisfying 


y(t) =|2(t)|exp Q(s)ds 
for large t. 


In particular, (1) has solutions satisfying 


t 
y(t) texp Q(s)ds 
for large ¢, since Q? = 0 implies that (58) has solutions satisfying x(t) = t. 


If this remark is combined with the assumptions and the assertion of (I), 
it follows that (58) has solutions satisfying 


t 
a(t)= texp f cf Q?(u)du) ds. 
Hence (1) has solutions for which 


t t © 
y(t) = texpl f Q(s)ast 


holds for large ¢. 
The details of the proof of (II) will be omitted since it is similar to 


that of (8) in (t+), if (54) is replaced by 
t 
2(t) =2(t) exp f Q(s)ds. 


THE JOHNS HOPKINS UNIVERSITY. 
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DIVERGENCE OF VECTOR FIELDS AND DIFFERENTIAL 
EQUATIONS.* 


By S. K. ZAREMBA. 


The present note contains several propositions concerning the geometry 
of the integral curves of the differential equation 


(1) du/X (x,y) = dy/Y (2, y). 


All these propositions involve the divergence X,(2z,y) + Yy(x,y) of the 
vector (X,Y). The first part of this note gives a geometrically intrinsic 
form to the well-known formulae describing the variation of the integral 
curve of a differential equation, regarded as a function of the initial point. 
The second part deals with some immediate applications of the first part to 
the theories of limit cycles and first integrals; finally, in the third and fourth 
parts, we obtain a theorem which is partly reciprocal to Bendixson’s criterion 
for the absence of closed integral curves. 


I. The variation of the integral curve. It is well known [3] that if 
the functions f(z,y) and f,(z,y) are continuous in a domain G, and if 
21,7) is the solution of the differential equation dy/drz = f(a, y) satis- 
fying the initial conditions x = 2,, y =», then 


(2) Yn (2, expt inte, 9) 


provided that the are of the curve y —y(#, 2,7) bounded by the points of 
abscissae 2, and zx. is contained in 

In the more general case of an equation like (1), where neither of the 
inequalities X(x, y) A 0 and Y(a, y) ~0 holds throughout the domain under 
consideration, we cannot make use of this formula. However, the following 


proposition can then be applied: 


THEOREM 1. We assume that in a domain G the functions X (x,y) and 
Y (x,y) do not vanish simultaneously and have continuous first-order partial 
derivatives. Let (2,,y,) and (22, y2) be any two points connected by an arc 


* Received June 12, 1953. 
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L of an integral curve x= x(t), y= y(t) of (1) contained in G, the para- 
meter t satisfying 
(3) dx/X (x,y) = dy/Y (a, y) = dt 


and x(t;) =a, y(ti) =yi3 ((=1,2). Finally, let JF; be the orthogonal 
trajectory of (1) passing through the point (2, yi), and let u; be the para- 
meter on J; satisfying 


(4)  —de/¥ (a, y) = dy/X (a, y) = dw/{[X(2,y) 919 
and vanishing at the point (ai, yi) ; t= 1, 2. 


Then, if we consider uz as a function of u, defined by the condition that 
the values of u, and u, corresponding to each other should belong to points 
connected together by integral curves of (1), we have, for u, = 0, 


(5) dus/duy f uD) + 


Proof. L can be divided into a finite number of ares, on each of which 
either Y(z,y) or Y(z,y) is constantly different from zero, and it follows 
from the shape of (5) that it suffices to prove this formula for any of these 
arcs; for obvious reasons of symmetry, we can assume that, on the are under 
consideration, X (x, y) + 0. 

Let 7: (u,) and y2(u,) be the ordinates of the intersections of the lines 
t—=2, and r—2>, in that order, with the integral curve of (1) passing 
through the point of J, which corresponds to the given value of uw. By 
considering the curvilinear triangle formed by the aforesaid integral curve, 
by J, and by the line  ~7z,, and by making this triangle tend to a point, 
we find, for = 0, dyi/du,; =1/X(%4, 41). Similarly, dus/dyz = X (#2, y2). 
Hence du2/du, = X (x2, y2)/X (a1, y1) dy2/dm. Substituting here, for dy2/dm,, 
the value given by (2) when f(2z,y) = Y(2,y)/X(z,y), we find 


rts —X,¥ 


which, after simplification, gives (5). 


The interpretation of this formula in terms of two-dimensional flow is 
obvious, and (5) could also be obtained from the formula describing the 
variation of area in such a flow, but a rigorous proof on these lines seems 
to be longer than the preceding argument. 


= 
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CoROLLARY 2. On the assumptions of Theorem 1, if, on JF, and J,, 
we introduce, instead of u, and Us, parameters v, and v, satisfying 


and vanishing at the point (ai, yi) (1=1, 2), we find, for v, = 0, 


where [R(x, y)]? = [X(a, y)]? + y)]*?. Similarly, on the same assump- 
tions, if we introduce, on Ji, a parameter o; representing the length of the 
arc measured from the point (ai, yi) in the direction of the increasing u;’s 
(i = 1,2), we find, for o, = 0, 


(8) = 9s) /R (2s, exp f ‘(Xe + ¥,)dt}. 


II. Miscellaneous applications of Theorem 1. In this section we pro- 
pose to show how various propositions, which were obtained by means of 
apparently more complicated methods, can be represented as immediate, or 
almost immediate, consequences of Theorem 1. 

We recall that a closed integral curve C of (1), which does not pass 
through singular points, is called a limit cycle if it is a limit set of integral 
curves passing through a certain neighborhood of C. The limit cycle C is 
called stable if it is the limit set, for > +00, of every integral curve passing 
through a certain neighborhod of C. It is called unstable if the same occurs 
for t > —oo instead of {> -+o. Finally, a limit cycle is called semi-stable 
if the neighboring integral curves cituated on one side of it tend to it with 
t -+ oo, and those situated on the other side tend to it with We 
have the following: 


Proposition 3. A limit-cycle C of (3) is stable if 


(9) (X,+Yy)dt <0, 

and unstable tf 

(10) + Y,)dt > 0. 


Proof. Through an arbitrary point P of C, let us draw an orthogonal 
trajectory J of (1), on which the parameter w is defined by (4) (u being 
substituted for u;) and vanishes at P. Here, the formerly considered curves 
J, and J, coincide, and u. will now denote the value of the parameter u 


(6) — dx/Y (x,y) = dy/X (x, y) = dy; 
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belonging to the first point, say Qe, of J met again after having started, 
from the point Q, of J, defined by u= 1, along an integral curve of (3) 
in the direction of the increasing ?’s. 


According to Theorem 1, we have, for ui; = 0, 


du,/du, = exp{ (X, + Y,) dt}, 
Cc 


which shows that, at this point, (9) implies du,/du, <1. Therefore, if (9) 
holds and w, is sufficiently small, | < |u| if Qi is sufficiently near 
to P, Q» falls between P and Q. From this, by a well-known type of argument 
[4], [5], [13], it follows that C is the limit set, for t > +00, of the integral 
curve of (1) through Q,. In other terms, (9) implies that C is stable. 
Similarly, the instability of C follows from (10). 

The usual proof of this theorem is based on the theory of variational 
equations (e.g., [2]). 


CoroLuary 4. On a semi-stable limit cycle C of (3), we have 
(X,+ Y,)dt =0. 
Cc 


(Cf. [V7], where no proof is given.) 
Hence, or directly from Proposition 3, we deduce the following: 


CoroLLaARY 5. A domain in which X,+ X, has a constant sign cannot 
contain semi-stable limit cycles. (Cf. [7], where no proof is given.) 


As is well known ([4], p. 78), if such a domain is simply connected, 
it cannot contain any closed integral curves, but otherwise stable—or un- 
stable—limit cycles can occur. 


PROPOSITION 6. A domain in which X, + Y, identically vanishes can- 
not contain limit cycles. 


Proof. If we revert to the notations of the proof of Theorem 1, we find 
that du,/du, == 1. Indeed, since corresponding points on J, and J, are 
connected by integral curves of (1), a relationship of the type of (5) applies 
not merely when u, = 0, but whenever u, is sufficiently near to 0. Hence 
U, = U, identically, as u. 0 when u, —0. Now, if we assume the existence 
of a closed integral curve and make J, coincide with J,, as in the proof of 
Proposition 3, this shows that all the neighboring integral curves are also 
closed. 


224 S. K. ZAREMBA. 


This proposition is usually proved by an argument based on the invariance 
of areas under the group of the given differential equation (e.g., [5], pp. 
92-93). 

By a first integral of (1) valid in a set D of the (2, y)-plane, we 
understand a function, say J(a,y), satisfying the following conditions: 
(i) J(z,y) has continuous first-order partial derivatives throughout D; 
(ii) J,(a, y) and J,(a, y) do not vanish simultaneously in D except, possibly, 
at singular points of (1); (iii) J(z,y) is constant along any arc of an 
integral curve of (1) contained in D. 

The following theorem concerning first integrals and proved by Hl. H. 
Alden [1] can be obtained as an almost direct conseqquence of Corollary 2. 


THEOREM 7%. Let the origin be a centre (in the sense of Poincaré)? 
of (1) and let D be a domain bounded by a closed integral curve and swept 
by closed integral curves of (1) surrounding the origin. Then if, throughout 
D, X and Y have continuous first order partial derivatives and do not vanish 
simultaneously, except at the origin, there exists a first integral of (1) valid 
in D. 


Proof. Let J(x,y) be the area bounded by the integral curve, say C, 
of (1) passing through the point (x,y). If we make J(0,0) =0, J is 
evidently continuous throughout D and satisfies (iii). The coordinates of 
the running point on C being é and », let = $(t), 7 = y(t) be the solution 
of (3) satisfying ¢(0) =z, ¥(0) =y, and let w be the smallest positive 
value of ¢ for which ¢(t) = 2, y(t) =y. Finally, let o, and o, be the lengths, 
counted as in Corollary 2, of the ares on the integral curves of (4) passing 
through the points (z,y) and (é,7) respectively. It is easy to see that the 
partial derivative of J in the direction of the vector (—Y,X) is equal to 


€ § do./do, ds, s being the length of the are of C and e=+1, namely 
Cc 


e=-+1 if the vector (—Y,X) points to the outside of C and «=—1 
if it points to the inside. According to (8), this becomes equal to 
R(x, y), z(x,y), where 


The expression obtained for the aforesaid partial derivative being continuous, 


* We recall that, according to Poincaré [13], a singular point of (1) is called a 
centre if, at this point, D(X, Y)/D(#x,y) #0, and if all the integral curves passing 
through a certain neighborhood of this point are closed (Since Bendixson [4] this 
term has also been used in a much wider sense). 


= 
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and the partial derivative of J(z,y) in the direction of the vector (X, Y) 


identically vanishing, we find 
(12) y) = y) 2(@, 9). 


In (11), » is a continuous function of x and y, which is bounded in the 
neighborhood of the origin, since the absolute value of 


d/dt tan? y/x = (yX — zY)/(a? + 
= {y[X.(0, + X,(0, 0)y] —2[¥.(0, + Y,(0, 0)y]}/(@*? + y’) 


has a positive lower bound in the neighborhood of the origin, the quadratic 
form in the numerator of the last expression being necessarily definite in the 
case of a centre [10], [13]. Hence z(2, y) is bounded in the neighborhood 
of the origin and therefore J,(z, y) and J,(x, y) both tend to zero as zx, y > 0. 
On the other hand, according to (11) and (12), | Je(z, y)| + | Jy(a, y)| > 0 
whenever |x|+]y!/>0. Thus J(2,y) isa first integral of (1) valid in D. 


III. The existence of closed integral curves. As mentioned before, it 
is well known that if X(2,y) and Y (a, y) have continuous partial derivatives 
of the first order in a simply connected domain D, and if througnout D the 
divergence y) = + Y,(2, y) has a constant sign, the differential 
equation (1) admits of no closed integral curve * contained in D. 

Incidentally, this theorem, which was first stated by I. Bendixson in the 
form of a somewhat casual remark ([4], p. 78), seems now to be generally 
known owing to H. Dulac [7], who also made the rather obvious generalization 
of allowing X (x,y) and Y (2, y) to be multiplied by a variable factor z(z, y). 

We shall prove a theorem which can be regarded as partly reciprocal 
to Bendixson’s theorem, and which shows that, in conjunction with a suitable 
factor z(2, y), this criterion can give information about the existence of closed 
integral curves when it is most likely to be required. 

Let us first examine the cases when the question of the existence ef closed 
integral curves can be answered without deliberation. It is well known that 
the Kronecker index of a closed integral curve of (1) with respect to the 
vector (X,Y) is + 1; ef. [9], [15], [16]. Hence the sum of the indices 
of the singular points in the interior region of a closed integral curve also 
amounts to + 1, and therefore a simply connected domain containing no 
singular points with positive indices cannot contain closed integral curves. 


? We recall, that according to the generally accepted convention, integral curves are 
considered to terminate at singular points. Exceptionally, however, we shall regard as 
a closed integral curve also the sum of a finite number of integral curves (in the pre- 
ceding sense) if it forms a simple closed curve. 


15 
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On the other hand, singular points of index greater than 1, and singular 
points of index 1, but of a topological type other than that of a centre, or of 
a nodal, or spiral point admit elliptic sectors in the sense of Brouwer [6], so 
that in their presence the problem of the existence of closed integral curves 
is solved in advance. 

Apart from the case of a centre, in which this question is again auto- 
matically answered, the singular point must then be of the topological type 
common to nodal and spiral points, and characterized by the property that 
every integral curve passing through a certain domain U, containing the 
singular point, leads to this point in one, but always one, and always the 
same, of the two directions determined by the increasing, and by the decreasing 
If, at the singular point, D(X, Y)/D(2, y) ~0, either all the integral 
curves reach it with definite tangents (and the point is then called nodal), 
or they do so winding themselves about it infinitely many times (and the 


point is then called a spiral point). In both cases, 
(12a) D(X, ¥)/D(2,y) >. 


since J = + 1, and the singular point is reached only with t oo [10], [13]. 

The largest possible domain U, i.e. the locus of all the integral curves 
terminating at the singular point under consideration, will be called the 
domain of influence of the given singular point. The components of the 
boundary of this domain are clearly closed integral curves of (1), possibly 
passing through other singular points, or through the point at infinity, or 
even reduced to one of these points, which must in any case be excluded from 
U. Thus the domain of influence can also be described as the largest domain 
containing the given singular point, but containing neither parts of closed 
integral curves of (1), nor other singular points, or points at infinity. With 
a few additional, but not very restrictive assumptions listed below, we can now 
state the following theorem: 


THEOREM 8. Let the functions X(z,y) and Y(x,y) have continuous 
first and second order partial derivatives throughout the (x, y)-plane, and 
let Equation (1) have a spiral, or nodal, point with a simply connected 
domain of influence, which will be denoted by D. If at this singular point 
=X,+ and if F(x, y) is a function not vanishing at the 
singular point and admitting continuous first order partial derivatives through- 
out D, but otherwise arbitrarily given, there exists a function z(x,y) defined 
throughout D with continuous first order partial derivatives and identically 
satisfying the condition 


(13) [2(2, y) X(a, y) Jo + [2(2, y) (2, y) Jy = F(a. 9). 
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Moreover, tf X(x,y), Y (x,y) and F(z, y) are analytic in D, z(x,y) 1s also 
analytic in D; on the other hand, if F(a, y) has a constant sign in D, 2(2, y) 
also has a constant sign in D. 


Proof. Obviously, we can assume that the singular point under con- 
sideration is at the origin of the coordinates. Without impairing the generality 
of our argument, we can also assume that 


(14) 8(0,0) < 0. 
Equation (13) can be rewritten in the following form, 
(15) A (2, Y)%e + V(X, y)%y F(x, y) y) 


which shows that this is a first order linear partial differential equation in z, 
whose characteristics are given by the equations 


(16) du/X(z, y) dy/Y (a, y) dz/{ F(z, y) y)}- 

The projections of these characteristics on the (2, y)-plane are clearly 
the integral curves of (1). If x—2(t), y=vy(t) is a solution of (3), the 
unknown function z, considered along the corresponding integral curve, as a 
function of ¢ [where ¢ is determined by (3) ], satisfies the ordinary linear 
differential equation 


dz/dt + 8[x(t), 


Because of (14), the point x(t), y(t) tends to the origin as t> +o 
[10], [13]. Hence our solution of this equation must remain finite for 
t—+-+o. But it is easy to see that there is only one such solution; it can 


be expressed by the formula 


+00 
and we find that P 
t>+00 
On the other hand, it is fairly obvious that, in our case, there exists a 
family of homothetic ellipses centred at the origin and, in a certain neighbor- 
hood of the latter, nowhere tangent to the vector (X,Y). It is clearly 


sufficient to prove it for the linearized vector field (Xo, Yo) defined by 
X, = Ax + By, Yo = Cx + Dy, where 


Aun X,{0,0), B= X,(0,0), C=¥,(0,0), 


and this can best be done as follows: 
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In view of A+ D=8(0,0) #0, we can assume AD > 0, for, if this 
inequality did not hold, we could, by a rotation of the axes of co-ordinates 
(which leaves A + D invariant), make A = D and hence AD > 0. Now the 
equation of the family of ellipses can be written as ux? + by? =c, where c 
is the parameter of the family. The condition that the ellipses should never 
be tangent to the vector (Xo, Yo) is that the quadratic form a(Az + By)zx 
+ b(Cx + Dy)y should be definite; verifying that this can be obtained with 
positive values of a and 6 is a matter of simple algebra and is based on the 
inequality AD(AD— BC) > 0. 

Any of the aforenamed ellipses, if sufficiently small, is intersected exactly 
once by every integral curve of (1) passing through a certain neighborhood 
of the origin. Having arbitrarily chosen one of these ellipses and denoting by 
w the parameter of a point thereon, let us, further, denote by [2(t, w), y(t, w)] 
the solution of (3) determined by the condition that [x(0, w), y(0, w)] should 
be the point of the ellipse corresponding to the given w. We have clearly 


(19) D(x, y)/D(t,w) 


throughout D, with the obvious exception of the origin. Then (17), with 
the substitution of z(t,w) and y(t,w) for x(t) and y(t), defines z(2, y) 
throughout D, and, since the relevant integrals are uniformly convergent, 
this is a function which has continuous first order partial derivatives in 7 
and y and satisfies (15); further, the shape of (17) shows that if F(z, y) 
has a constant sign in D, z(z, y) also has a constant sign. 

The only point of D where this argument does not apply, and indeed 
where z(x,y) has not yet been defined, is the origin. Owing to (18), if we 
make 2(0,0) = F(0,0)/8(0,0), z(z,y) becomes continuous at the origin, 
at least when the origin is approached along an integral curve of (1). 

Now we shall see that if X(z,y), Y(z,y) and F(2,y) are analytic 
throughout D, z(x,y) is also analytic throughout D. The point c—y=0, 
z= F(0,0)/8(0,0) is a singular point of (16), at which the Jacobian of 
the denominators equals — 8(0,0) D(X, Y)/D(2,y) 

Poincaré’s equation in S [14] becomes 


X,—S Xy 0 
Y,—S 0 = 0. 
F,—2#, F,—2#, —s—S 


It has one positive root, S =—68(0,0), and two other roots which are both 
negative in the case of a nodal point of (1) and imaginary conjugate with a 
negative real part in the case of a spiral point of (1). In both cases [14, 
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p. 170 and p. 175 in the Oeuvres], there exists exactly one solution of (15) 
passing through the singular point of (16), and this solution is analytic. 
It clearly must coincide with z(z, y), as defined by (17) ; therefore z(z, y) is 
analytic throughout D. 


IV. The non-analytic case. In the non-analytic case, no indication 
about the behavior of z,(a,y) and z,(, y) at the origin can be derived from 
the literature concerning singular points of systems of differential equations 
(e.g. [9], [11], [12]); a study ad hoc is, however, possible owing to 
Theorem 1. 


We first reduce the general case to that when 
— (20) 8(0, 0) = F(0, 0); 8.(0, 0) = F'x(0, 0) ; 8,(0, 0) = F,(0, 0). 


It is easily verfiable that, in view of (12a), this can be achieved by sub- 
stituting, in (15), Lly+M)-X(a,y) and (Kx+ Ly+ M)-Y(2,y) 
respectively for X and Y, the constants K, L, M being determined by 
a set of three linear equations. Then the originally required factor is 
(Ke + Ly+M)-2z(2,y). The continuity of z,(z,y) and z,(z,y) at the 
origin will now follow from the relations 


lim 2,(2,y) =0, limay(2,y) =0; (x,y) (0,0). 


which, in turn, are equivalent to the statement that the partial derivatives of 
z(x,y) in the directions of the two mutually orthogonal vectors (X, Y) and 
(— Y,X) tend to zero as the point (2, y) tends to the origin. 

On the other hand, the expression given for z(x,y) by (17) is linear 
in F(z, y) and is constantly equal to 1 if F(z, y) =8(z, y) ; hence 


where G(x, y) =F (x,y) —8(a, y). 
Thus 


x exp 8[a(r), y(r) |dr} dT. 


As it was pointed out, the neighborhood of a nodal, or spiral, point can 
be covered by a family of homothetic ellipses without contact with the integral 
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curves. Hence there exists a positive number a such that, on any integral 
curve of (1) in a neighborhood of the origin, 


+ [y(T)]? S a*{Lo(t) + [y(t)]*} whenever ¢ ST. 


Owing to (14) and (20), there exist numbers 8, 8,, and 8 such that, in a 
neighborhood of the origin 


(22) [| @(a,y)| << B + 9°); (23) <0. 


Hence, in a sufficient small neighborhood of the origin, 


+00 


(28a) | dz/dt | < + [y() 1) (8— 
= Bl 1 + + Ly(t)1°- 


The partial derivative of z along the vector (X,Y) equals R-*dz/dt and 
since, in the neighborhood of the origin, 


(24) +9"), 
where the constant A is positive, it follows from (23a) that the partial 
derivative of z along the vector (X, Y) tends to zero as (2, y) — (0,0). 

In order to obtain an evaluation of the partial derivative of z along the 
vector (— Y,X), we assume that a point (2, yo) of D, sufficiently near to 
the origin, is given. Through this point, we trace a solutien, 

(25) t= =7(%), 
of (4), the parameter wu, vanishing at the point (2,4). The solution of 
(3) satisfying (25) for ¢=0 can be written as 


(26) t= U1), y= y(t, u). 


Accordingly, we rewrite (21) as follows: 
(21) f w), y(t, 


t 
x exp (f 96, do} ae 


In order to facilitate the differentiation of this expression with respect 
to u,, we will change the variable of integration in the following manner: If 
u, is sufficiently near to zero, every point of (26) corresponding to a positive 
t is connected by an integral curve of (4) with a point « = 2(T, 0), y = y(T, 9) 
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of the curve (26) corresponding to wu; 0, so that ¢ can be regarded as a 
function, say x(T,u,) of T and u,. Let 


¢(7, U1) = 2[x(T, U1), ul, = Ui), Us]. 


Noting that, for a constant T, —94(T,u,), y=y(T,u,) are parametric 
equations of an integral curve of (4), we can introduce on it, for a while, 
a parameter wu. satisfying (4). Then, applying Formula (5), we find, for 
u, == 0, 


= — (x, y)/[R(z, 9) expt f (+, 0), 0) Jar}, 


dy /du, = X (x, 0), (x, 0) 1dr}, 
0 
where z = (7,0), y=y(T, 0), and therefore, owing to (24), 


(28) (T, | (T, 0)| << 0)]? + [y(L, 0) 


With the new variable of integration, (27) becomes 
fo) 


(29) z[&(u1), n(ur) | =1— f ui), | 
x exp{I[x(T, Ur), | AL; aT, 


where J (¢, — | 7 (6, ui), us) and consequently 
0 


Of course, 
(31) = 1, 


but in order to differentiate (29) and (30), we need an explicit expression 
for x,. It can be obtained from (7), where we have to substitute —Y and X 
for X and Y respectively. Denoting by v a parameter satisfying (6), we find 


_ {R[¢(T, 0), WT, 0)]}? 


where in X, and Y,, the values of x and y as functions of v should be sub- 


(32) x, (Ty exp( { 


stituted, and where v, and vz denote the values of v at the intersections 
with the curves (7,0), y=y(T,0) and c= ¢4(T,u), y= ¥(T, 
respectively. 

If, in the neighborhood under consideration, | X,|, | | Yy|<o, 
the partial derivative, with respect to wu, of the first factor in (32) is, owing 
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to (28), less than 80A~*{[¢(7,0)]*? + [v(7, 0) ]*}-? in absolute value for 
u,=0. .\s to the integral in the second factor, by relating v to a parameter 


Uz satisfying (4), and by applying (5), we obtain 


lim (v2 — v,) ‘UU, = {R[ d(T, 0), ¥(T, 


u,>0 
x exp( f 3[4(@, 0). 0) ]d0}, | 
and therefore, for u, = 0, | 
X,[9(T, 0), 0) |] — V2[¢(7, 9), 0) ] 
d/du; X,— Y,)dv = — 
J, (R[ 0), 0)]}? 
x exp{ 8[4(0. 0), 0) ]d0}, 


which is less than 2wdA-*{[ (7, 0)]? + [y(7T, 0)]?}> in absolute value. Both 
factors in (32) becoming equal to 1 when wu, = 0, we conclude that 


(33) Xpy,( 2s 9) | < 100r*{[ 0)]? + [y(T, 0)]?}". 


It follows from our assumption that there exists a number p such that, 
in the neighborhood under consideration, | G,(x, y)', | G(x, y)| < p(a? + 
Then, according to (28), 


(34) | uw), |/dur | < 2p/d. 
Similarly, if | 8.(z,y)|, | y)| <o, we have, for = 0, 
(35) G(T, wu), ur) | < 2o{[o(T, 0) + [y(T, 0) ]?}4/. 


If we differentiate (30), with respect to u, and take into account (23), 
(31), (33), and (35), we find, for u,; = 0, 


| [x(T, ts), us] | 
T 
< {[6(@, 0) + [¥(0, 0) 
+ 100 | 0) + 0) 


On the other hand, from the properties of the previously introduced constant 
a it follows that, for 0 [OST, 

[(@, 0) + [¥(@, 0) ]? = (7, 0) ]* + [y(7, 0) 
Hence there exists a constant v satisfying, for u, — 0, 


| ur), | {[ p(T, 0) + [y(T, 0) }?}*. 


(36) 
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Now, if we differentiate (29) with respect to u:, noting that the relevant 
integrals are uniformly convergent and taking into account (22), (23), (31), 
(33), (34), and (36), we find that | dz[€(w1), (uw) ]/du: | is less than 


10wB) /A? + Br = (2pr + 1008) /(| | A2) + 


hence bounded in the neighborhood of the origin, and since, when multiplied 
by R(X, Yo), it gives the partial derivative of z(a,y) in the direction of the 
vector (— Y, X), the latter derivative tends to zero as (a, y) — (0,0). 

Thus both partial derivatives of z(x,y), which are continuous in D out- 
side the origin, tend to 0 as a7,y-—>0. In view of the fact that z(2,y) is 
continuous at the origin when approached along integral curves of (1), now 
it is easy to see that this function is continuous with continuous partial 
derivatives throughout D. Hence the proof of Theorem 8 is complete. 
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ON THE HOMOTOPY TYPE OF AN 4,°-POLYHEDRON (n 2 3).* 


By KENICHI SHIRAIWA. 


1. Introduction. It is an important topological problem to determine 
the homotopy type of a space by means of its algebraically computable invar- 
iants. Only certain special cases have been solved. 

In 1949, J. H. C. Whitehead [3] proved that two simply-connected 
4-demensional polyhedra are of the same homotopy type if and only if their 
cohomology systems are properly isomorphic. He then introduced a special 
type of a polyhedron, a generalization of a simply connected 4-dimensional 
polyhedron named an A,”-complex which is (n —1)-connected and at most 
(n + 2)-dimensional. He proved an analogous result in [4]. A related 
result is the proof of S. C. Chang [6] that any A,?-complex (n = 3) has the 
same homotopy type as a union of the elementary polyhedra (See section 4 
below) with a single point in common. 

This raises the problem of the homotopy type of an 4A, °-polyhedron, 
which is (n—1)-connected and at most (n-+ 3)-dimensional. May an 
analogous result be obtained? H. Uehara has established [10] the relevant 
fact that, roughly speaking, the homotopy type of an A,°-complex (n > 3) 
with vanishing (n + 1)-homotopy group is completely determined by means 
of its cohomology groups and Sq? of Steenrod [8]. 

The main purpose of this paper is to determine, in general, the homotopy 
type of an A,°-complex (n = 3) by means of certain cohomology operations. 
These operation are Steenrod square, Sq’, Adem’s operation [9], and its 
modification introduced by N. Shimada-H. Uehara [11]. Our method and 
the theorem obtained here are quite analogous to those of J. H. C. Whitehead 
[3], [4]. The author is deeply grateful to Messrs. N. Simada and H. Uehara 
for their kind criticisms and encouragement. 


Throughout this paper, we assume n = 3. 
2. A,°-cohomology systems. First we shall consider the case n > 3. 
Let H" (r=0,1,2,- - -) be finitely generated abelian groups which satisfy 


the following conditions. 


* Received May 8, 1953. 
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i) H*=0O for all values of r~0, n,n+1,n+2,n-+3. 
ii) H° is a free cyclic group. 
iii) #H" is a free abelian group. 


We now introduce another group, using the notation that if A is any 
additive group, then ,,A is the subgroup of all elements ae A such that 
ma=0Q, and A, is the residue group A—Am (m>0). We define 
A*m(mH"*!) (m= 2,4), to be the image of in an isomorphism, A*,p, 
and define H"(m) as H'(m) = H'» + A*m(mH"*). We do not here regard 
A*,, as uniquely determined by the system, but we do postulate a uniquely 
determined homomorhpism A,,:H"(m) — such that A,,(0) = and 
AmA*m = 1. We also postulate pmo: H"(m) as the natural map onto 
the summand H’,, C H"(m). We next define H"(4) H'(2) as follows: 
vo,, | H% is the natural map onto the summand H*, C H"(2) and 
po, | A*s(4H"*?) is defined by the formula 


== for re A*,(,H*"*). 
We also define ps2: H"**(2) = H"**, H"*, = H"**(4) by 
mod 2H"*’) = 2x mod for ze 
Finally, we postulate y,: H"(2) H™*(2), (r=n,n+1), and 
Ni = (0) (2) (2) = Ma, 
Ny —= yn (0) N (4) (2) = Ms 


where WN is the subgroup of H"(2) consisting of all elements v such that 
A,v=0. They may be arbitrary homomorphisms. By an 4A,°-cohomology 
system H (n > 3) we shall mean a set of groups H" (r=0,1,-- -) H"(2) 
(r=n,n+1,n+2,n+3) H"(4) (r=—n,n-+ 3) together with homomor- 
phisms Am, pmo (mM = 2,4), yr (r= n,n +1), 2) of the kind just 
described. 

Let H, Ht be two A,°-cohomology systems (n > 3). By a (A, »)-homo- 
morphism f: H — Ht, we mean a map such that f|H*, f| H*(m) are 
homomorphisms into the corresponding groups Hit, Hit(m) and fAm = Anf, 
fm.o = Pm,of Where Am, #m,o Mean the same in Ht as in H. If f is a (A, p)- 
homomorphism and fy, = y,f, then f(N,) C N,t, f(N2) C N.t and f induces 
the homomorphisms M, into M,+ and M, into M.*. Therefore, when f com- 
mutes with y,, f®;, ®f have the same meaning. By a proper homomorphism 
f:H— Hi, we mean a (A,p)-homomorphism f: H — Ht which commutes 
with y, and also 
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We have defined an A,°-cohomology system when n> 3. We shall 
modify these definitions in case n= 3. This happens because in an A;°- 
complex its cohomology ring has a non-trivial multiplication. 

Let R be a ring which satisfies the following conditions (Cf. J. H. C. 
Whitehead [3]): 

i) R=>Ar(m), (r= 0,1,-- -) (m=0, 2,3,° -°). 

ii) All H*(m) are finitely generated abelian groups which form a 

spectrum of cohomology groups in the sense of J. H. C. Whitehead 
[2] with homomorphisms A,: H"(q) H"*, B"(q) H'(p). 

iii) Ht(m) =0 if r¥0, 3, 4, 5, 6. 

iv) H° is a free cyclic group whose generator is a unit element of R. 

v) In R# the multiplication satisfies the following conditions: 

a) Hm), ye H%m), then cy and ry = (— 


b) (y) = [m(m, r, s)/(m, 7) (mM, 8) where 
ze H?(r), ye H%(s), c= (r,8),m > 0. 


If, in the definition of an A,°-cohomology system (n > 3), we replace the 
groups H", H™(m) by R and homomorphisms Am, (m= 2,4) by Ap, 
Mp.q, then we get a definition of an A;°-cohomology system. By a (A, »)- 
homomorphism between two A,°-cohomology systems, we shall mean a ring 
homomorphism of one cohomology system into the other which commutes 
with A, and pp, A proper homomorphism is the same as n > 3. The groups 
in an A,°-cohomology system, H, will often be defined in terms of cochain 
groups (r=0,1,---,Ct=0 if r>n-+3). These are free abelian 
groups of finite rank, which are related by a coboundary operator 8: C’ > C™?, 
such that 850. By the usual procedure, the r-dimensional cohomology 
groups H*(m) with integers reduced mod m can be defined. For integers 
p > 0, = 0 two operators Ag, are defined such that A,: H"(q) ~ H™, 
My,q: H"(q) H*(p). Let xe At(q) and let 2’e2. That is to say, 2’ is a 
cocvele mod gq. Then 82’ = qy’, where y’ is an (r+ 1)-absolute cocycle. We 
define Ajz=y, a cohomology class containing y’. Let c= (p,q), then 
(p/c)2’ is a cocycle mod p, and we define p»,,% as its cohomology class mod p. 
It is easily verified that A,v and ppv depend only on ze H*(q) and not on 
the particular choice of ex. (See J. H. C. Whitehead [3]). 

Let H, Hi be A,°-cohomology systems, which are defined in terms of 
cochain groups C = {C’}, Ci = {C*"}. A cochain map g: C > Ct, obviously 
induces a (A, »)-homomorphism f: H > Hi. We shall say that f is realized 
by g. Then 
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Lemma 1. (Whitehead [3], [4]) Any (A,p)-homomorphism can be 
realized by a cochain map. 


3. The main theorem. Let K be an A,°-complex, so that the co- 
homology groups of K or the cohomology ring of K (in the sense of J. H. C. 
Whitehead) and homomorphisms defined below form an A,°-cohomology 
system. Among the homomorphisms, A, and pp, are the same as defined in 
the last part of section 2, and y, is the Steenrod square, Sq?: H"(2) — H"**(2) 
(r=n,n+1). Next we define ; as follows. 

Let xe N, = and a’ ex. Then S8q?(p2,.2’) = dv (mod 2) 
and let = $(2’ a’ — 2’) and ézt = (mod 2) and Sq°Sq'a = 

n n-1 


(mod 2), where = at LU zt + at dat. Sq?Sq?x’ = 83?v (mod 2), where 
n-3 n-2 
=v dv. Adem defined in [9] a cochain map 
n-1 n 
E;: C1(K*) (K) 


and proved the formula = + Sq*8q'a’ (mod 2). From 
these facts we have the operation 


®, (x) = Egn_3(x’*) + 37v + mod Sq?H"*?(2). 
And similarly if ze (Sq?)-1(0) 
= 2H (at) + + mod (2). 
If Sq?A.z ~ 0 (mod 2), then it is easily verified that 
C 2H"™3(4) (2). 


They are obviously homomorphisms and depend only on xe WN, (or N2). 
(For these homomorphisms, cf. [9], [11].) We now state the main theorem. 


THEOREM 1. Any A,*-cohomology system is properly isomorphic to the 
cohomology system of some A,°-complecz. 


THEOREM 2. Two A,°-complexes are of the same homotopy type tf and 
only if their cohomology systems are properly isomorphic. 


THEOREM 3. Let K, L be A,°-complexes, then any proper homomor- 
phism f*:H(L) > H(K) has a geometrical realization f: K > L. 


It is easily seen that Theorem 3 implies Theorem 2. (Cf. J. H. C. 
Whitehead [3].) Hence we shall prove Theorems 1 and 3. 
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4. am2(K"*?). It is known (S. C. Chang [6]) that any A,?-complex 
(n = 3) has the same homotopy type as a union of elementary polyhedra 
with a single point in common. Here we shall write down the elementary 


polyhedra : 
Br 


Bz (a) = 8" U e"**, where e"** is attached to S" by a map f: den? Sn 
of degree o, a power of a prime. 


B,(r) = S"* U e**?, where is attached to by a map f: de"*? 
of degree 7, a power of a prime. 

B,=S8"U e**, where e”*? is attached to S" by an essential map of 
de"*? onto S”. 


B;(2”) = 8" U U ent? B, U e"*1, where e"*? is attached to 8S" C By 
by a map f: de"* > 8" of degree 2?. 

Bo(22) = (S" S***) U where is attached to S"** by a 
map f:de"*?-» of the form where a denotes an 
essential map of de”*? onto S" and b maps de"*? onto S"*? with degree 24. 


B; 22) (S" V §**) U em? U = (22) ent, where e”*1 is 
attached to S" C B,(22) by a map f: de"** > 8” of degree 2?. 


To obtain a reduced A,°-complex, we must compute mn.2(K"**) of an A,?- 
complex K"*? (Cf. [10], [11], [13], [14].). , 
LemMa 2. Let Z denote the group of integers. Then 
i) = Ze, (8"**) Zo, (S"**) = Z. 
ili) anio(B.(c)) =0 if o is odd. 
ill) its generator is represented by. a map 
covering e"* essentially and 2é. is represented by an essential 
map f:S"*?—> 8" C B,(2). 
iV) (B2(2?)) (if p>1) its generators and ye; 
é, is represented by a map covering e"*' essentially, and np» ts 
represented by a map covering 8” essentially. 
v) ) = 0 if r is odd. 
Vi) mnse(B3(22)) =Z.; its generator & is represented by a map 
covering essentially. 
Vii) anw2(Bys) =Z its generator w, is represented by a map covering 
en*? with degree 2. 
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Vili) =Z + Zz, its generators ts, where 2g; = 0. ts 
represented by a map covering e"** with degree 2, and &; 1s 
represented by a map covering e"** essentially. 

ix) = Z2; tts generator ts represented by a map 
covering essentially. 
X) (Bz(2?, 22)) =Z.+ Z, its generators &,2 where 1s 
the same as is used in ix) and &;,. is the same as is used in iv). 
Lemma 3. If K is m-connected, m= 1, and L is n-connected, n= 1, 
then 
mi(K L) fori Sm-+n 
Tm+n+1 (K V L) (K ) + (K) ® Tnii(L) 


where the isomorphism of the tensor product is defined by means of the 
Whitehead product. (This was pointed out by the referee.) 


One uses a theorem of G. W. Whitehead [15] and calculates 
X L,K L) by the Kiinneth Theorem and the Hurewicz Theorem 
to prove this. 

Let K"**? be a reduced A,?-complex described as follows (in all the 
summations there exist finite terms). 


+ + Boi(oi) + Bs i (2) 


qi>1 o;,odd 


+ Bsi(ri) + Bai + + Boi (2) 


T, odd 


From Lemmas 2 and 3, if n > 3, mn42(K"**) is a direct sum of the (n + 2)-nd 
homotopy groups of the elementary polyhedra of which K"** consists. If 
n = 3, m5(K*) is the direct sum of the 5-th homotopy groups of the elemen- 
tary polyhedra and of the group W generated by the Whitehead product. 
For later uses, we shall write the generators of an,.(K"**) as follows 

i) 3 Tn+2 2) 3 (S"**, 3 (1, iy 

tn2(B2i(2)) 3 Tns2 (Boi (2%) ) 3 

(Bz i (2%) ) 3 i, Tn+2 3 l4, is 

Tns2(Bs,i( 2") ) 3 & 


Tn+2 (B;,i (27%, 24) ) 3 €3,1,4 


al 
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5. Reduced A,*-complex. In order to define a reduced A,*-complex, 
we first introduce the complex K, which has its (n + 2)-skeleton K"*? as 
described at the end of Section 4, and we shall transform K to a simpler 
type of an A,°-complex, which we shall call a reduced A,*-complex. For this 
transformation, we use the following Lemmas by J. H. C. Whitehead [5], [2]. 


Lemma 4. Jf two spaces P, Q are of the same homotopy type, and 
f:P>Q is a homotopy equivalence, and if a map a:de"** P is gwen, 
then the attached spaces P U e"*? and Q U e"™**, are of the same homotopy 
type, where e"*! is attached to P by a map a: 0e"*t > P and e", is attached 
to Q by a map f- a: > Q. 


Lemma 5. If X, Y are simply connected CR-spaces (a connected 
compactum which is an A.N.R.), then a map f:X—Y is a homotopy 
equivalence if each of the induced maps H"(X) H"(Y) (n=2,3,- +) 
is an isomorphism onto. 

We shall fix K"**. Let (7 =1,2,-- +) be principal (n + 3)-cells 
of K. Then K is an attached space, i.e., e"**; are attached to K"*? by maps 
fj: de"*®; K"*?, Hence f; determines an element of an,2(K"*?). 

We shall define B: Cni3(K) = K"*?) — (K"**) by the above 
correspondence; then 8 is a homotopy boundary homomorphism. We shall 
now write K as {K"**, Be"**;}. Henceforth we assume n > 3. 


Let Ber?) = Darts + Dats 


where the a are integers, the 6, c are integers mod 2 and the d are integers 
mod 4. Then we shall write {K"*?, Be"*5;} by the matrix form 


ems, [a dj. 


Now we consider homotopy equivalences of K”*? onto K"*?. Using Lemma 4, 
we shall simplify the matrix (*). From Lemma 5 it is easily verified that 
all the following maps are homotopy equivalences of K”** onto K"**. 


I.1) §***,; is divided into two hemispheres and the one is mapped 
onto S”**,; with degree 1 and the other is mapped onto S"**, ; 
(i 7) with any degree, or onto (A = 1, 6, 7) essentially, 
or onto ;(2%) essentially, or onto e”*', ;(2%) essentially, or 


1.2) 


1. 3) 


I. 4) 
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The remaining part of K"* 


onto (A= 5,7) essentially. 
is mapped identically. 


S"**,; is divided into two hemispheres and the one is mapped 
onto S"**,; with degree 1 and the other is mapped onto 8"; 
essentially, or onto ;(2%) essentially. The part of K"? 
excluding 8"*?,; is mapped identically. 


An(n + 1)-cell or an (n + 1)-sphere belonging to K"*? is divided 
into two parts and the one is mapped onto itself with degree 1, 
and the other is mapped onto any n-sphere of K"** essentially. 
The cells of K"** excluding the (nm + 1)-cell or the n-sphere are 
mapped onto themselves identically. 


B.;(2) is divided into two parts and the one is mapped onto 
B.;(2) with degree 1 and the other is mapped onto B,;(2) 
(i4j) with degree 1. The cells of K"** excluding B.;(2) are 
mapped identically. (The reader should be referred to the 
following operations for detailed accounts.) 


Using Lemma 4 and Lemma 2, we shall translate the above homotopy 
equivalences to allowable operations in the matrix (*) which do not alter the 
homotopy type of K. That is to say, without change of the homotopy type 
of K and K"**, we shall simplify the matrix which represents .attaching maps 


of 


The following operations are allowable 


I. 
IL. 
Il. 
IL. 
II. 5) 
II. 


1) 
2) 


II. 


Any two rows or columns are exchangeable. 

Any column can be added to any other column. 

The elements of any column can have their signs changed. 
row can be added to row (17). 

t:,78 Tow is reduced mod 2 and is added to &’s row. 

t:,78 row is reduced mod 2 and is added to 7’s row. 


4,18 row is reduced mod 2 and is added to £”’s row by considering 
the non-zero element mod 2 is the generator of the group of 
integers mod 4. 


t,i8 row is reduced mod 2 and is added to €”’s row by considering 
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that the non-zero element mod 2 is the element of order 2 of the 
group of integers mod 4. 


II.9) £s row is added to 7’s row. 


IT.10) és row is added to &’s row considering the non-zero element 
mod 2 is the element of order 2 of mod 4. 


II.11) £’s row is reduced mod 2 and is added to 7’s row. 
IT.12) £2,’s row is added to é.,7s row (tj). 


If. 1)-II. 3) are trivial operations; IT. 4) 5) 7) are based on I. 1); II. 6) 8) 
are based on J. 2); II.9)-II.11) are based on I. 3); and II. 12) is based 
on I. 4). 


em’; |a@ b TI,1,2,4) Eo * * 
* 8 6 Fi * 000 0 
IT, 5,6, 7,8) |#2 * * * * * * 0 
0 F 
0 O 
* 00 0 0 BF, * 0000 
* 0 £0 * 0 £8 
* * * QO * 
IT, 9) 2 II, 1, 2) 
* * Q Q E, * 
G * -0 
QO * @ 0 | 
g ly lgls é 
* 0 


* 


3 


II, 9) 0 F| II,1,2) G * 0 
G0 0 G0 0 0 
& £ 0 * 0 0 


II, 9, 10) 


II, 1, 2) 


IL, 2) 


ty 


Oo #6006060 638 


Let 
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IT, 1, 2) 


II, 1, 2,3) 


EB, 


6. Cohomology operations in a reduced A,°-complex. 
reduced A,°-complex (n > 3). 


D*om 


oo x¥ 


(j= 1,- +, 80,8 +81 + 82+ 83 +, 8+ 8: + 82 +83 + &) 


0 
0 0 0 
00 0 
* 0 
* 0 0 
n € 
00 0] 
0 O 
00 0 
Go BE, 
00 FI 
00 0] 
(#) 


where the diagonal elements of H, are odd integers and the other elements 
of E, are even; all the elements of F, and F are even; the diagonal elements 
of F are 2 and the other elements of F are zero; the diagonal elements 
of G are 1 and the part of G lower and left to the diagonal consists of 0. 
We can now assume that any A,°-complex has the same homotopy type 
as the complex K with a matrix of the form (#). 
a reduced A,°-complex. By the cell of i-th kind, we shall mean an (n + 3)- 
cell en*3; ( <j S 2s). In case n = 3, the argument about 


We call such a complex 


of section 5 shows that, if we neglect the direct summand group generated 
by the Whitehead product, the above discussion is still valid. 


Let K be a 
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E 0 
| 
c= 
* 0 * 
0 0 | 
| 0 0 | I 
* 0 0 0 0 
| * 0 £0 
* 0 GO 
| 00 0 
00 0 0 
m tts € 7 
0000 0 So 
| * 0 £0 0 
— > |F, * G00 0 0 ts 
| 0 0 0 2E0 Ss 
000 0 0 
I 
a 
Cl 
H 
Si 
of 
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Darin: + + Da + 
(7 +1,° +, 81), where d**; =0 or 1. 


+ 
(J = 80 + 8 +, 80+ 81 + 82), 


where d”‘; is even. 


=S0 + 81+ S2+1,° +, 8 +81 + + 8s), 
where is even. 


Then 


ii) Let j run over all the indices of the cells of the first kind. 
iii) == > 
In all the cases above, we think of S8"*%,;,- - -, e"*2;(2) as cocycles mod 2, 
and e"**; as cocycies mod 2. 
From the matrix (#) of section 5, it is easily seen that the cocycles mod 2 


consisting of the cells of the first kind form a base of Sq?H"*1(K, Z.). 
Thus, the cells of the i-th kind (i= 2,3,4) form a base of 


Z2)/Sq?H" (K, Zo). 
Hence we can write 


iv) =X ©, (8%.4(2?)) = where j runs 
over all the indices of the cells of the second kind. 
®, (2?¢) ) > 


Similarly, in 2H"**(K,Z,), the cells of the second kind form a base of 
M4,29q°H"**(K,Z.). Then, the cells of the i-th kind (1 = 2, 3,4) form a base 
of Zs) (K, Zo). Hence 


| 

8 
$ 
8 
e 
x 
) 
d 
a 
) 
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v) ,(8";(2)) = > 1/2d**;e"*5;, where j runs over all the indices of 
the cells of the second and the third kind. 


To prove ii)-v), the usual method is applicable. That is to say, using 
an appropriate mapping of K into M U where e"*® is attached 
to S"*! by an essential map of de"** onto S”*?, or using a map of K into 
L = 8" U where e”** is attached to 8” by an essential map of de"** onto 
S", ii)-v) can be proved. 

The above relations are still valid in case n = 3. (For detailed accounts, 
the reader is referred to [9], [10], [11].) 

In case n = 3, we read the relation between cup products and Whitehead’s 
products. Let us introduce a lexicographic order in the 8" of K”*? by the 
subscripts of the 8" and let Z(S8";i, S"~,i°) be a cyclic group of order 
d= (g,h), where g (or h) is the order of S",; (or when is 
considered as an element of H,(K,Z). Then the direct summand W generated 
by Whitehead’s products of zp,2.(4"*?) is a direct sum 

Z i, S" 

(k,4)>(k’, i’) 
Let ¢ be a projection of an,2.(K"**?) onto W, and let 
go Ben; — > mkt TS" i, Sy 
(k, 4) >(k’, 1”) 
where [S”,, S",] is a generator of Z7(S",, 8"), and m**; is an element 
of Z(S",, 8",). 
Then 


vi) (S" x47) => (k, 1) > 


For the proof of these facts one uses the complex S* < S*. 


7. Proof of Theoreml1. (n>3). Let H={H", H™, H", 
H"(2), H™2(2), H™*(2), H"(4), H™*(4), A, py 
be an A,°-cohomology system. We shall construct an A,*-complex K, which 
has an A,°-cohomology system that is properly isomorphic to a given H. We 
assume (without loss of generality) that y,:H"(2) — H"*?(2) has a normal 
form, i.e. if yn, is expressed in terms of a matrix form, each row or column 
contains at most one unity. (Cf. S. C. Chang and J. H. C. Whitehead [7].) 


Let Up, be generators of the summand of whose orders are 0, 
and let -,v% , be a generator of the summands of H* whose orders are 
powers of 2, and let W,- --,w*, be a generator of the summands of H* 


whose orders are powers of odd primes. Let 


fc 

® 

m 

N 
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un, ynte, 
000 000 00 04 
000 00 0a 
10 00 0 00 £0 Ola, 
000 £00 00 
om 10 00 0 0 0 db 
000 000 0 £03 
[0 00 0 £0 0 0 %& 

Co Cs 


be a matrix representing y,: H"(2) > H"**(2). Let 


+ ( 2") + S (24, 29), 


where 2% is the order of v"*!;, o; is the order of w”*;, 2% is the order of 
1S the order of w"*?;, is the order of v"*',,,;, 2"* is the order 
27+ is the order of aNd is the order of Let 


72,4 


where (7 tnis + + corresponds with the generator of 
Now we shall modify Ni = > (2) (2) as 
follows. As yniiHl"*'(2) is a direct summand of H"**(2). There is a map 
61,: N, H"**(2) which induces ®,. Next is a direct sum- 
mand of 2H"*(4) and M NN is a_ direct 
summand of Nt,=y"n(0) M so there is a map 
Ni, — 2H"**(4) which induces ®,. Let 


Then we shall attach e"**; (7 =1,- +, + + Pris) to as follows. 


i) Ber?; = 


} 
e 
il 
) 
), 
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where a’; is the order of w"*%;. 


ii) Bers; = + bY + 5 
+ + De + 
(j = +3) where a’; is the order of and 
+ > + > 
(j = tus + C63 +, tus +43) where a*; is a half of the order of 


(j = tus + + a3 +1,° >, + Quis), Where a*; is a half of the order 
of and = ‘dt, 


We have finished the construction of an A,°-complex, which has a 
properly isomorphic A,*-cohomology system, to a given H. The verification 
is very simple, from the construction. 


8. Proof of Theorem 3 (n>3). Let K, LZ be two A,°-complexes. 
If f: H(L) ~H(K) is a proper homomorphism of the A,°-cohomology 
system of L into that of K, we shail construct a map g: K — L which induces 
f. Without loss of generality, we can assume that K and L are reduced 
A,?-complexes. 


0! 
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By Lemma 1, f is realized by a cochain map f*#: C*(L) >C*(K). 
Let fy: C(K) > C(L) be a dual chain map of f*. We shall show that 
fy can be realized by a map g: KL. By the fact that f: H(L) > H(K) 
is a proper map, it is clear that f,|Z"*? induces a proper homomorphism of 
an A,?-cohomology system of L"*? into that of K"*?. So, by the theorem due 
to J. H. C. Whitehead, fy|K"*? can be realized by a map g: K"*?—L™, 
To proceed to the next step, we state a lemma due to J. H. C. Whitehead. 


Lemma 6. Let K, L be simply connected complexes and let e" be a 
principal cell, where, n > 2. Suppose that f: C,(K)—>C,(L) be a chain 
map such that the map f|C,(Ko) (r=0,1,---) can be realized by a 
cellular map go: Ky > L, where Kyo = K —e”. If goBe" = Bfe", then go can 
be extended to a map g: KL, which realizes the chain map f. 

Let a be a projection of mn,2(K"*?) onto its torsion group. Then, by 
the results of section 6, 

98 — = 98a — Buf = gl 
where [ = Br. 
By (#) of section 5 and the results of section 6 it is clear that 


(8.1) if +, 8,8 +8: + 
So + + So + 83 + Sq. 
(8.2) = & 4 e"*9; + 2) ) 
+ e"*8; + ent; 
(J = 80 +1,° +, 80+31). 
(J +5 +1,° +, 8 +58, + 82). 
(8.4) Tert8; == 2&2 ) 
+5 +52 +1,°- +, 5 +5: +5. +583). 
Considering that f* commutes with §, Sq? and ; and that the cochains 
of the 0-th, the first, the second and the third kind are the images of 8, Sq’, 
#, and ®, respectively, we can see that f* maps the cochains of the i-th kind 
of Z into cochains of the i-th kind of K (i= 0, 1, 2,3). 
Taking the dual map fy of f*, we see that fy maps the chain of the 
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i-th kind into the sum of chains of the i-th kind and of the fourth kind. 
So gf! —Ifz is zero on any cell of the o-th kind and fourth kind by (8.1). 
If e*5; is a cell of the first kind, it follows from (8.2) that 


where é7,; and S”"*',; belong to L. Since fze"**; is a sum of a cycle of the 
first kind and a cycle of the fourth kind, we get the following equality by 
using (8.1) and (8.2). 
This leads to g? —Ify = 0 on the cells of the first kind. If e"*’; is a cell 
of the second kind, then 
> i(fF®, (78", i) yes, + 
Since fze"**; is a sum of a cycle of the second kind and a cycle of the fourth 
kind, it follows by (8.1) and (8.3) 


Therefore gf! —Ifz —0 on the cells of the second kind. This is also true 
on the cells of the third kind by analogous arguments. This completes the 
proof. 


9. The case m= 3. The arguments about 2;(K*) and the results of 
section 6 enable us to prove Theorem 1 and Theorem 3 in case n = 3, as a 
slight modification of the proof above (n > 3). 
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ON RIESZ SUMMABILITY AND SUMMABILITY BY DIRICHLET’S 
SERIES: A FURTHER ADDENDUM AND CORRIGENDUM.* 


By C. T. RasaGopa.. 


1. Terminology. It is perhaps necessary to explicitly call attention to 
the terminology of my earlier paper with the same title [4], which is not 
always in accord with practice elsewhere. Let A(u) be a function of bounded 
variation in every finite interval of u = 0, assumed (for convenience) to be 
such that A(0) —0. We understand by the Riesz (integral) mean of A(z), 
of order r= 0, the function o,(x) = A,(x)/z", where 


A,(z) = rf (w)du, r>0, oo(@) = Ay = 


If A(x) is a A-step function defined with respect to a sequence 
O=Ay <A. << As Ano, ie, if A(z) is defined by . 


A(z) =a,+a.+° ++ dy for An < Ann, N= 1, 2,3,°° -, 


A(z) =0 for then o,(x) is the Riesz typical mean of > ap, 
of order r and type An (or A). In the special case A, =n, the Riesz typical 
mean is written R,(x) and called the Riesz arithmetic mean ; it can, for 
some purposes (e.g. Theorem ?’ of the next section), take the place of the 


Cesaro mean of order r= 0 of 5 a, defined by c,(n) = [ > Opnsstteai/Conm 
y=0 


The integral F(t) —{ e“td{A(u)}, t >0, whenever it exists, is called 
0 


the Laplace transform of A(u). Its special case, when A(wu) is the A-step 
function already defined, is the Dirichlet series > a,e-* called the D(An)- 
transform of >\a,; the further special case in which A, — 7 is the power 
series >) a,e-"* called the Abel transform (or mean) of > dn. 

A(u) is said to be summable to sum s by the r-th Riesz mean (r > 0) 
or by the Laplace transform (whether in the general case or in one of the 
special cases), according as o,(u) as or F(t) as t>-+ 0. 
Similarly > a, is said to be summable by the r-th Cesaro mean (r > 0) when 
Cy(n) >S as 


* Received August 15, 1952, substantially revised March 31, 1953. 
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The opening statement of my previous paper ([4], p. 371) should be 
understood to mean: When A(z) is a sequence or series > dn, its summability 
by the k-th Riesz arithmetic mean (k > 0) necessarily implies its summability 
by the Laplace transform in the form of the Abel mean. 


2. A theorem on summability by Riesz means. The following theorem 
appears in the concluding section of my earlier paper ([4], p. 377). 


THEOREM t. Write 


lim sup =@;,, lim inf o,(7) 


0 


lim Or lim Or Ox. 
Too 


Then Go (finite) ensures for ail sufficiently large r, 
sayr=ro+1. 


The previously known analogue of Theorem ¢ for Cesaro means ([3], 
p. 199), due to Littlewood, is as follows. 


TuHeEorEM ?’. In the case of the series > dn, let 


lim sup ¢,(n) lim infc,(n) —¢,, 


n> 


lim ¢, == lim ¢, = Cg. 


Then (finite) ensures for all sufficiently large r, 
say r=ro+1. 


Pennington ([3], p. 200) has recently pointed out that the proof of 
Theorem ¢ outlined by me is incorrect,’ and given an interesting alternative 
proof along entirely different lines ([3], pp. 201-206). However, it may be 
pointed out that my proof, with an obvious modification, is valid for Theorem ?’. 
For, the hypothesis of Theorem ?’ implies (i) that c,(n) is bounded as n —>0o 
for all large r, say r= 70, and (ii) that }a,e-"* converges for ¢ >0 and 
tends to s as t—»-+-0. The implication (i) has the consequence that R,(z2), 
as also B,(x)/x"*? = (r+ 1){R,(x) — RB,,,(x)}, is bounded as roo for 
r=>r, ({1], p. 113, proof of Theorem 58) and, taken along with (ii), 


*The mistake is in the assumption that the boundedness of the Riesz (integal) 
mean o,(@) as x %, for some r > 0, ensures the existence of the Laplace transform 
F(t) for t>0. Not even the convergence of o,(#) as #— © suffices to ensure the 
existence of F(t) for t >0. The opening sentence of [4] which has this wrong impli- 
cation should be corrected as in the last sentence of the first section of the present paper. 


0 rT > 
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establishes the summability of } a, by R,(x) for r= rp.) + 1, when we appeal 
to Theorem 1 of my previous paper [4]. But summability of } a, by R,(z) 
implies summability by c,(n) ({1], Theorem 58) and so the proof of 
Theorem ¢’ is complete. 


8. Some lemmas and their proofs. The proof of Theorem ? given 
above can be adapted to prove Theorem ¢, by means of the following lemmas 
of which Lemmas III and IV are for alternative and not simultaneous use. 


Lemma J. If there is anr=1r,=0 such that | o,,(x)| SK, then 


(i) = to + yf “etd, (u) du converges absolutely for 
t> 0, 
(ii) for any k >1, 
K + lim inf &,,(t) = [K + gree], 


t>+0 


K —lim sup 4,,(t) = [K — 


t>+0 
(rok) 
(To) 
where Vk T(ro +k +1) (k —1)*1 
Lemma II. If ¢, =o. (finite), then we can find ry > 0 so that the 
conclusions (i) and (ii) of Lemma I hold and furthermore ©,,(+ 0) =s. 


Lemma III. Jf there is an r»>=0 such that (in the notation of 


Lemma 1) 
(1) ©,,(t) converges absolutely for t > 0 and tends to s as t—>-+ 0, 
(2) Bro(X) = (To + 1) — (4) } = 


then lim = 8. 


LemMa IV. Suppose that A(u) = 0, A(u)e* is integrable over (0,0 ) 
for t > 0 and (in the notation of Lemma 1) 


(3) t6,(t) = (u)du~ Ht, 
0 


where a=0, H=0; the supposition (3) being interpreted to mean 
t*)(t) = o(t*) in the case H=0. Then 


(w)du~ (a +1), 


Proof of Lemma lI. The conclusion (i) of the lemma is obvious. 


Ne 
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To prove the conclusion (ii), we shall first suppose that o,,(z) = 0, or 
A,,(t) 20. Then, for a given y > 0, 


(4) 4. 1) A,,(u)du = T(r = h(u)(a A,,(u)du 
where 
(5) h(w) == / (¢ — 


Now h(u) defined by (5) has absolute miniinum, corresponding to 
u—=a— (k—1)/y=U (say), whose value is 


(6) h(U) = (ay) (fe — 1)* 
Consequently we have, from (4), 


h(U) 
+ 1)aror* 


yret 1 


J. 


h(U) in (6), considered as a function of zy, has absolute maximum when 
sy =r, +k, this maximum being 


(8) (7 Lhe) / — 1) (rg +e + 1)/T (Ke). 


In (7) we can choose « = (7) + &)/y and replace h(U) by its largest value 
given by (8), getting the best form of (7): 


(9) T(r +1) “WA, (u)du 
» F(rotk+1 


Now, by a well-known formula, 
(10) = +k + + (udu, 
Substituting for the integral on the right side of (9) its value in (10), 
we obtain from (9): 

Mane (2), (ro +k)/y. 
Letting y—- 0 in (11), we have 
(12) lim inf ®,,(y) = lim inf ( + &)/y) 


y>+0 y>+0 


| 
iS 

= "k T(k)P (1 + 1) A,,(u) du. 
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If o,,(2) does not satisfy the condition o,,(7) 20 but is such that 
| o,,(r) | SK, or o,,(4) = — K and —o,,(x) = — K, we can replace A,,(w) 
in the preceding argument successively by Ku’e + A,,(u) and Ku’ — A,,(u) 
and deduce from (12) the two inequalities of the conclusion (ii) of Lemma II. 


Proof of Lemma II. The hypothesis of Lemma II implies that o,(2) 
is bounded as x oo for all large r. Hence we can find ry > 0 such that 
| o,,(«)| SK and consequently the conclusions (i) and (ii) of Lemma I 
hold. The further conclusion ,,(¢) > s as t—>-+ 0 follows the conclusion 
(ii) of Lemma I when we let k +0, since 4, +1 as ko. 


Proof of Lemma III. This lemma is a slightly modified form of an 
extension of a theorem of Szadsz already obtained by me ([4], Theorem 1).? 
It requires no proof if it is recalled that, in my extension just referred to, 


I have in effect assumed that 
f e“td{A(u)} converges for ¢ > 0 and tends to s as t> + 0, 
0 


derived from this assumption the hypothesis (1) of Lemma III, this deriva- 
tion involving an appeal to known theorems ([7]|, Chapter II, Theorem 2. 3a 
and §§ 8.1, 8.2), and finally used (1) and (2) to establish the conclusion 
of Lemma III ([4], Lemma 4 and § 1. 2). 


Proof of Lemma IV. This is omitted, the lemma being a well-known 
theorem of Hardy and Littlewood ([2], Theorem 1). 


4, Proof of Theorem #. The hypothesis of Theorem ¢, which is also 
that of Lemma II, implies (as a result of the lemma) the existence of an 
> 90 such that —O(1) as and hence (1) holds. Now 
o,,(%) = O(1) also implies (2) in the form B,,(x) = O(2’*1). Hence the 
conclusion of Theorem ¢, which is included in that of Lemma ITI, follows on 
an application of Lemma III. 

The referee has suggested that Lemma IV may be used instead of 
Lemma III in the preceding proof, and the proof concluded thus. The 
condition o,,(z) = O(1) shows that we can find a constant K such that 
A,,(x) + Kat = 0; while the condition ,,(¢) s as t—-+ 0 implies 


f “ett A, (w) + Ku}du ~ (s+ (1% +1), t>+0. 


2In fact Lemma III reduces to Szasz’s theorem ([5], Theorem 1) when r, = 0. 


of 


co 


to 


be 
F( 
pay 


0 
te 
| 


ON RIESZ SUMMABILITY. 257 


Hence we can take, in Lemma IV, 4,,(u) + Ku’ instead of A(u), m +1 
instead of a, and conclude that 


+ Ku}du~ (s+ (tm + +2), coo, 
or 


Arg (@) = + yf A,,(u)du ~ sx", 
0 
whence the conclusion of Theorem ¢ follows at once. 


Remarks on the proof of Theorem t. (i) It is clear from the proof of 
Lemma III ([4], § 1.2) that the lemma is effectively a transformation of 
the well-known case «1 of Lemma IV. Thus it would seem that there is 
no essential gain in simplicity resulting from the use of Lemma IV in place 
of Lemma ITI. 


(ii) Pennington remarks, apropos of the version of Theorem ¢ for 
Riesz typical means, that its proof by Dirichlet’s series requires the use of 
quite deep-lying results in the uniform summability of such series. The proof 
of Theorem ¢ given here, along the same lines, makes use of the standard 
techniques employed in my previous paper [4] including Doetsch’s ‘ positive ’ 
Tauberian theorem either in its simplest form (case «=—1 of Lemma IV) 
or in a generalized form (Lemma IV). 


5. Minor corrigenda and addenda. In [4], p. 374, relation (10), w** 
of the right-hand member is to be corrected to u**?. 


In [4], p. 374, condition (ii) of Corollary 1, B,(a2) = — ka,**! is to be 
corrected to By, = — 


In [4], line 6 of p. 851, ‘ vol. 59’ is to be corrected to ‘ vol. 69.’ 


In [4], p. 851, condition (i*) of Corollary 3, {00 is to be corrected 
tot>-+ 0. 


The inversion of integrations, in the proofs of Lemmas 4, 5 of [4], may 
be justified by Fubini’s theorem, if we suppose that the Laplace transform 
F(t) of A(u) is defined as a Stieltjes integral, as in the first section of this 
paper, so that 


F(t) = = H/T (k + = ,(t), 
t>0,k=0, 


17 
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where the second integral converges absolutely ([7], Chapter II, Theorem 
2. 3a and §§ 8,1, 8. 2). 

In the proof of Lemma 5 of [4], p. 376, the passage from the step in 
line 4 to the next step may be elucidatel thus. Since the inner integral in 
the former step is non-negative, we have 


0 0 
(r + k) /2, 
and this leads to the latter step, by Fubini’s theorem. 
Lemma 4 of [4] has been recently proved by Szisz ([6], §3) by a 


somewhat different method. 
In conclusion, I would like to thank the referee for helping me to revise 


thoroughly this paper. 


RAMANUJAN INSTITUTE OF MATHEMATICS (KARAIKUDI), MADRAS, 
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AN INTERPOLATION PROBLEM FOR HARMONIC FUNCTIONS.* 


By J. L. WaAusH. 


The study of functions f(z) analytic interior to the unit circle and 
defined there by requirements of interpolation is due especially to Kakeya, 
Takenaka, Malmquist, and Walsh [1] if the functions are subjected to the 
requirement 


(1) 


dz| <0, C:|2|=—1, 


and is due especially to Lokki [2] if (1) is replaced by the Dirichlet integral 
of f(z) over |z| <1. It is the object of the present note to indicate that 
the extremal methods developed by these authors, and the theory of repro- 
ducing kernels developed by Bergman [3] and others [4], apply also to the 
study of real functions w(z) harmonic interior to C and defined there by 
requirements of interpolation, where u(z) satisfies a condition analogous to 
(1). We indicate the differences of method required for harmonic rather 
than analytic functions, and present the specific formulas involved. 
We denote by H the class of functions (z = re‘*), 


(2) u(z)=4a,+ > r*(a, cos + by sin k6), + b;7) <0, 
k=1 k=l 


each of which is clearly harmonic in |z|<1. For a given function u(z), 
the coefficients a, and b, are uniquely determined, because on each diameter 
of C:|z|=—1 the function is represented by a power series in the real 
variable r; such a representation is unique. The series in (2) converges in 
the mean on C to some function u(e), and since the partial sums are 
represented in | z| <1 by their Poisson’s integral taken over C, it follows 
that w(z) is also represented in | z| <1 by the Poisson integral of u(e‘) 
taken over C. The norm || wu || of u(z) is defined as the (non-negative) square 
root of 


(3) day? + (an? + 
Cc k=1 
A function of zero norm vanishes identically. The first series in,(2) con- 


*Received May 12, 1953. 
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verges uniformly on every circle | z | =r < 1, since the a; and 6; are bounded, 


and we have 

|z|=r | 
this last member is not greater than the second member of (3), and by Abel’s 
theorem approaches the second member of (3) as r->1. Of course (4) is 
valid for any function u(z) harmonic in | z| <1, so the last condition in 
(2) can also be interpreted in the form 


23 
(5) lim f, | w(re) <0; 
0 


the integral whose limit appears here increases monotonically with r, as is 


obvious from (4). 


1. An extremal function. 


THEOREM 1. With | a! <1, let fa(z) be the function of class H which 


takes on the value unity for z =a whose norm is least. Then for an arbitrary 
function u(z) of class H we have 


(6) u(a) dz |/ fa()| de | 


From (2) we have (| a, cos n@ + 6, sin né |? < a,” + b,?) 


(1) | u(re)| S + (as? + 


from which it follows that a sequence of functions u,(z) of class H of bounded 
norm is normal in |z| <1. A suitable subsequence converges uniformly on 
any closed set interior to C to some harmonic function u(z). The monotonic 
character of the integral in (4) implies 


2r 2r 
f | |? dd = lim | |? || un 
0 n> 


since || up || is bounded, (5) is satisfied and u(z) is of class H, with 
|| « || Slim sup || uw, |i. The set of functions of class H taking the value 
unity in za contains the function u(z) =1, hence is not empty; there 
exists a function f,(z) of this set of minimum norm. We have || f, || > 0 by 
virtue of fa(%) =1. 


Lemma 1. A function uo(z) of class H is orthogonal to fa(z) when and 
only when uo(«) = 0. 


— 
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If we have w.(«) = 0, we have for every real e by the definition of f,(z) 
(8) ll fu |? S fa + ||? = |] fa I? + 2e(fa, Uo) + €? || Uo 

where we use the generic notation 

(f.9) = 3 f |. 


Since (8) holds for every « we must have (fa, Wu) = 0. Conversely, if we 
have (fa, Uo) =0 but uo(«) 40, the function 
(fa(2) + eto(z))/(1 + etlo(%) ) 


is of class H, takes the value unity when z = a, and has as norm the square 


root of 


(|| fa |]? + €? |] Wo []?)/L1 + 


a norm which for a suitably chosen numerically small « is less than that of 
fa(z). This contradiction of the definition of fa(z) establishes Lemma 1. 

If u(z) is an arbitrary function of class H, the function u(z) — u(@) 
is of class H and vanishes for z =a, so by Lemma 1 we have 


J. [w(z) —wu(a«) ]fa(z)| dz | =0; 


in particular from u(z) =fa(z) we have || fa |?—= f fa(z)| dz | 0, 
Cc 


(6) follows, and Theorem 1 is established. 


The function f,(z) of Theorem 1 is unique, for if a second such function 
F,(z) exists, the function f.(z) — Fa(z) vanishes for z = «, hence by Lemma 
1 is orthogonal to both f,(z) and F'a(z), hence orthogonal to fa(z) — Fa(z), 
| | and vanishes identically. The function fa(z) of class H is uniquely charac- 
terized by fa(%) and the property of being orthogonal to every uo(z) 
of class H which vanishes in z =a; for if a second function F,(z) has these 
properties the function fa(z) — Fa(z) is orthogonal to both fa(z) and F,(z), 
hence is orthogonal to f4(z) —Fa(z) and vanishes identically. If @(z) is a 
function of class H and if for every wo(z) in H we have uo(«) = 2(uo(z), (z)), 
then the function —fa(z)/(fa(z),1) is orthogonal to every uo(z) 
in H, is orthogonal to itself, and vanishes identically; thus we have 
= fa(2)/m(fa(2),1). 

Lagrange’s multipliers give a necessary condition for a minimum and 
thereby enable us to compute f(z). In the notation (2) for fa(z) with 
&% = pe’? we have to minimize 


(9) Jay? + > (an? + by?) 


— 
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subject to the auxiliary condition 


(10) 4a) + p" (an cos np + sin nd) = 1. 
We thus write (Au ~ 0) 


F(A, + (a cos nd + bn sin np)] + + (ay? + by?) ], 


OF /0an = Ap" cos np + = 0, n>0O; 
OF /day = + 0, 
OF = Ap" sin nd + 2b, = 0, n> 0. 


Substitution of these unique values of a, and b, in (10) now yields 


— (A/2u) + (cos? np + sin?ng) = 1, 
Jay (1—p?)/(1+ = 2(1 —p*) c08 ng/(1 + p*), 
by = 2(1— p*) p* sin no/(1 + p*), 


(11) = (1 — p*)[(1 — r°p*)/(1 — 2rp cos(p — 8) + 1°p*) ]/(1 + 


The denominator in (6) reduces to 27(1 — p”)/(1 + p?), so the second member 
of (6) reduces to the familiar form of Poisson’s integral. 

Our entire proof of Theorem 1 can be based directly on Poisson’s integral 
rather than on the general theory of orthogonal functions; the contrast in 
method is that of derivation rather than immediate verification, methods of 
general rather than restricted applicability. The computation of fa(z) must 
of necessity yield the kernel of Poisson’s integral, for we have already 
shown that any function @(z) of class H with the reproducing property 


Uy(%) = ; Uo(z)®(z)| dz| for every uo(z) in H must be identical with 
JC 

fa(z)/ J fa(z)| dz |. Incidentally, we have here a new proof of the validity 


of Poisson’s formula. 
Another form of (11) is (z= re#®) 
fa(re') == (1 —p*) Re[ (1 + 22) /(1 —&)]/(1 


which will prove especially useful. 


2. Interpolation and orthogonal functions. We are now in a position 
to study the general problem of interpolation. 


THEOREM 2. Let the distinct points interior to C be given, 


it 
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finite or infinite in number. Let yn(z) be the function of class H of minimum 
norm which satisfies the auxiliary conditions 


(13) Yn(%0) = Yn(%) Yn(Gn1) = 0, Gn) = 1. 
Then the sequence {W»(z)} is orthogonal on C. 


Theorem 2 is similar to a theorem due to Bergman [3], involving 
analytic functions and multiple interpolation in a single point of a region. 

The set of functions of class H satisfying the auxiliary conditions (13) 
is not empty, for there exists a polynomial in z satisfying those conditions; 
the real part of this polynomial is of class H and also satisfies those conditions. 
Thus y,(z) exists. No function of class H satisfying (13) is identically zero. 

The function y,(z) defined in Theorem 2 is orthogonal to a function 
u(z) of class H with u(a) =u(a,) =0 when and only 
when we have u(@,) =0;.we omit the proof, which follows precisely the 
method of proof of Lemma 1. Consequently each y, is orthogonal to 
Wnsis Wns2,° SO the sequence is orthogonal. 

Moreover, if we have for a function u(z) of class H the equations 
U(do) = Uu(a,) =0, then we have as in the proof of (6), 


A special case of this equation is found by setting u(z) =yYn(z): 


so (14) can be written 


If u(z) is an ‘arbitrary function of class H, there are two formal 
expansions of u(z) of form 


(16) ~ doo(z) + ai(z) 


the one expansion is the usual orthogonal function (Fourier type) expansion 
found by formal expansion of u(z) on C, and the other is an interpolation 
series expansion found by setting successively z= %,@,,- - +, and solving 
for do, in order: 


U(%) = AoWo(%), = Apo (a1) ° 


These two formal expansions are identical. The fact that a) is the same 
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whether found by one method or the other is a consequence of (15) for n = 0. 


Assume this conclusion true for do, @:,° * *,@n1. The function 
Un-1 (2) == U(Z) — — — (2) 
vanishes for 2 = %, %,° * *,@%n-1. By (15) it is then immaterial whether a, 


is determined by interpolation to up, in = namely ayWn(%n) = Uni 
with = 1, or determined by the formal orthogonal expansion of 


on C: || Yn ||? = Un-1(2)Wn(z)| dz |. 


Each function y,(z) is unique, and is also the unique function which 
satisfies the auxiliary conditions (13) and is orthogonal to every u(z) of 
class H which satisfies the conditions u(%) = 0, as follows 
by the method already used in connection with Theorem 1 for n—0. We 
shall prove that Y,(z) is a linear combination of the functions Wo(z), 
W,(z),° +, Un(z), where = fa,(z). These functions ¥;,(z) are linearly 
independent on C, for by (12) they are functions harmonic over the extended 
plane except for singularities in the distinct points 1/é; respectively. These 
functions ¥;,(z) can then be orthogonalized and normalized on C by the 
Gram-Schmidt process, yielding a set $o(z), $:(2),° *,¢n(Z), Where ¢$;(2) 
is a linear combination of the functions Yo(z),¥.(z),° Thus 
$x (2) is orthogonal on C to the functions ¢o, ¢1,° Px-1, hence is orthogonal 
to Wo, hence by Theorem 1 vanishes in the points @, 
But ¢, is not orthogonal to ¥;, so (also by Theorem 1) we have ¢;,(a.) ~0. 
Every function u(z) which vanishes in all the points %, @,-°--, %;, is orthogonal 
to Wo, - hence is orthogonal to By our characterization of 
yx (z) in terms of orthogonality to such functions u(z), it follows that ¢,(z) 
is a constant multiple of ¥;,(z), so ¥,(z) is a linear combination of the func- 
tions Wo, -, conversely, W; is a linear combination of Wo, Wi, de 

The formulas for the orthogonalization and normalization of the functions 
(12) may be readily written by the reader in determinantal form; we have 


for instance 


Cc Cc 7C 
== (1 — | |?)/(1 + | |?) — | + | |?) 
X Re[(1 + ajax)/(1 — aja,)]. 
The two fundamental questions of interpolation for functions of class 


H are 1) given a function u(z) of class H and a set of points %, @,,° °° 
interior to C; to express u(z) in terms of the values u(a,); 2) given a set 
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of points 2%, ,° - - interior to C and corresponding functional values Ao, 
A,,: - + 3 to determine all functions u(z) of class H satisfying u(a,) = Ax. 
The second of these questions is the more general and essentially includes 
the first, so we proceed to consider it. 

There obviously exists a unique linear combination of the functions 
%n(z) which takes on prescribed values Ao, *,An 


in the respective points %, %,° * *,@,, so there exists a unique linear com- 
bination of the functions ¥(z), ¥,(z),- - -,¥n(z) which takes on those values 


in the respective points. The formula for these (equivalent) linear com- 
binations is readily written in determinantal form. The function uo(z) thus 
obtained is orthogonal to every function u(z) of class H which vanishes in 
the points %, hence is the unique function of minimum norm 
which satisfies the prescribed conditions of interpolation: 


The case of an infinite number of points a, #,,- - - and prescribed values 
Ao, Ai,* +: is similarly treated. The points and values define a formal 


infinite expansion (16) found by interpolation in the points a,; a necessary 
and sufficient condition that there exist a function u(z) of class H taking on 
the values A; in the points a, is the convergence of the series > a,” || Wx ||*. 
If this condition is fulfilled, the second member of (16) converges in the 
mean on C, hence by (7) converges uniformly on any |z|Sr< 1, and 
represents a function wo(z) in H which satisfies the conditions of interpolation. 
Any function in #7 which vanishes in all the points «; is orthogonal to every 
¥x, and conversely; such a function is orthogonal to uo(z). Thus uo(z) is 
the unique function of class H of least norm which satisfies the conditions of 
interpolation. The most general function of class H satisfying the conditions 
of interpolation is then uo(z) plus an arbitrary function u(z) of class H 
which vanishes in -all the points a, or otherwise expressed is uo(z) plus an 
arbitrary function u,(z) of class H which is orthogonal to all the y;,(z). 

If we take as departure not arbitrary assigned values A; but a function 
u(z) of H, its formal series of interpolation (16) is on C merely the formal 
expansion of u(e‘’) (measurable and square-integrable on C’) in terms of the 
orthogonal functions y;,(e!”), hence converges in the mean on C and represents 
in | Z | <1 the function wo(z) of least norm satisfying the conditions 
Uo(%) u(a,) for every k. 


3. Real points of interpolation. There is one non-trivial case where the 
functions ¢;,(z) and y¥,(z) can be written down at once: 
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THEOREM 3. Let there be given the real distinct points 


(17) = 0, &1, Ae, ° 


interior to C. In series (16) we may choose Wo(z) =1, 

Yn(2) == Re[ Anz (2— (@— ana) /{(1— ara) (1— aye) 
(18) An = (1 — (1 — On?) / On (On — (Op — 
The functions 

are normal and orthogonal on C, 

The analytic functions F(z) = U,(z) + 1Vn(z) whose real parts occur 
in (18) are known ([1], p. 305) to be orthogonal on C; for 7 Ak we have 


0 =f dz | (U; + iV;) — ds, 


(19) (U Ue + V;Vi)ds = ~ 


On C we have dz = izds, so by Cauchy’s integral formula (even if 7 =k -~0) 
F,(2)F,(s)ds —i f 0, 
Jc 


(20) V;V;,)ds = (UjVx ds = 0. 
JC Cc 


The orthogonality of the set exhibited in (18) follows from (19) and (20). 
The function y,(z) exhibited in (18) is a linear combination with real 
coefficients of the functions 


(21) Re[(1 +- /(1— |, k=0,1,2,---,n, 


by the classical theory of the decomposition of rational functions; since the 
procedure of orthogonalization of a given sequence leads to essentially unique 
results, it follows that the sequence exhibited in (18), found by orthogonaliza- 
tion of the sequence (21), is identical with the sequence y,(z) previously 
considered in (16), found by orthogonalization of the functions ¥;(z) = f4,(z) 
of (12). 

From the analogue of (19) with jk >0 and from (20) we have 


f | F; |? ds = 2 f U;? ds; the first member is readily computed, so nor- 
J¢ 


malization is as indicated in Theorem 3. 
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4, Completeness of orthogonal functions. The functions ¥;(z) in (16) 
may be complete but need not be complete. For instance, if the «, can be 
| written in the form «, = pe“®, 0 < p< 1, where p is independent of & and 

where 8/z is irrational, the points a, are everywhere dense on the circle 
|z |; a function of class H which vanishes in all the points ¢, vanishes 
identically interior to C. On the other hand, if the «;, satisfy the conditions 
of Theorem 3, the function V;(z), kA 0, in H does not vanish identically 
and is orthogonal to all the functions y,(z) [or Un(z)], so the latter do not 
form a complete set in H. 

There are various procedures for completing a given set y¥;,(z) in (16), 
to which we now turn. An arbitrary function of class H which vanishes in 
all the points a, is orthogonal to each y;,(z), and conversely. If the functional 
values A; = u(a,) are prescribed with > a;,? || y; ||? convergent, or if a func- 
tion u(z) of class H is given, the formal expansion (16) converges in | z| <1 
and converges in the mean on C to the function uo(z) of minimum norm 
with the property wo(%,) = u(a,). Every function u(z) of class H which 
takes on the prescribed values is given by the equation 


(22) u(z) = + (2); 


where u,(z) is a function of class H which vanishes in all the points a,. 
By the orthogonality of uw) and uw, we have. 


If the set ¥,(z) in (16) is not complete, we denote by H’ the closure of 
the linear family of functions u(z) of H defined by the y,(z), and denote 
by H” the closure of the linear family defined by a complete subset of 
orthogonal functions of H each of which is orthogonal to all the y; then H” 
is the class of all functions of H vanishing in all the points a, or alternately 
orthogonal to all the ¥,. Each function of H is the unique sum of a function 
of H’ and a function of H”. It may occur (in contrast to the facts for 
interpolation to analytic functions, for which the set of points on which all 
functions of the analogue of H” vanish is precisely the set {a,}, and the 
Blaschke product whose zeros are the a, is convergent) that each function of 
H” vanishes on a point set larger than {a,}, as for instance under the 
conditions of Theorem 3 with «,—0; in the notation of the proof of 
Theorem 8, the functions =Un(z) + 7V,(z) form a complete set with 
respect to the class H. of analytic functions | |? <0; the real 
part of a function of class H, is of class H, and an analytic function whose 
real part is of class 77 is itself of class H.; thus the functions {U,, V,} form 


} 
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a complete set with respect to class H ; the closure of the linear family defined 
by {U,} is H’, and the closure of the linear family defined by {V,} is H”; 
all functions of the latter family vanish at every point of the axis of reals 
interior to C. 

We can be more explicit, with the choice that a, is real, a,—0. Every 
function of class H which vanishes for real z is orthogonal to each y;(z), 
hence is orthogonal to each function of class H’. But it follows from (2) 
that every function of class H is an analytic function of the real variable r 
on the segment —1<2a2<-+1, y=0, so every function of class H which 
is orthogonal to all the y;,(z) vanishes in all the points a and vanishes on 
the axis of reals, hence (by Schwarz’s principle of symmetry) satisfies the 
equation u(z) =—u(Z). In the expansion (2), which is unique, it follows 
that every a, vanishes. Consequently, the class H” is precisely the class of 
functions (2) with every a; equal to zero, namely the closed linear extension 
of the class {r* sin k6} ; otherwise expressed, the class H” is the subclass of H 
for which u(e‘%) is an odd function of 6. 

We return to the situation of arbitrary a, where the set H” is not empty, 
and denote by S the set of points interior to C on which not all the functions 
of H” vanish. Precisely the interpolation method used in the proof of Theorem 
1 yields a formula analogous to (6) for the determination of the value in an 
arbitrary point « of S of an arbitrary function of class H’. Moreover there 
can be determined by the methods already used a set of orthogonal functions 
W"o.W%,° > > all of class H” such that an arbitrary function- u”’(z) of class 
H” can be represented 


(24) (z) doo + a's (z) +° 


the function vanishes in points - of S and does not 
vanish in #”;; the unique formal expansion (24) may be found either by the 
usual method of expansion on C in orthogonal functions or alternately by 
formal interpolation to w”(z) in the points #”;; the points a”, are not 
uniquely determined, but may be chosen for instance as the subset in S of 
the points pe*®, 0 < p< 1, k=0,1,2,- - +, where is irrational, and p 
is independent of *. The functions yw”; are all orthogonal to the original 
set y,, so an arbitrary function u(z) of class H has the expansion in orthogonal 
functions 


which is valid in |z| <1 and valid in the mean on C. Series (25) also 
partakes of the usual properties of interpolation series; the function yx 
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vanishes in the points %,,°°*,@,. but not in the point o, so if 
do, 41, * *, 4x1 have been determined, the coefficient a, can be determined 
by formal interpolation in the point a; ; the function y”;, vanishes in the points 
°° +, but not in the point so if do, +, ao, 
+, have been determined, the coefficient a’; can be determined 


by formal interpolation in the point #”;. 


5. Kernel functions. The relation of the classes H, H’, H” is of 
particularly simple form when expressed in terms of kernel functions. If 
the set of functions ¢,(z) each of class H is normal and orthogonal but not 
necessarily complete on C, and if an arbitrary sequence a, with > ad,” conver- 


gent is chosen, the series >) dndn(z) converges in the mean on C, and hence 
0 


by (7) converges uniformly on every closed set interior to C. Since the 
series >) dndn(z) converges for every such choice of the ap, it follows by a 
result due to Landau that the series }[¢,(z) |? converges for every z interior 
to C. Consequently the kernel function 


(26) Ie (2, t) =2 dn(z)on(t) 


n=0 


is defined for each ¢ with |¢| <1 as a function of class H in z, and is 
symmetric in z and ¢; the series defining /(z,¢) converges in the mean for z 
on C. If u(z) is a function of H and if the set ¢n(z) is complete in H, we 
may write for every z with |z| <1 


fF dt | bale) dt | 
04 n=0 


From this equation it follows that the kernel when defined by (26) in terms 
of a complete set {¢,} of normal and orthogonal functions can also be defined 
by the analogue of (26) in terms of a second complete set {¢’,} of normal 
orthogonal functions: 


for |t| <1, since &(z,¢) is of class H in z for fixed ¢, | t | <1; this series 
for k(z, ¢) converges in the mean for z on C and uniformly for | z| Sr <1. 
A kernel k, k’, k’” is associated with each of the classes H, H’, H” pre- 
viously introduced ; the kernel can be expressed in terms of a complete normal 
orthogonal set for the corresponding class: 


| 
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k(z, t) k’(z,t) =2 
(2, t) 
Since the set {y’,. Wn} is complete in H, we have k(z, t) = k’(z, t) + k’(z, t). 
If u(z) is of class H, we have 


Cc n=0 Cc 


which is the formal expansion of u(z) in terms of the set {yn}. 

If a function u(z) of class H is given, or if functional values in the 
points a, are otherwise given corresponding to which a function of class H 
exists, we have a formal development > a;v’x(z); if u(z) is given this 


development represents the function { k’(z, t)u(t)| dt |. If the functions 
Jc 


y’.(z) are complete in H, this development represents the unique function 
of class H taking on the given values in the given points. If the functions 
{y’.(z)} are not complete in H, this development represents the function of 
class H of least norm taking on the given values in the given points; the 
whole class of functions of H taking on the given values in the points a is 
precisely the class 


k=0 


and can also be expressed as > Ax’),(2) + f k’’ (2, t)U (t) | dt |, where U(z) 
k=0 JC 


is an arbitrary function of class H. 

In the special case already considered (§ 4) that the a, are real with 
%—>0, we have shown that H” is precisely the set of functions u(z) in H 
for which u(e’) is an odd function of 6; since every function of 6 is the 
unique sum of an odd function of 6 and an even function of 6, and since 
every odd function is orthogonal to every even function, it follows that the 
set H’ of functions of H orthogonal to all functions of H” is precisely the 
set of functions u(z) of H for which u(e‘’) is an even function of 6. For 
this special case we may write (r < 1, p < 1) 


k’ (ret, pet?) = + +2 cos n6 cos np 
n=1 


= Re[1/(1— + — rpet-0)], 


(re*®, pet?) sin cos nd 
n=1 


1 i(0-d i i (6+ 
= Re[rpe*9)/(1 — rpe*@-) — /(1 — rpet@+9)) }, 
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6. Multiple points. In interpolation problems for analytic functions, 
coincidence of two points of interpolation corresponds to prescription of the 
derivative of a function ; for harmonic functions, such coincidence corresponds 
to prescription of a partial derivative of a function; similar remarks apply to 
coincidence of more than two points of interpolation. For harmonic functions 
it is obvious by Laplace’s equation that not all partial derivatives of a function 
at a point can be assigned arbitrarily; indeed, at any point precisely two 
partial derivatives of any given order can be assigned independently. For 
instance, a harmonic function u(z) which vanishes at z = 0 together with its 
(n —1)-st derivatives but with 0"w(0)/dz" ~ 0 is 


u(z) = Re(2") =2" 


a harmonic function v(z) which vanishes at z = 0 together with its (n —1)-st 
derivatives but with £0 is v(z) =Im(2") 
The expansion (2) can be interpreted as an interpolation series defined from 
the values of uw(z) and the derivatives and at z=—0; 
the corresponding complete set of orthogonal functions of class H is 
{r" cos r” sin 

The theory of interpolation for harmonic functions as we have already 
developed it, holds with only minor and obvious changes if partial deriva- 
tives as well as functional values are assigned. For instance if 2(| 2% | < 1) 
and v are given, there exists a function uo(z) in H with @’u9(2)/dx” =1 
whose norm is least; any function u(z) in ZH satisfies the equation 
(20) /dx” = 0 when and only when u(z) and wo(z) are orthogonal; we 
have for an arbitrary function u(z) of H 


Uo [u(z) — Uo (2) (Zo) /dx”|ds = 0, 
JC 


(Zo ) == f 


ZC 


Uy (z)u(z)ds/ f [ wo (z) |? ds, 
Jc 


a formula which corresponds to differentiation of Poisson’s integral under 
the integral sign. The choice u(z) =U,(z), a multiple of Re[(z—2)*"] 


with 9°U(z») /ax” — 1, yields f [uo(2)]? ds — f to(2)Uo(2)ds. We shall 
Cc 


not attempt to derive specific formulas to include all such cases. The set of 
functions (18) with the a, not all distinct corresponds to prescription of 
every u(a,) and of suitable successive derivatives 0"w(a,)/dx" in multiple 
points 

If a series of interpolation for harmonic functions is desired, corre- 
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sponding to interpolation in points some of which are counted multiply, it is 
necessary to order the points according to increasing multiplicities, so that 
functions ¥(z) of increasing subscript correspond to derivatives of higher 
order at a point. 

We have chosen (3) as the basis of our definition of norm because of its 
close relation to Poisson’s integral and the Dirichlet problem and to other 
classical results. Several generalizations of our entire discussion can be 
envisaged: a) use of a positive norm function in (3), b) use of surface 
integrals over | z| <1 in (1) and (3) instead of line integrals, and c) con- 
sideration of more general regions, not necessarily simply connected. The 
methods that we have used apply also in these cases; the specific formulas 
are somewhat different, but at least in the simpler cases can be obtained by 


the reader. 
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SUR L’HOMOLOGIE ET LA COHOMOLOGIE DES GROUPES 
DE LIE COMPACTS CONNEXES.* 


par ARMAND BorEt. 


Introduction. On sait que l’homologie et la cohomologie a coefficients 
réels des groupes de Lie simples compacts connexes sont complétement connues, 
(voir notamment [7], [10], [16], [17], [18]t). Quant a la cohomologie 
entiére ou mod. p, elle a été déterminée pour les groupes classiques seulement, 
tant au point de vue additif ([8], [9], [16]) que multiplicatif ou méme des 
puissances réduites de Steenrod ([3], [4], [6], [15]).2 Nous nous proposons 
ici principalement d’étudier la cohomologie par rapport 4 un corps de carac- 
téristique quelconque des groupes localement isomorphes aux groupes classiques 
et des deux premiers groupes exceptionnels G2, F,; pour Spin(n), G., F, nous 
déterminerons aussi la cohomologie entiére et, partiellement, l’algébre d’hom- 
ologie (le produit étant le produit de Pontrjagin), et les éléments univer- 
sellement transgressifs. 

Ces résultats feront l’objet des 3éme, 4¢me, 5éme parties, .qui sont 
précédées de généralités sur le produit de Pontrjagin et d’un théoréme de 
transgression dans les espaces universels; ils seront établis essentiellement par 
étude de l’algébre spectrale de certaines fibrations, les unes de nature générale 
faisant intervenir les espaces classifiants, les autres particuliéres aux groupes 
étudiés, et en grande partie indépendamment des résultats de I et II: Les 
Nos. 3 4 6 notamment ne seront utilisés que dans les Nos. 15 et 16. Pour 
pouvoir décrire ces algébres spectrales et, éventuellement, passer a l’algébre 
dont F,, est Valgébre graduée, nous nous appuyerons fréquemment sur des 
résultats de [3], pour la plupart longuement résumés dans [6], et qui ici 
seront supposés connus. ‘Toutefois nous avons rappelé au No. 0 les princi- 
pales notations et au No. 1, avec quelques compléments, toutes les définitions 
et résultats concernant la structure de l’algébre de cohomologie d’un H-espace. 


* Received October 7, 1953. 
7 ainsi qu’une Note écrite en collaboration avee C. Chevalley, & paraitre dans les 
Memoirs of the Amer. Math. Soe. 
* Ces résultats sont rappelés au No 8. 
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Le No. 2 établit quelques relations entre le produit de Pontrjagin et 
Valgébre de cohomologie; le No. 3 associe 4 tout élément te H,(G, Ky) un 
endomorphisme diminuant le degré de s de l’espace de cohomologie H*(G, K,), 
ou encore de la suite spectrale d’un espace fibré principal de groupe structural 
G; ces opérateurs ont été introduits en caractéristique 0 par J. Leray [12], 
[13]; comme application, nous démontrons un théoréme assez général con- 
tenant comme cas particuliers des résultats de J. Leray, H. Samelson relatifs 
aux homomorphismes transposés des applications G > G/U, U > G, (U sous- 
groupe de @). 

Dans IT nous montrons que si l’algébre de cohomologie H* (Be, Ky) d’un 
espace classifiant pour G est une algébre de polynémes, alors H*(G, K,) 
posséde un systéme simple de générateurs universellement transgressifs ; c’est 
done, grosso modo, une réciproque aux résultats du § 19 de [3]. 

On sait que les centres des groupes classiques SU(n), Sp(n), SO(2n), 
SO(2n + 1),? sont cycliques 4 respectivement n, 2, 2, 1 éléments. Dans la 
3éme partie, nous nous occupons des quotients de ces groupes par leurs centres 
(ou dans le premier cas par un sous-groupe du centre), autrement dit des 
groupes projectifs unitaires PU(n), PSp(n), PSO(2n). Auparavant, nous 
étudions plus généralement au No. 10 la cohomologie mod. p du quotient de 
G par un sous-groupe cyclique Zm, (m==0Omod.p), 4 Vaide de l’algébre 
spectrale des revétements réguliers, sur laquelle on peut donner des renseigne- 
ments trés précis lorsque H*(G, Ky) posséde un systéme simple de générateurs 
universellement transgressifs. 

IV est consacrée au revétement universel, 4 deux feuillets, du groupe ortho- 
gonal SO(n), le groupe des spineurs Spin(n). L’essentiel est ici la cohom- 
ologie mod 2 et nous calculons H* (Spin(n),Z.) au No. 13, par V’intermédiaire 
de V’algébre spectrale de la fibration Spin(n)/S,; —Vn»2; en particulier, 
Spin(n) a de la 2-torsion si et seulement si n = 7, et son polynéme de Poin- 
caré mod 2 s’obtient 4 partir de (1 + ¢)(1 + #?)--- (1 + é**) en y remplacant 
(1+ #*)-(1+ #) par (1-+ #**) toutes les fois que & est une puissance de 
deux; le No. 14 montrera que les coefficients de torsion de Spin(n), (n= 7), 
sont tous égaux 4 deux. Nous étudions ensuite aux Nos. 15, 16 les éléments 
universellement transgressifs et le produit de Pontrjagin mod 2 de Spin(n) ; 
malgré les calculs assez longs et fastidieux auxquels on est ainsi conduit, il 
nous a semblé qu’il valait la peine d’insister sur ces points ot le comportement 


? Voir le No. 0 pour les notations. 
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de Spin(n), (n = 10), parait quelque peu exceptionnel ; en effet, tandis que 
pour G=U(n), SU(n), Sp(n), SO(n), G2, F., Valgébre H*(G,Z,) est 
engendrée par des éléments universellement transgressifs, et que H,(G, Z2) 
est commutative, on constatera que Spin(n) ne posséde aucune de ces deux 


propriétés pour n = 10. 

L’étude des deux premiers groupes exceptionnels est faite dans V. Celle de 
G., trés simple, utilise les deux fibrations G./S; = V;,. et Spin(7)/G.=S,; 
on trouve notamment que les coefficients de torsion de G, sont tous égaux a 
deux. Pour F, nous nous servirons des fibrations déduites des inclusions 
F,> Spin(9) D Spin(8) D T* et F, D Spin(9) D Spin(?) D G.; un point 
important est le fait que le groupe des permutations de trois objets opére 
fidélement sur H*(F,/Spin(8),Z), trivialement sur H*(F,,Z) et commute 
avec ’homomorphisme naturel du premier groupe dans le second; c’est ce qui 
nous permettra de voir que F, n’a pas de p-torsion pour p > 3. Par contre 
les valeurs de H*(F,,Z;) et de H* (Fs, Z.), (Nos. 21-22), montreront que F, 
a de la 2- et de la 3-torsion; on vérifiera enfin (No. 23) que ses coefficients 
de torsion sont égaux 4 2, 3 ou 6. 

Pour conclure cette vue d’ensemble des résultats acquis sur les groupes 
de Lie simples compacts connexes, signalons que |’on ne sait rien sur la torsion 
des trois derniers groupes exceptionnels, mis 4 part le fait que E, a de la 
2-torsion [5]. 

Les principaux résultats de ce travail ont été résumés dans une Note aux 
Proc. Nat. Acad. Sci. U. S. A. 39 (1953), pp. 1142-1146. Une partie d’entre 
eux avait été antérieurement annoncée aux Comptes Rendus [2]. 
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0. Préliminaires; principales notations. 


0.1. Algébre. jp désignera toujours soit un nombre premier soit zéro, 
K, un corps de caractéristique p, Zp, (p40), le corps des entiers mod. p, 
Z, celui des rationnels. 

Tors. A est le sous-groupe des éléments d’ordre fini du groupe abélien A, 
Tors. ,A sa composante p-primaire; (U) dénotera lV’idéal bilatére engendré 
par une partie U d’une algébre V. 

L’algébre extérieure d’un espace vectoriel sur un corps que le contexte pré- 
cisera (ou d’un groupe abélien libre) P est notée \P ou encore A(a1,°°-, 2m), 
si (a;) est une base de P, et /A\/P est bien entendu le sous-espace (sous-groupe) 
engendré par les produits extérieurs de 7 éléments de P. 

Soient H une algébre unitaire, 7,,- - -,2%m des éléments de H, et P le 
sous-espace qu’ils engendrent. On notera A(a,,---,2m) ou AP le sous-espace 
engendré par 1 et par les mondmes (<< 
lorsque ces mondmes sont linéairement indépendants, (voir la définition de 
systeéme simple de générateurs au No. 1). 


0.2. Homologie et cohomologie. (resp. H‘(X,T)), iéme 
groupe d’homologie, (resp. de cohomologie) de X, relativement 4T; H,(X,T), 
(resp. H*(X,I)), somme directe des Hj, (resp. H‘), (X sera toujours un 
polyédre connexe fini ou éventuellement localement fini 4 homologie de type 
fini). 

Dz est le degré de l’élément homogéne x, P,(X,t) le polynéme (ou la 
série) de Poincaré de X pour la cohomologie 4 coefficients dans Ky: 


P,(X, t) = dim. Ht(X, Ky) 
enfin, on dira que X a de la p-torsion (p 0) si Tors. pH*(X,Z) AO; par 


convention, XY est toujours sans 0-torsion. 


0.3. Algébre spectrale des espaces fibrés. Le systéme formé d’un 
espace H fibré de base B et de fibre typique F’ est noté (Z, B, F) ou (EL, B, F, q), 
q étant la projection de # sur B. Dans son algébre spectrale (#,) on a done, 
sous des hypothéses convenables, toujours vérifiées dans ce travail, 


= H*(B, Ht(F,T)),’ (I, anneau de coefficients), 


* Plus précisément E, est, comme on sait, l’algébre de cohomologie de B & coefficients 
dans le systéme local formé par les algébres de cohomologie des fibres; dans tous les cas 
considérés dans ce travail, ce systéme sera simple, presque toujours soit parce que B est 
simplement connexe, soit parce que le groupe structural de la fibration est connexe. 
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et est Valgébre graduée associée 4 H*(H,I) convenablement filtrée; s, ¢, 
s+ ¢ sont respectivement le degré-base, le degré-fibre et le degré total et sont 
notés DB, DF, D. 

est la somme directe des (s +¢—n). 

kr’ est la projection canonique des d,-cocycles de E, sur Ey,:, et 
Kpt = Kp 10° (4 <b), est la projection sur H, des éléments de EF, 
vérifiant (aSr<b). 

T* — idéal des éléments de filtration =s dans H*(H,T); on pose 
Jet J? H*t(H,T), par conséquent Het J%t/Jsttt-1, 

On sait que F,*°, (resp. #,°*), s’identifie 4 un quotient, (resp. a un 
sous-module), de H*(B,T), (resp. de H'(F,T)). 

La transgression admet plusieurs définitions équivalentes ([3], § 5) ; ici, 
nous n’utiliserons que la définition “spectrale”: ce H*(F,T) est transgressif 
si et seulement si il fait partie de E;,,°', et la transgression est ex dimension f, 
Vhomomorphisme F;,,°! défini par la différentielle d;,, de E441. 

Nous utiliserons sans commentaire toutes les propriétés usuelles de 
Valgébre spectrale (voir par exemple [3], § 4). 


0.4. Espaces universels et espaces classifiants. Soit G un groupe de 
Lie compact. Un espace universel, (resp. classifiant), pour G sera noté Fg, 
(resp. Be). 

L’algebre spectrale universelle pour G est lalgébre spectrale de la fibra- 
tion (He, Be, G@); on a donc = H*(Be,H*(G,T)) et Hy est triviale, 
(i.e. =T, —0 pour 

xe H*t(G,T) est universellement transgressif s’il est transgressif dans EL; 
il Vest alors dans une large classe d’espaces fibrés principaux de groupe 
structural G, ([3], § 18). 


Remarque. En fait, on définit tout d’abord classiquement des espaces 
E(n, G) et B(n, G) universel ou classifiant pour G et pour la dimension n 
qui devrait en principe figurer plus haut. Cependant on peut en toute rigueur 
aisément éviter de mentionner ce n (qui quoi qu’il en soit peut étre supposé 
arbitrairement grand) par un passage 4 la limite convenable; par exemple on 
peut définir des espaces universel ou classifiant pour G et pour tout n comme 
limite inductive d’une suite convenable d’espaces E(n, G@) ou B(n, G), ([6], 
No. 1) ; Valgébre spectrale universelle et celle des fibrations rappelées en 0. 6 
sont alors prises en cohomologie singuliére ou d’Alexander-Spanier 4 supports 
fermés quelconques. On peut aussi directement passer 4 la limite sur les 
algébres de cohomologie et les algébres spectrales comme cela est fait dans 
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[3], §18 et 22. L/’algébre spectrale universelle sera alors par définition 
une algébre spectrale (H,), (r= 2), qui pour tout n est isomorphe pour 
D < n— dim G avec l’algébre spectrale d’une fibration (H(n, @), (B(n, G@), @); 
cette définition est licite car si E’(n, G) est un deuxiéme espace universel 
pour @ et pour n, de base B’(n, @), le diagramme (18.3) de [3], écrit pour 
E et E’ montre non seulement que H*(B(n, G),T) et H*(B’(n, G),T) sont 
canoniquement isomorphes jusqu’éA n, mais encore, vu [3], § 4d, que les 
algébres spectrales de (H(n, @), B(n, G), @) et de (H’(n, G), B’(n, G), G) 


sont canoniquement isomorphes pour D < n— dim G. 


0.5. L’homomorphisme e*(U,G). Si U est un sous-groupe fermé de 
G, Eq est aussi universel pour U, une application p(U, 
= By > F¢/G = Be; elle définit un homomorphisme p*(U, G) : H*(Be,T) 
— H*(By,T) qui sera désigné par p*,(U, G) lorsque T = Kp. 

En fait, on peut faire correspondre une application p(f) : By > Be 4 tout 
homomorphisme f: U > G, ([6], No. 1), mais nous n’aurons pas besoin de 
cette généralisation. 


0.6. Quelques fibrations. 


(a) Si (X,Y, U,q) est un espace fibré principal 4 groupe structural U 
de Lie compact, il existe une fibration dans laquelle l’espace a méme homologie 
que Y, la fibre type est X et la base est By, ([3], Théoréme 22.1); par abus 
de langage, nous parlerons de la fibration (Y,By,X,r); bien entendu, 
YPhomomorphisme 1* déduit de inclusion d’une fibre n’est autre que q*, et r* 
est ’homomorphisme caractéristique de la fibration donnée. 

Dans le cas ot U est discret, la suite spectrale de cette fibration est tout. 
a fait analogue (et probablement isomorphe) a la suite spectrale des revéte- 
ments finis réguliers. de H. Cartan-J. Leray. 


(b) Si U est un sous-groupe fermé de G, la projection p(U, G) donne 
lieu a une fibration (By, Ba, G/U, p(U, G)). 


(c) Si U est invariant dans G, il existe une fibration dans laquelle 
espace a l’homologie de Bg, la base et la fibre type sont respectivement Bg y 
et By, ((3], Prop. 22.2, ou [6], No. 1) ; ici encore on parlera de la fibration 
(Be, Bey, By). L’homomorphisme i* de cette fibration est p*(U, @). 


0.7. Groupes de Lie simples compacts connexes. Les différentes struc- 
tures de groupes de Lie simples compacts connexes se notent classiquement 
A, (I Pa 1), B,, (1 = 2), C,, (1 _ 3), D,, (I = 4), G., F,, E,, E,, E,. Chacun 
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de ces symboles représente une classe de groupes ayant des algébres de Lie 
isomorphes; les quatre premiéres, les structures classiques, ont chacune un 
représentant linéaire bien connu, soit: 


pour A,;: SU(/ + 1) = groupe unitaire unimodulaire de / + 1 variables 


complexes. 


pour C,: Sp(l) = groupe unitaire de / variables quaternioniennes. 


pour B,, resp. D,;: SO(21+ 1), (resp. SO(2/)), groupe orthogonal uni- 
modulaire de 2/ + 1, (resp. 21), variables réelles. 


Les quotients de SU(n), Sp(n), SO(2n), par leurs centres, qui sont 
cycliques 4 resp. n, 2, 2 éléments, sont notés PU(n), PSp(n), PSO(2n). 
Les groupes SU(n) et Sp(n) sont simplement connexes, tandis que SO(n), 
(n = 8), admet un revétement simplement connexe a deux feuillets, le groupe 
Spin(n), dont le centre est cyclique d’ordre 2 pour n impair, d’ordre 4 pour 
n = 2m, m impair, et isomorphe 4 Z,-+ Z, quand n est un multiple de 4, 
(voir E. Cartan, Annali di Matematica, t. 4 (1927), pp. 209-256). 

Les représentants simplement connexes des structures exceptionnelles G., 
F,, E., E,, E, ont des centres cycliques d’ordres respectifs 1, 1, 3, 2, 1, 
(E. Cartan, loc. cit.) ; en particulier il n’y a, 4 un isomorphisme prés, qu’un 
groupe de structure G., (resp. Fs, resp. E,), que nous désignerons par le méme 
symbole. 


I. Le produit de Pontrjagin. 


1. Cohomologie d’un H-espace. Dans la définition suivante, H est une 
algébre de dimension finie sur un corps Ky, graduée par des sous-espaces H*, 
(k= 0), anticommutative (i.e. a°b = (—1)*b-a,ae H*,be H*), munie 
d’un élément neutre engendrant H°. 

La hauteur de ze H est Ventier h tel que 0, —0; elle vaut 
toujours deux lorsque Dz est impair et que p #2. 


DEFINITION 1.1. Soient J=(x%), (1 Sism, Dz; si iS)j), 
un systéme de générateurs homogénes de H, et s; la hauteur de aj. 
(a) J est de type (MM) s'il est minimal et si s; est au plus égale a la 


hauteur de toute élément de la forme 2, -+ P(%,---,2,1), o& P est un 
polyndbme (1=k=™m). 


(b) J est gq-semi-libre, (q nul ou premier), si les monémes 
Lm™, (0 S17, < forment une base d’espace vectoriel de H, 
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et si de plus Dz; est impair pour q = 0, s; est une puissance de q pour g= 2 
ou bien g + 0, Dz; pair. Les monémes précédents seront les mondmes basiques 


associés a J. 


(c) J est q-simple si les mondmes (1 < 
< pour Dz; pair et ~ 0, = 0, 1 sinon), forment avec 
1 une base d’espace vectoriel de H ; ces monémes seront les monémes basiques 
associés 8 J. Un élément xe Hi sera dit décomposable relativement a J s'il 
est somme de mondmes basiques qui sont produits d’au moins deux 2. 


Au lieu de 2-simple ou 0-simple, nous dirons en général simple, con- 
formément a la définition 6.4 de [3]. On peut bien entendu étendre ces 
définitions aux algébres H de type fini, avec quelques légéres modifications, 
(cf. [3], Définitions 6. 1, 6.2). Certaines d’entre elles équivalent a la notion 
d’algébre extérieure et ne sont introduites que pour simplifier certains énoncés. 

I] est clair que si J est qg-semi-libre, les éléments x; forment avec leurs 
puissances non nulles d’exposants q, q?, un systéme q-simple de 
générateurs, le systéme q-simple associé a J. 

Si un systéme g-simple posséde a générateurs de degrés impairs 
*, mq et b générateurs de degrés pairs le polyndme de 
Poincaré de H est: 


(1.2) 


i=a j=b 
P,(H, t) = IT (3 (1+), (¢=0), 
4=1 j= 
et il est clair que tout systéme q-simple de générateurs posséde a éléments de 
degrés et b éléments de degrés m,° -, et enfin que 
dim H = 2¢-q? (q~0) 
(1.3) 
dim H = 2%» (q=0). 


On a vu dans [3], (Théoréme 6.1 et No. 7): 


1.4. Soit X un H-espace* qui soit un polyédre fini connexe. Alors tout 
systéme de type (M) de générateurs de H*(X, Ky) est p-semi-libre. 


En particulier, H*(X,K,) posséde toujours un systéme p-semi-libre de 
générateurs. 


‘C’est 4 dire un espace muni d’un produit tel que les translations & gauche et a 
droite induisent des automorphismes de la cohomologie. 


i=a j=d 
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1.5. H*(X, K,) est Valgébre extérieure d’un sous-espace gradué par des 


degrés impairs lorsque l'une des conditions suivantes est remplie ; 


(a). p~2 et H*(X,K,) a un systéme simple de générateurs ([3], 
Prop. 6. 1b). 
(b). p¥2 et dim H*(X, Ky) est une puissance de deuz. 


i= 
(c). p=2 et P2(X,t) 44"), (nm; impair, 1Si<m). 


(d). X est sans p-torsion ([3], Prop. 7. 2.).° 


Vu (1.3), (a) et (b) sont équivalents, il nous reste donc a établir (c). 
Soit J=(y%), (1SiSk), un systéme 2-semi-libre de générateurs de 
H*(X, Kz) ; le systéme simple associé J’ est d’aprés ’hypothése formé de m 
éléments de degrés n; impairs; il ne peut donc contenir d’élément de la forme 
ce qui implique que (1 Sik); ainsi J’ est égal a J, ne 
comprend que des éléments de degrés impairs et de carrés nuls, et engendre 
bien une algébre extérieure. 


1.6. Soit X un H-espace qui soit un polyédre fini connere. Alors si X 
est sans torsion, H*(X,Z) est Valgébre extérieure d’un sous-groupe abélien 
libre gradué par des degrés impairs ([3], Prop. 7. 3.). 


Remarques. 


1.7%. Les propriétés 1.5 (a), (b), (c) sont en fait valables pour toute 
algébre de Hopf, (au sens de [3], Définition 6. 2) sur un corps parfait puisque 
(1.4) vaut sous cette hypothése, ([3], Théoréme 6.1). 


1.8. En utilisant (1.2) et (1.3), on obtient aisément la généralisation 
suivante de (1. 5b): Soit p42. Si H*(X, K,) posséde un systéme q-simple 
de générateurs, (qp), alors c’est l’algébre extérieure d’un sous-espace 
gradué par des degrés impairs. 


1.9. Les définitions précédentes s’étendent aisément a des algébres quel- 
conques; par exemple: 


DEFINITION. Soit H une algébre unitaire sur un anneau A. L’ensemble 
(7%), (we H,tel, I ensemble totalement ordonné) est un systéme simple 
de générateurs de H si ce dernier est la somme directe faible des sous-modules 


5 Signalons a ce propos une légére erreur de [3]: La réciproque partielle & la Prop. 
7.2, énoneée dans démonstration en remarque, p. 143, est valable sous l’hypothése 
H* (X, K,) = 0, et non pas seulement Tors. ,H?(X,Z) =0. 
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monogénes engendrés par 1 et par les monémes < < 
k 1, 2, 


2. Le produit de Pontrjagin. Soit X un H-espace. Le produit h: 
X¥xXX—X définit un homomorphisme h, (compatible avec les degrés 
totaux), de H,,(X, K,) ® H,,(X, K,) dans H,(X, K,) ; Vélément h,(a@b), 
le produit de Pontrjagin de a et b sera noté ayb. Ce produit ajoute les 
degrés, est distributif, est associatif si h Vest, posséde un élément neutre 
engendrant Hy si h a un élément neutre. 

Nous noterons < , > la forme bilinéaire canonique sur le produit d’un 
espace vectoriel et de son dual; on sait que H* et H,, sont en dualité de méme 
que H* ® H* et H, ®@H, et que 


(2.1) @ y, vd = KB, VD 5 

d’autre part h* et h,, sont transposés l’un de l’autre, ce qui signifie: 
(2. 2) <h*(r),a® by = <a, av by. 

Un élément homogéne xe H*(X, Ky) est dit primitif si 

(2. 3) h*(z) =2@14+1@z, 


ce qui équivaut visiblement & dire que 2 est orthogonal aux éléments 
décomposables de H,,(X,K,); cette formule permet aussi de définir les 
éléments primitifs de la cohomologie relativement 4 un anneau principal 
A, car H*(X,A) @ H*(X,A) est toujours canoniquement contenue dans 
Z, A). 


DEFINITION 2.4. Soit 6 Vautomorphisme de H*(X, K,) @ H*(X, Kp) 
qui transforme x@ y en (—1)* (Dx =s, Dy =t) ; nous dirons que 
h* est symétrique si 0°h* =h*, 


PROPOSITION 2.5. Le produit de Pontrjagin est anticommutatif si et 
seulement si h* est symétrique. 


Démonstration immédiate: = (— 1)**v\vu pour tout ue A,(X, Ky), 
ve H,(X,K,) équivaut a 


UV = <x, pour tout u,v et xe H*#(X, Ky) 


cest a dire a 


<h* (x), u@v> = <h* (x), (—1)#v @ uy 
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ou encore, puisqu’évidemment : 


<O-h* (x), vy = <h* (x), (—1)%v 


<h*(x),u@v> pour u, v, x quelconques donc 4 
= h*. 

Remarques. 

2.6. La démonstration prouve plus précisément que v = (— 1)**v yu 
pour tout we H,, ve H; si et seulement si h* est symétrique sur les éléments 
de degré s + 

2.7%. h* est en particulier symétrique si H*(X, Kp) posséde un systéme 
de générateurs primitifs. 

Proposition 2.8. Les deux conditions suivantes sont équivalentes. 

(a). H*(X, K,), (resp. H*(X,Z) est sans torsion et), a un systéme 
simple de générateurs primitifs. 

(b). H,(X, Ky), (resp. Hy(X, Z)), est anticommutative et est Valgebre 
extérieure d’un sous-espace, gradué par des degrés impairs st p2, (resp. 
d’un sous-groupe abélien libre gradué par des degrés impairs). 

(a) entraine (b). Soit (2), (1 [tS ™m), un systéme simple de primi- 
tifs; les éléments 

forment avec 1 une base additive de H*(X, soit ) la base 
duale, v(1,,- -%,) correspondant bien entendu 4 x(%,° ; nous voulons 
montrer que v(t) \/ v(t) =0 et que +, %) =v(t) V V(t), 
ce qui établira (b). Par hypothése, les x; sont primitifs, done 


{a,,° +, a} étant une partie ordonnée de - -, ix}, {Gj1,° SOD 

complément ordonné, et €q,...2, le nombre d’inversions de la suite {a,,- - -, %, 

%}. On en déduit immédiatement que v(i) v(i) et, par récur- | 

rence sur k, que v(1,,° —v(ti) V v(%) sont orthogonaux 4 

H*(X, Ky), les égalités annoncées. ( 
Méme démonstration pour les coefficients entiers. x. 


(b) entraine (a). Soit (vj) une base homogéne d’un sous-espace dont 
H,,(X, Ky) est Valgébre extérieure; les éléments 


=, V° V Vins 


(1 <.51SkSm) 


V(t, 
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forment avec 1 une base additive de H,(X, Ky), et comme vy; \V 45 = 0 pour 
tout i, les produits d’au moins deux v; engendrent le sous-espace des éléments 
décomposables de H,.(X, Kp). Soit (a(1,---, %)) la base duale; les éléments 
a, = x(t) sont orthogonaux aux éléments décomposables de H,(X, done 
primitifs, et vérifient de nouveau (2.9); cela permet de prouver par récur- 
rence sur k que est orthogonal 4 H,,(X, Kp), 
done que *,%%) ainsi (2;) est un systéme simple 
de générateurs primitifs. 


Méme démonstration pour les coefficients entiers. 


Remarque. Nous dirons que les systémes (2;) et (v;) de la démonstration 
précédente sont en dualité. 


2.10. Conditions suffisantes pour Vexistence d’un systéme simple de 
générateurs primitifs. 


(a) Si le produit est associatif, il suffit que H*(X, K,), (resp. H*(X, Z)) 
soit l’algébre extérieure d’un sous-espace (resp. d’un sous-groupe abélien libre) 
gradué par des degrés impairs. L’existence d’un systéme de primitifs est alors 
assurée par le théoréme de Samelson ([17], ou [11], pp. 133-134) ; en fait le 
théoréme n’y est formulé qu’en caractéristique zéro, mais la démonstration de 
Leray vaut sans changement en caractéristique quelconque, et méme pour les 
coefficients entiers. 


(b) La condition (a) est en particulier remplie lorsque X est sans p- 
torsion, (resp. sans torsion), d’aprés 1. 5(d) et 1.6; comme un polyédre fini 
ne peut avoir de p-torsion que pour un nombre fini de p, cela montre que, sauf 
éventuellement pour un nombre fini de p, H,,(X, Kp) est une algébre extérieure 
anticommutative. 


(c) Soit X—G un groupe de Lie compact connexe. II suffit que 
H*(X,K,) (resp. H*(X,Z) soit sans torsion et) posséde un systéme simple 
de générateurs universellement transgressifs, puisque ces éléments sont primi- 
tifs, [3], Proposition 20.1). Cette condition est toujours vérifiée si (a) 
Vest, ({3], Théoréme 19.1) et en fait lui est équivalente en caractéristique 
différente de deux (resp. sur Z). En caractéristique deux cependant elle vaut 
dans d’autres cas (voir Nos. 8, 17, 22), qui ne semblent pas justiciables d’une 
démonstration algébrique analogue 4 celle du Théoréme de Samelson, mais 
elle n’est pas toujours réalisée (voir Nos. 15, 16). 


Nous nous sommes bornés 4 formuler 2. 8 dans un cas simple, le seul utile 
dans la suite. Indiquons cependant qu’elle admet la généralisation suivante : 
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Proposition 2.11. Les deux conditions suivantes sont équivalentes: 
(a) H*(X,K,) aun systéme p-simple de m générateurs primitifs (2). 


(b) H,(X, K,) est anticommutative et posséde un systéme p-semi-libre 
de m générateurs (v;), (Dui = Da), la hauteur de étant égale a p si Dy, 
est pair, a deux sinon. 


Démonstration tout 4 fait analogue a celle de (2.8), et que nous ne 
détaillerons pas. On s’appuyera sur le fait que si x est primitif de degré pair 


(2. 12) <x", = n!(<z, v>)*, 


(démonstration par récurrence sur n immédiate) ; on utilisera (2.9) lorsque 
les 2; sont de degrés impairs, que l’on complétera par l’égalité suivante, ot 
sont des primitifs de degrés pairs 


r 


PROPOSITION 2.14. Soit J = (2), (1SiSm), un systéme p-simple, 
(resp. simple), de générateurs de H*(X,K,) dans lequel 2 est de degré 
impair pour =k, de degré parr pourk +1515, primitif pour 1 Sissi, 

Un élément x~0, décomposable relativement a J, combinaison linéaire 
de monomes basiques dans lesquels ninterviennent que +, % n'est pas 
primitif. 

Soient y,, (117), les mondmes basiques associés 4 J qui sont pro- 
duits des (1S jS/). Les monémes yj; (1 S1,7 Sn) sont 
donc linéairement indépendants, et vu, (2.9) et (2.13), h*(z) en est une 
combinaison linéaire; supposons que le mondme 


1 Xj," * 


Shisn<ps+tZ2), 


ait le coefficient c 4 0 dans z; on déduit alors de (2.9) et (2.13) que h*(zx) 
contient 


avec le coefficient c si s=1 et qu’il contient 


avec le coefficient r,-c si s = 0, et h*(x) n’est pas primitif; démonstration 
analogue dans le cas du systéme simple. 


| 
q 
C 
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CoROLLAIRE 2.15. Soit J = (a;) un systéme p-simple, ou simple, de 
générateurs primitifs de H*(X, Kp). Alors tout primitif est une combinaison 
linéaire des 2%. 


Si J = (2;) est un systéme p-simple, ou simple, il n’y a pas de raison en 
général pour qu’une nouvelle base de l’espace sous-tendu par les 2; soit encore 
un systéme p-simple, ou simple; cependant: 


PROPOSITION 2.16. Soient J = (2;) un systéme p-simple, (resp. simple), 
de générateurs primitifs de H*(X, K,), P Vespace sous-tendu par les a; Alors 
toute base (y;:) de P est un systéme p-simple, (resp. simple), de générateurs 
de H* (X, K,). 


Si p~2 et si J est simple, H*(X, K,) s’identifie 4 l’algébre extérieure de P, 
la démonstration est immédiate et bien connue. Il reste donc 4 examiner le 
cas du systéme p-simple pour p~0. 

Si p~2 et Dy; est impair, y;-y; = 9; sinon, comme on calcule mod. p, 
Vélément y;? sera aussi primitif, done combinaison linéaire des a; ou des y; vu 
(2.15); il en résulte immédiatement que tout mondme basique associé a J 
est combinaison linéaire de monémes de la méme forme écrits 4 aide des y; 
ces derniers, dont le nombre est égal 4 celui des mondmes basiques de J, sont 
done forcément linéairement indépendants, et forment avec 1 une base de 
H*(X, K,) ; ainsi (y;) est un systéme p-simple de générateurs. 


Remarques. 


2.17%. Si X —G est un groupe de Lie compact connexe, tout élément 
universellement transgressif est primitif, les Propositions 2.14, 2.15, 2.16 
restent donc valables si on y remplace primitif par universellement transgressif. 


2.18. Bien entendu, les résultats (2.8) 4 (2.16) et leur démonstration 
subsistent sans changement pour H*(X, K,) ou H*(X,Z) de type fini. 


3. Opérateurs sur la cohomologie d’un espace fibré principal définis 
par l’homologie du groupe structural. Soient G un groupe de Lie compact 
connexe opérant sur deux espaces X, Y, et f: XY une application continue 
commutant 4 G. Leray, ([12], [13]), a indiqué que l’on pouvait faire opérer 
H,(G, Ko) sur H*(X, Ko) et H*(Y, Ko) et sur l’algébre spectrale de f. Nous 
utiliserons ces opérations au No. 16, aussi voulons-nous en donner ici la 
définition et les principales propriétés en caractéristique quelconque. 


3.1. Soient A et B deux espaces vectoriels sur K,, B le dual de B; on 
vérifie aisément que si l’on pose o,(a@b) — <b, uda, (ae A, be B, ueB), 


= 
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on fait correspondre 4 wu un homomorphisme o, de A @ B dans A. Si de plus 
h* est un homomorphisme de A dans A @ B on en déduit un endomorphisme 
= h* oo, de A; on notera l'image de x par 


3.2. Soient Z un espace sur lequel G opére, £: ZH X G— E Vapplication 

ainsi définie; on a 
H*(E X @, K,) —H*(E, K,) H*(G, Ky) 
et nous appliquons 3.1 au cas A = H* (LH, K,), B= H*(G, Ky) et h* 
autrement dit, si 
= (rt, H* (LE, Kp), H*(G, Kp)), 
on pose 
= UM; 

Comme 


H,(# xX G, Kp) = H,(£, Ky) © H,(G, 


définit un accouplement de K,), H,(G, Kp) 4 H,(E, Kp), que Von 
peut aussi appeler le produit de Pontrjagin, par 


(ve H, (EL, K,),ve H,(G, Kp)), 


et il est clair que #, est le transposé du produit de Pontrjagin, c’est 4 dire que 


= <x, pour tout ve H,.(L, Kp). 


Ces opérateurs ont les propriétés suivantes: 


(a) Soit Dus; %, est un homomorphisme de A*(E,K,) dans 
H**(E, Ky) pour tout k. 


(b) —0; est Videntité; = ad, + (a, be Ky). 
(d) Soient XY, Y deux espaces sur lesquels G opére, f:X—Y une 1 
application commutant 4 G. Alors f* o = #3, 0° f* pour tout ue H,,(G, Kp). 
(e) Soit weH,(G,K,) orthogonal aux éléments décomposables de 
H*(G, K,) et soient homogénes e H*(H,K,). Alors 


(a) et (b) sont clairs; (c) résulte par un calcul facile de la commutativité 
du diagramme 
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EXGXG 


xX id. id. Xh 
EXG EXG 


qui exprime l’égalité (e-g)-g’ = e-(g-g’); la propriété (d) se déduit de la 
commutativité du diagramme 
rx 
(3. 21) fl Lf 
rx 
Enfin (e) est une conséquence immédiate de la définition, de légalité 


f*(x-y) =f*(x)-f*(y) et du fait que, vu Vexistence d’un élément neutre 
dans G@, on a, pour tout homogéne e H* Ky): 


= @1+ 37; 2, (ae H*(E, K,), H*(G, Kp), 0 << Du < Da). 
3.3. Soit maintenant (#,B,G,q) un espace fibré principal connexe, 


disons compact; @ opére sur F, en laissant chaque fibre invariante, d’ou un 
diagramme commutatif : 


(3. 31) 
Bp —id,B 
¢* définit un homomorphisme de l’algébre spectrale (H,) de (H, B, G,q) dans 
celle de (EZ X G, B, G X G,q), soit (H’,) ; du reste £* est biunivoque, 
E’, = ¢*(E,) H*(G, Ky) = E, ® H*(G, K,), (r = 2), 


la différentielle d’, de E’, est égale 4 d, sur £*(£,), nulle sur 1 @ H*(G, Kp), 
voir [3], lemme 20.1, dont la démonstration montre de plus que 


E’ ® H'(G, K,). 


En prenant A = F,, B = H*(G, K,), h* = £*, on définit selon 3. 1, un endo- 
morphisme additif #,, de H,, pour tout r= 2, qui a les propriétés suivantes: 


(a). #, est un homomorphisme de F,/* dans E,5*, 
(ue H,(G, K,), r= 2). 
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= 0; = id.; = a, + (a, be Kp). 
(c). Bod, = 
0d, = d, Dy et Dy Krai” = Krys” Dy. 
(ce). Sur on a 
(be H*(B, K,), ve H*(G, Kp)). 
(f). %, laisse invariants les idéaux J* qui définissent la filtration de 


H*(E, K,), et le diagramme suivant est commutatif. 


Ou, 
ia (x projection canonique). 


0 


La propriété (f) provient du fait que pour définir (/,) et (#’,) on filtre des 
couvertures fines bien choisies de EF et FX G par le degré en les cochaines 
de base, (voir par ex. [3], § 4, pour la définition précise), qui n’est pas altéré 
par le transposé de £; la propriété (e) résulte de l’égalité £*(b @ 1) = b @ h*(z) 
qui exprime que ¢ est l’identité sur B et que sa restriction 4 une fibre G XK G 
est le produit, (voir [3], démonstration du lemme 20. 1) ; les autres propriétés 


sont immédiates. 


3.4. En utilisant le diagramme (3.21) au lieu de (3.31) on pourrait 
aussi faire opérer H,,(G, K,) sur V’algébre spectrale de f; on obtiendrait ainsi 
les opérateurs du No. 3 de [13], sur lesquels nous n’insisterons pas. Le cas 
traité ici au No. 3.3 correspond au No. 4 de [13]. 


3.5. Applications. Nous considérerons ici la situation du diagramme 
(3. 21) dans le cas particulier XY = G, V’application £y — h étant bien entendu 
celle qui définit le produit de G. 


THEOREME 3.6. Soient Y un espace sur lequel G opére, f: G@—>Y une 
application commutant a G, (opérant sur lui-méme par les translations a 
droite). Si H*(G,K,) posséde un systéme simple de primittfs, alors 
f*(H*(Y, K,)) est engendrée par des primittfs. 

En effet, vu 3. 2(c), image de f* est stable par rapport aux opérateurs 
0, de H*(G, K,) ; il suffit done de voir que toute sous-algébre de H*(G, Ky) 


stable par rapport aux #,, (wu parcourant H,(G, K,)), est engendrée par des 
primitifs, ce qui est élémentaire et laissé au lecteur. 


THEOREME 3.7. On conserve les hypothéses de 3.6 en supposant de plus 
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que px 2.° Alors H*(Y, Ky) =A AP’, ou A est une sous-algébre annulée 
par f* et ow AP” est wne sous-algébre que f* applique biunivoquement dans 
H*(G, Kp). 


Soient z,,- - *,2, une base de P dont les & premiers éléments engendrent 
Vimage de f*, (Da; S Dz; si i Sj Sk); nous désignons par la base 
duale et écrivons 9; pour #%,,; les opérateurs #; engendrent une algébre 
extérieure, en particulier nous noterons l’algébre extérieure de Dy. 
La démonstration de 3.7 comprend trois parties. 


(i) A montrer: Il existe ye H*(Y,K,) tel que f*(y¥:) et que 
= symbole de Kronecker, 11,7 =k). 


Vu que f* est un isomorphisme en dimension zéro et que y-3;=0 si 
Dy < Dyj, on voit facilement que l’on peut trouver ¥; © f*-*(a) tel que 


(3. 71) 9; = 8y, 


supposant avoir obtenu y,,° - -, ys vérifiant (i), nous voulons construire 41 ; 
Vélément 


est annulé par le produit - fait partie de f*-*(as.1) car 


et vérifie 
(3. 72) Yours Por = 1, 0 (sti<jsh). 


Admettons maintenant qu’il existe Ys.1,; © f*""(@s.1) vérifiant (3.72), annulé 
par 8s); si 7 > 1, alors 


est dans f*-"(25,1), vérifie (3. 72) et est annulé par /A\4*(0,,---,0,).° Finale- 
ment on arrivera 4 Ys,:,1 qui satisfait aux conditions imposées a ys,, dans (1). 


(ii). Les y; sont de degrés impairs, de carrés nuls et leurs images par f* 
engendrent - 2); par conséquent la sous-algébre B engendrée par 
les y; dans H*(Y,K,) s’identifie 4 A\(y,,---, yz). Soit A Vensemble des 
éléments de H*(¥Y,K,) qui sont annulés par A+; c’est une sous-algébre 
@aprés 3.2(e), qui fait visiblement partie du noyau de f*; nous voulons 
prouver que tout ye H*(Y,K,) est contenu dans la sous-algébre engendrée 
par A et B, que nous notons A-B. 


° H* (X, K,) est donc une algébre extérieure 4 générateurs de degrés impairs d’aprés 
1.5(a). 


) 
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Si y est annulé par /\10’, cela est vrai par définition de A; supposons-le 


établi pour tout élément annulé par /\*d’ et soit y annulé par A**'0’. Alors 


est annulé de méme évidemment que les coefficients et 
ye A-B vu Vhypothése d’induction d’ot notre assertion, puisque tout élément 


est annulé par /A*1 = 0. 


(iii). I] reste encore a voir que A-B est isomorphe a A @ B; il est clair 
qu’il suffira pour cela d’établir: 


(3.73). Sti Vélément 


(di,,...i,€A), est nul, alors chaque coefficient aj,,..,i, est nul. 


Mais - 0; =0 si z=0; cela étant 


et ainsi de suite, on démontre (3.73) par récurrence descendante sur s. 
Remarques. 


3.8. En caractéristique deux, la méme démonstration’ montre que si 
H*(G,K.) posséde un systéme simple de générateurs primitifs, alors tout 
élément de H*(Y, K.) peut se mettre sous la forme (3. 73) ; la seule différence 
avec le cas 3. 7 est que l’espace A Ys) n’est pas forcément une sous- 
algébre. 


3.9. Bien entendu, on a des résultats analogues en cohomologie entiére 
lorsque H*(G,Z) et H*(Y,Z) sont sans torsion, (mémes démonstrations). 
? 


3.10. Dans le cas particulier ott f est l’inclusion de G dans un surgroupe 
de Lie compact connexe, 3.6 et 3.7 donnent respectivement les Prop. 21. 2 
et 21.3 de [8], elles-mémes généralisations de résultats dfs 4 H. Samelson 
[17]. 

Si f est la projection de G sur un espace homogéne G/U, le Théoréme 
3.7 généralise un résultat énoncé en caractéristique zéro par J. Leray [13]. 
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II. Un théoréme de transgression. 


Le but essentiel de cette partie est la démonstration du Théoréme 6. 1 qui 


peut s’envisager comme une réciproque au Théoréme 19.1 et a la Proposition 


19.1 de [3]; tous ces résultats s’obtiennent par l’étude de l’algébre spectrale 
universelle, mais tandis que ceux de [3] étaient cas particuliers de propositions 
purement algébriques, dans lesquelles il n’était méme pas question d’espaces 
fibrés, nous utiliserons ici plus complétement les hypothéses topologiques, en 
faisant notamment intervenir les propriétés de Valgébre de cohomologie d’un 
H-espace rappelées au No. 1. 

Remarquons une fois pour toutes qu’en caractéristique zéro, ’hypothése 
et la conclusion de 6.1 sont toujours vraies, ([3], Théoréme 19.1), aussi 


supposerons-nous p= 0 dans les démonstrations des Nos. 4, 5, 6. 


4, Enoncé d’une proposition et notations. 


Proposition 4.1. Soit Gun groupe de Lie compact connexe et supposons 
que H*(Bg, Ky) est une algébre de polynémes, pour un p donné. Alors 
H*(G,K,) posséde un systéme simple de générateurs universellement trans- 
gressifs, de degrés impairs si pA 2. 

Pour tout entier 7 > 0, nous prouverons, par récurrence sur j, l’existence 
d’un systéme 1/; de générateurs de type (J/) dans lequel les éléments de degré 
<j sont universellement transgressifs (et de degrés impairs pour p~2), 
ce qui ¢tablira 4. 1. 

Les éléments de H'(G,K,) sont toujours tous universellement trans- 
gressifs ; on peut donc prendre pour ./, n’importe quel systéme de type (J) ; 
supposant maintenant connaitre un systéme J/;,, (& = 1), nous nous proposons 
de construire M;.,,;; nous y parviendrons a la fin du No. 5, aprés avoir 


? 


démontré une série de lemmes; auparavant, introduisons quelques notations. 


Notations. 


(yi) systeme de générateurs de H* (Bg, 
Di sous-espace des éléments décomposables de H/(Be, K,). 


Qi sous-espace sous-tendu par les y; dont le degré vaut 7; il est done 
isomorphe Hi(Be, Ky) 


K,[Q/] algébre de polynémes sur 


B; sous-algébre de H*(Be, Ky) engendrée par 1 et par les y; dont le 
degré est <j; on a donc un isomorphisme naturel B; = K,[Q/] ® B;.. 
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N;, = systéme p-simple associé 


Sj: Sous-espace de H*(G,K,) engendré par 1 et par les mondmes 
basiques associés 4 N; qui sont produits d’éléments dont le degré vérifie 
i= Dzr=j, et sont done isomorphes aux sous-espaces ®@1 et 
1@8;,,' de @ S;,,", qui est lui-méme canoniquement isomorphe 4 8;! par 
un isomorphisme qui fait correspondre le produit (ae SJ, b 

est Valgébre spectrale sur K, universelle pour on a donc 
E.*t = H*(Bg, K,) ® H'(G, K,) et Ez est triviale. On sait que dans toute 
algébre spectrale = et Ej,.°% = donc ici 


(4. 2) = = 0 (j > 0). 


Comme de plus = et = N on 
voit que 
(4.3) d; est un isomorphisme de sur (j 22). 


5. Démonstration de la Proposition 4. 1. 


LeMME 5.1. Soit Alors H*(Be, K,) que des éléments de 
degrés pairs et tout élément universellement transgressif 0 est de degré 
im patr. 

En caractéristique ~ 2, un élément de degré impair est de hauteur deux, 
H* (Be, Kp), qui est par hypothése une algébre de polynémes, doit donc étre 
engendrée par des éléments de degrés pairs. D/’autre part le sous-espace 
E;-+-* des éléments universellement transgressifs de degré 7 — 1 est isomorphe 
& Hj..%° Vaprés (4.3), done nul si 7 est impair. 


LemME 5.2. Soitj; =k. Supposons A; = © muni d'une 
différentielle nulle sur K,[Q5| @ 1, qui applique lV’espace des éléments de degré 
tsomorphiquement sur Qi@1. Alors A; a une cohomologie 
triviale. 

Remarquons tout d’abord qu’en caractéristique p32, il n’y a rien a 
démontrer si j est impair, et que pour 7 pair, S;,’-1 est une algébre extérieure 
a générateurs de degrés impairs. 

A Vaide de la régle de Kiinneth, on se raméne immédiatement au cas od 
Q/ est de dimension 1, pour lequel la vérification est aisée; ce lemme résulte 
aussi pour p+ 2 du lemme 11.1 et de la Proposition 11.1 de [3] et pour 
p = 2, des résultats correspondants dans le cas du systéme simple ([3], § 16). 


LEMME 5.3. (a,). Pour D=k-+ 3, x,? est un isomorphisme de 
B,® 8,.* dans E,, (2SrSk-+1). 


far) ae, &, 


| 
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(by) 8,1") contient pour s+tSk+2,tSk, 
(2Srsk+1). 

(a2) est évident, (b2) résulte du fait que H,"° 0, (4.2); supposons 
(a;) et (b;) démontrés pour un indice j, (2 Sj Sk); il en résulte: 

(5.31) 8;.*) = @ = K,[Q)] (DSk+83), 

(5. 32) Ej” = Q), 

(5.33) C «;?(1 @ = est le sous-espace sous-tendu par 
les 2; dont le degré vaut 7 — 1. 

(5.34)  «;?(B; @ Sj.*) est stable pour d;, et pour DS k + 2, d; se transporte 

sur le dernier membre de (5.31) en une différentielle bien déterminée par sa 

restriction 4 H;°/-1, qui est un isomorphisme sur Q’, vu (4.3) et (5.32), 

et par le fait qu’elle esi nulle sur K,[Q‘] @ Bj. @ SF. 

Démonstration de (aj.): d; est nulle sur B;,,® 8S, done pour que 
= © soit biunivoque sur ce sous-espace, (D =k + 3), il suffit que 
kj) le soit, c’est 4 dire que 

(Bin @ dj(*H;j) =0 pour s Sk + 2; 
soit we *H;; si DFusk, alors wex;?(B; = Bj @ et dj(u) est 
contenu dans l’idéal de Q’, vu (5.32) et (5.34), et ne fait done pas partie 
de si DFu>k, alors DBu=0 puisque et que 
Dusk + 2, done DB(dju) = j, et si dj(u) #0, il ne peut faire. partie de 
kj" (Bj @ dans lequel les degrés base > 0 sont 2j-+ 1. 

Démonstration de (bj). Un élément de 

peut s’écrire d’aprés (0;) et (5.31): 

(5. 35) +++ Un, @ +18 

(ue 8.24, Du; > 0; vie @SF; +, vn linéairement indé- 
pendants). L’espace H;,;*' est la projeetion, par x;,,/ de espace des d;-cocycles 
de #;**; pour prouver (b;,,) il suffira done de montrer que si d;(h) =0, chacun 
des n premiers termes de (5.35) est un d,-cobord; en effet, on aura dans 
ce cas: 


= (1 @ Uner) C (Bir @ SF) = (Bur @ SF). 


Or si dj(h) =0, on doit avoir d;(u;) = 0, vu Vindépendance linéaire des 1, 
(lSisSn); comme Du,>0, il existe wie K,[Q/] tel que 
u; = d;(w’;), (Lemme 5.2 et (5. 34)), et alors 


Ui ® d;(w’; Vi) 


est bien un dj-cobord, (1 Sin). 


| 
| 
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LemMME 5.4. Soit x un élément de degré k +1 de M,. II existe ze 8,*, 
de degré k +1, tel que x + 2 soit universellement transgressif. 


Supposons avoir trouvé z;eS,* tel que x-+- 2; soit un d,-cocycle pour 
r<j; alors, vu (5.3): 
h = djxj? (x + 2;) C C «7? (Bj; @ Sj") = 
ou plus précisément, puisque DB(h) =]: 
h=u,@v, +: +--+ vy, 


S*, A SiSn), vp, linéairement indépendants) ; 
évidemment dj(h) =0, dou dj(uj) —0 et (lemme 5.2), Vexistence de 
K,[Qi| @ tel que = (1 Si Sn); mais ici DBu; = j done 


DBuw’; = 0, ce qui signifie que w’;e S;./-'; on en tire 
+ 2;) 8 m) = da), (ae 1®@ 


et ainsi + 2; Zur, (211 © Si*), est un d,-cocycle pour r j +1, 
de maniére analogue on montre l’existence de z2e€1@8S,* tel que d.(x + 2) 
= 0; on obtient done par récurrence tel que «+ 2;,. soit un d,- 
cocycle pour r << k + 2, autrement dit soit universellement transgressif. 


Lemme 5.5. Dans les notations du lemme 5.4, la hauteur de x + z est 


égale a celle de x. 


Pour p~ 2, x et x+ z sont de degrés impairs, (s’ils sont différents de zéro), 
d’aprés le lemme 5.1, donc tous deux de hauteur deux. I] reste 4 considérer 
le cas p= 2. 

Nous supposons les éléments du systéme simple N;,= (2;) associé 4 M; 
numérotés de telle sorte que les / premiers soient les générateurs de JJ; de 
degrés ainsi que leurs puissances non nulles d’exposants 2,4, 8,- 
la transgression commutant aux Sq‘, éléments (1 SiS 1), sont tous 
universellement transgressifs. Soit S D 8,* le sous-espace engendré par 1 et 


par les mondmes basiques 


Comme l’ensemble des x, (111), contient le carré de chacun de ses 
éléments, lorsqu’il est ~ 0, on voit que S est une sous-algébre dans laquelle, 
puisque nous calculons mod 2, le carré de tout élément décomposable relative- 
ment a J = (1 SiS), (Déf. 1. 1c), est aussi décomposable relative- 
ment a J; en particulier, l’élément z de 5.4, de degré & + 1, est forcément 
décomposable relativement 4 J, d’ot vu (2.14) et (2.17): 
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(5.51) Soit j un entier > 0; si z* 40, ce n’est pas un élément uni- 


versellement transgressif. 


La hauteur de x est une puissance de 2, (1.4), disons 2%; celle de x + z 
est = 2¢ par définition du systéme de type (1); il reste done 4 montrer 
qu’elle est S 2*; or, comme nous calculons mod 2, 


(x + z)** = + = 


et 2°" est universellement transgressif, en tant que puissance 2’-iéme d’un 
élément universellement transgressif, done nul d’aprés (5. 51). 


5.6. Construction du systéme My... Soit Mj, = (hi) le systéme donné ; 


nous définissons un nouveau syst¢éme (h’;) par 
si Dj 
si Dij=k+1 


z, ¢tant décomposable tel que h’; soit universellement transgressif. Les 4% 
étant décomposables, (/’;) est aussi minimal, et de plus tout polynéme en 
h’,,: +, h’; est aussi un polynéme en hy,- hj, (7 = 1, 2,- +); un élément 
écrit done aussi hj + +, et sa hauteur 
est toujours = que la hauteur de hj, done que la hauteur de h’; vu (5.5) ; 
ainsi (h’;) est un systeme de type (1/7) dans les éléments de degrés =k +1 
sont universellement transgressifs; c’est le systemeM/;,,, chercheé. 


6. Un théoréme de transgression. En fait, tout l’essentiel est contenu 
dans la Proposition 4. 1, mais pour l’application que nous avons en vue, il sera 
plus commode d’utiliser l’énoncé suivant: 


THEOREME 6.1. Soit G un groupe de Lie compact connexe et supposons 
que pour un p premier donné, H*(Bg, Ky) soit une algébre de polynémes 
engendrée par m éléments Ym, (forcément de degrés pairs si p¥2). 

Alors H*(G, K,) posséde un unique systéme simple de générateurs unt- 
versellement transgressifs (uj), (t=1,° +,m, Du; = Dy, —1), tel que y% 
soit une image de u; par transgression dans Eg. 


Prenons un systéme (h;) de type (1) d’éléments universellement trans- 
gressifs, (Prop. 4.1), et soit N = (2;) le systéme simple associé, (il faut bien 
remarquer que pour p > 2, les h; sont de degrés impairs, et V = M est bien 
un systéme simple, et non pas seulement p-simple). L’algébre spectrale uni- 
verselle vérifie relativement 4 N et pour D quelconque les propriétés que nous 
avons démontrées pour NV; et DSk-+2, en particulier: (5.31), (5.32), 
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(5.33). Ainsi espace T(G, Kp) engendré par les éléments universellement 
transgressifs, et qui est la somme des E;,,°", (7 = 1), admet les 2; comme base. 
Comme d; est un isomorphisme de 7°!" sur Q/ = Ej, ((4.3) et (5. 32)), 


on voit que VN compte m éléments et que si est base de Q/, l’espace 
E;°7-" posséde une base Up» bien déterminée telle que 
(6. 2) djxj?(1 ui) = 1), (aSisSb); 


ce qui exprime justement que y; est une image de u; par transgression dans Eg. 
Le systéme (u;) est complétement déterminé par (6.2), et comme c’est une 
base de 7(G, K,) il constitue aussi un systeme simple de générateurs (voir 
(2.16) et (2.18)), d’ot le théoréme. 


III. Groupes quotients des groupes classiques. 


7. Sur l’algébre spectrale des espaces fibrés. Dans ce No, A est un 
anneau principal, l’algébre spectrale sur A d’une fibration (L, B, F, q), 
ou espace, base et fibres sont des polyédres finis. 

On notera l’inclusion de F dans F, celle de = J*° dans H*(H, A), 
(cf. 0.3 pour les notations). Rappelons que 


(7.1) 6° a= g*; = i*(H'*(#, A)). 


Nous nous proposons d’indiquer deux cas particuliers dans lesquels F,, est, 
(additivement et multiplicativement) isomorphe 4 A); on supposera 
réalisée la condition suivante: 


7.2. Hy sidentifie au produit tensoriel (gauche sur A) de U =3,>, 
et par un isomorphisme qui applique le produit x-y sur x ®@y, 
(xe ye ; en particulier sidentifie a 

LEMME 7.3. Sous V’hypothése (7.2), J* est Vidéal engendré par 

A)). 

Etant donné heJ** il existe d’aprés (7.1) et (7.2) des éléments 

(H*(B,A)) =1(£,*%°) et A) tels que 
h—2"9,—- Yn 
soit dans J**+#-1; pour s + ¢ fixé l’inclusion de J** dans l’idéal de 


s’en déduit alors par récurrence descendante sur s; l’inclusion contraire résulte 
de —=J*° C J® et de la régle J*-J® C Ja, 


m teat 
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PROPOSITION 7.4. On suppose, outre (7.2), qual existe un homo- 
morphisme v:V—+>H*(H,A) tel que t* ov soit Videntité. Alors V’homo- 
morphisme .®@v:U @ V > H*(E, A) est un isomorphisme sur. 


C’est une légére généralisation d’une proposition de J.-P. Serre (Annals 
of Math., 54 (1951), p. 473), qui s’établit exactement de la méme fagon, 
(rappelons que l’on filtre V par le degré en U et l’on montre que 
induit un isomorphisme des algébres graduées). 


PROPOSITION 7.5. On ajoute a (7.2) les hypothéses suivantes. 


(a) £ est un H-espace. 
(b) A=K, est un corps de caractéristique p. 


(c) U, (resp. V), posséde un systéme p-semi-libre de générateurs (u;), 
Dus Du; si iS j), (resp. (v1), 1 St Sn, Dy, S Dr; 
sitj) et Yona Dum S 


Alors H*(H, Ky) est isomorphe a U® V. 


Le systéme (w:,° * *,Un) est un systéme p-semi-libre de générateurs de 
U® il est en particulier minimal et méme de type (M). Soit 
et soit H*(H, K,) que on voit aisément que c’est un 
systéme de générateurs de H*(H, K,), ([{3], Prop. 8.1); il est du reste 
minimal: En effet vu que Du» S Dv, la condition de minimalité est visible- 


ment vérifiée par les %;, qui engendrent une sous-algébre isomorphe 4 U; 


dautre part, si = - alors 

ce qui est absurde. Soit enfin = 0; + Pi(d,,---, de hauteur minimum 
parmi les éléments de la forme 3; + P(%,° +, 7:1), P est un polyndme. 


Comme «(U) =U et que Dum Dv,, les 3; forment avec les % un: systéme 
de type (IM) de gén‘rateurs de H*(H,K,). Ce systéme est p-semi-libre vu 
Vhypothése (a) et (1.4), et par conséquent 

H*(E, Ky) =.(U) @V, (V sous-algébre engendrée par les 
Tl nous faut encore montrer que V = V. La hauteur de %, soit 7, est égale 
a celle de uj; celle de %;, soit &, est au moins égale 4 la hauteur de 
*(0;) =u, + +, qui est 2 que la hauteur de 
soit s; Mais la définition de systéme p-semi-libre montre que 
dim H*(H, K,) = 1° 


| 

0 
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Comme ces deux dimensions sont égales, on doit avoir 5%, (lSisSn), 


d’ot Visomorphie de V et V. 


Remarques. 

7.6. Soit = 1* + Vir); sa hauteur est égale a 
celle de v;, car elle est d’une part = que la hauteur de @;, qui vaut s;, et 
d’autre part = que la hauteur de v;, qui vaut s;; on en déduit sans difhiculté 
que (v’;) est aussi p-semi-libre, et que la correspondance v’; > 0; définit un 
homomorphisme v: V > K,) tel que soit Pidentité ; Pisomorphisme 


de 7.5 peut done étre obtenu comme celui de 7. 4. 


7.7. Les résultats et démonstrations de ce No valent pratiquement sans 
changement lorsque /, B, F ont des cohomologies de type fini. 


8. Homologie et cohomologie des groupes classiques. 


Notations. Soient 2,,- + -,2, n variables, o; la jéme fonction symétrique 
élémentaire en les 2;, p un nombre premier. 

On note (J =i + k(p—1)), le polynéme qui exprime 
la fonction symétrique de terme typique en fonction des 
aj, et b,*/ le coefficient de o; dans B,*’, réduit mod. p. Par exemple on a 
([6], 12.3): 

(8. 1) = j mod. p. 


THEOREME 8.2. U(n), SU(n), Sp(n) sont sans torsion. On a 
H*(U(n),Z) = H*(S, X Ss X- K Sans, Z) 
H* (SU (n),Z) = H*(S; X Ss X- XK Sena, Z) 
H*(Sp(n), Z) = H*(S3 XK Sant, Z). 


(Au point de vue additif, ce résultat est di 4 Ehresmann [9] et Pon- 
trjagin [16]; pour une démonstration de 8.2, voir [3], Proposition 9.1); 
Visomorphisme de 8. 2 est bien entendu valable additivement et pour le cup- 
produit. Par contre, au point de vue des puissances réduites on a [6], 11.4 
et 13.5): 


THEOREME 8.3. Sott p un nombre premier. H*(U(n),Z,), (resp. 
H*(Sp(n),Z,)), posséde un systeme de n générateurs universellement trans- 
gressifs (hi), (Dhi = 21 —1), (resp. (vi), Dvi = 41 —1), tels que 


h 


| 

9 

li 

| 

de 

ét 

id 

qu 


t 


S- 
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P 
*(v;) =0 si p=2 et si k est impair, 
P = sinon, (j + k(p—1)/2). 
En combinant (2.8), (2.10), et (8.2), on voit que: 
THEOREME 8.4. Pour G=U(n), SU(n), Sp(n), Valgéebre H,,(G, Z) 
est une algébre extérieure anticommutative, isomorphe a H*(G,Z). 


Avant de passer aux groupes orthogonaux, rappelons que la cohomologie 
de la variéte V,,. des vecteurs unité tangents 4 S,_,, déterminée par E. Stiefel 
(Comm. Math. Helv., t. 8 (1935), pp. 3-50), est pour n impair: 


(8.5) H°(Wy,2,Z) = Z) =Z; H™(V,,2,Z) = Z2; Ht =0 sinon. 


THEOREME 8.6. Aw point de vue additif, SO(n) a méme cohomologie 
entiere que le produit 


Pasa M X (i plus grand entier impair Sn), 
multiplié encore par S,_, quand n est pair. 


(voir [8], [9], [16], [15], [3] Proposition 10.4). Les résultats suivants 
montrent que cet isomorphisme n’est pas valable au point de vue multiplicatif 
pour n = 5. 


THEOREME 8.7%. J/*(SO(n),Z.) posséde un unique systéme simple de 
genérateurs universellement transgressifs (hi), ASisSn—1, Dhi=1), 
liés par les relations: 


(1 (iS ji +j7Sn—1); sinon. 


({15], Theorems 5. 3, 6.1, [3], Prop. 23.3, [4], Théoréme 9. 1, remarquons 
en passant que l’existence du systeme de générateurs universellement trans- 
gressifs pourrait aussi se déduire du fait que H* (Byocny> 
de polyndmes en les classes caractéristiques réduites w.,-- Wn, qui est 
établi dans [4], No. 8, sans recours 4 H*(SO(n), Z.), et du Théoréme 6. 1 
du présent travail.) 


Z.) est une algébre 


Le théoréme (8.7) montre en particulier que 


8.8. H*(SO(n),Z.) admet un systéme 2-semi-libre de générateurs hy, 
hs, hs,- - +, la hauteur s; de ho, étant la plus petite puissance de 2 telle 
que s,(21—1) =n. 


1 
§ 
e 
3s 
a 
- 
4 
| 
\ 
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En ce qui concerne la cohomologie mod p impair on a ([3], Prop. 10. 2, [6], 
No. 14): 


THtorEME 8.9. Soit p un nombre premier impair. H*(SO(2n + 1), Zp), 
(resp. H*(SO(2n),Z,)), est une algébre eaxtérieure engendrée par n éléments 
On, (resp. Wen), (Dbi = 4t—1, = 2n —1), uni- 
versellement transgressifs, vérifiant : 


P = 0 lorsque k => 1. 
P (Bi) = (J =t+ k(p—1)/2). 


En utilisant (2.8), (2.10), (8.7) et (8.9), on obtient le théoréme suivant, 
dai a Miller ([15], Theorems 4. 2, 4.3): 


THEOREME 8.10. Soit H,(SO(n), est une algéebre extérieure 
anticommutative, isomorphe a H*(SO(n), Zp). 

H*(SO(n), Z.) est une algébre extérieure commutative, engendrée par 
n—1 éléments de degrés respecttfs 1,2,- --,n—1. 


Rappelons encore pour terminer ce No que certains des théorémes précédents 
sont en fait des cas particuliers de théorémes sur les variétés de Stiefel (réelles, 
complexes ou quaternioniennes). Pour ne pas trop allonger nous nous con- 
tenterons de mentionner un résultat qui sera utilisé plus loin ([15], Nos. 3, 
5, 6, [3] Prop. 10.4 et [4] Théoréme 9.1): 


8.11. Les coefficients de torsion de la variété de Stiefel Vn, des k-repéres 
orthonormaux de R” sont tous d’ordre deux. La projection naturelle px: 
SO(n) > induit un isomorphisme de H* (Vn,x, Z2) sur la sous-algébre de 
H* (SO(n), qui est engendrée par les éléments hi, (n—k SisSn—1; 
1=ksn—1}). 


9. Revétements réguliers finis; cohomologie des groupes cycliques. 
Ce No est principalement consacré au rappel de notions et résultats pour la 
plupart bien connus. 


Soit V un groupe fini. H*(By,T), (T anneau quelconque) s’identifie a 
Valgébre de cohomologie de WN relativement 4 I, au sens des groupes discrets. 
Supposons que WN soit sous-groupe d’un groupe de Lie compact connexe G; 
il agit forcément trivialement sur sa cohomologie, par conséquent dans la 
fibration (G/N, By, G) associée 4 (G,G/N,N), (voir 0.5(a)), le systéme 
des H*(G,I) est simple et en particulier: 


(9. 1) E, = H* (By, Ky) H*(G, Ky). 


et 


| 
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On sait que V a une cohomologie triviale relativement 4 K, lorsque p = 0 
ou lorsque p ne divise pas l’ordre de NV; dans ce cas, FH, est évidemment égale 
a E,, dot la Proposition bien connue: 


PROPOSITION 9.2. Soient G un groupe de Lie compact connexe, N un 
sous-groupe discret de G, d’ordre n, et q la projection de G sur G/N. 

Alors g*: H* (G/N, K,) ~ H*(G, Ky) est un isomorphisme sur lorsque 
p= 0 ou lorsque p est premier a n. 


CoROLLAIRE 9.3. Si p premier ne divise pas n, G et G/N sont simul- 
tanément avec ou sans p-torsion. 


Rappelons encore la cohomologie du groupe cyclique Z,, (voir par exemple, 
8. Eilenberg, Bull. Math. Soc., 55 (1949), pp. 3-37, No. 11): 


PROPOSITION 9.4. Z, a une cohomologie triviale en caractéristique zéro, 
ow en caractéristique p si p ne divise pas n. 

Si p impair divise n, H*(Bz,,Zp) est isomorphe au produit tensoriel 
gauche \(a) ®Z,[b] (Da=1, Db = 2); ona le méme résultat pour p = 2 
lorsque n = 0 mod. 4. 

Si n= 2m, m impair, H*(Bz,,Z2) = Z.[a], (Da=1). 


PROPOSITION 9.5. Soient Zm C Zy des groupes cycliques d’ordres divi- 
sibles par p. Si (m/n,p) =1, Vhomomorphisme p*p(Zm,Zn) est un iso- 
morphisme sur, si m/n est divisible par p, Vhomomorphisme p*p(Zm, Zn) est 
un isomorphisme sur pour les éléments de degrés pairs et annule les éléments 
de degrés impairs. 


Soient m = p’q, n= p'r, ((q, p) = (7, p) =1). On a donc 
Zn = X Za Zn=Zyt X 
Bz, = Bz, X Bz, Bz, = Bz, X Bz, 


et comme les deuxiémes facteurs des seconds membres ont une cohomologie 
triviale, ’homomorphisme p*y(Zm,Zn) se raméne 4 Vhomomorphisme p*, 
calculé pour les composantes p-primaires; il suffit donc d’examiner le cas 
m= p§, n= pt. 

Notre assertion est évidente pour s = ¢; soit done s < ¢ et posons k = p*-*. 
Dans Valgébre spectrale de (Bz,, Bz,, Bz,,), (voir 0.5 (c)) on a 


E, = H*(Bz,, H* Zp) ) 


et les coefficients sont ordinaires, car Z; agit certainement trivialement sur 
H*(Bz,,, Zp) = Zp», done 
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By = H*(Bz,,Z,) H*(Bz,, Zp) = A (a) Z,[y] ® A(b) ®Z,[2] 


(Da = Db = 1, Dy = Dx = 2). Comme 


dim = dim = 1= dim 1F, 


il faut que H,°! = 0, ce qui n’est possible que si 


(9. 51) d,(1®@b) =ky@1, (ke Z,,kA0). 


E.,* a Vélément y © b comme base, il ne contient done pas de d.-cocycle ~ 0, I 
ni a fortiori de d.-cobord ~ 0, par conséquent i 
(9. 52) d.(1@z) =0, 
ces deux formules déterminent complement d, et on calcule aisément (voir ti 
aussi Lemme 5. 2): e 
E; = A(a) @Z,[2x]; 
ainsi pour r => 3, FL, ne contient pas de degré base = 2, d, est nulle et H; = Fy. | 
Z,|«] s’identifie 4 ensemble des éléments de degré fibre zéro de c’est 
bien Vimage de p*p(Zm, Zn), (voir 0.5), d’oi notre proposition dans le cas | 
p impair. Pour p= 2, la démonstration est pratiquement identique 4 la 
précédente et laissée au lecteur. P 
10. Quotient d’un groupe de Lie par un sous-groupe cyclique. y 
Notation. Dans tout de No, lm désigne un sous-groupe isomorphe a Zm ” 
du groupe de Lie compact conneze G. : 
de 
Nous voulons étudier la cohomologie mod. p du quotient G/Tm, (p/m), F 4 
lorsque H*(G,Z,) posséde un systéme simple de générateurs universellement ? 
transgressifs. di 
LEMME 10.1. Sotent p un nombre premier, m un entier divisible par p. F % 
On suppose que H*(G,Z,) est Valgébre extérieure d’un sous-espace P gradué 
par des degrés impairs. Soit 2s le premier degré pour lequel p*p(Tm, G) est 
non nul.’ Alors dans Valgébre spectrale sur Z, de (G/Tm, Br,,, G, q): 
Bay Bags,  H* (Bry; Zp) /(H* (Bry; Zp)) AP’ 
ou, (pour un P convenable), P’ est un sous-espace gradué de P tel que 
dim P’ = dim P — 1, dim P’?*-? = dim P?**— 1. Si de plus Tm est invariant 
dans G, Ventier s est une puissance de p. 
(¢ 


7™La démonstration montrera l’existence de s. 


| 
g 
| 
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On peut supposer que P est l’espace des éléments universellement trans- 
gressifs de H*(G,Z,), ([3], Théoréme 19.1), par conséquent ([3], Prop. 
22,1): 

11) d,x,2(1® =0 si r Si, 


(10. 12) dix;2(1® = x? (p*p(I'm, @) (H*(Be, Zp) @1)). 


Si p*p(Im, @) est nul sur les éléments de degrés 1< k, ces formules 
0, montrent que d,«,?(1@P) —0 pour r < k, donc que d, = 0 et que LH, = 
pour r< &; mais H, est de dimension infinie, alors que #, est de dimension 


finie, il existe donc un premier entier 2s tel que p*, ne soit pas nul pour ce 
| degré; ’espace H**(By,,,Z,) étant de dimension 1, le noyau P’ de la restric- 
ir | tion de dys & Ko5"(1@ P) est de dimension inférieure de 1 4 celle de P; soit 
encore x un élément de P?s-', non contenu dans P’; on peut écrire 


Eo, = E, = H*(By,, Zp) ® A(x) @ AP’, 


ou y est un générateur de H**(By,,, Zp) ; on en tire aisément 
aS 


la Boo = H* (Bry, Zp) (Brn, Zp)) @ AP", 
et Hos, qui ne contient plus d’élément non nul de degré base = 2s +1, vu 
(9.4), est foreément égal 4 H,. Remarquons encore que s est la hauteur de 
Pélément o*(y), ott y est un générateur de H?(Br,,, Zp), et o* est Phomo- 
morphisme caractéristique de la fibration (G, G/Tm, I'm, 
Soit maintenant [,, invariant dans G. Si p est impair o*(y) fait partie 
| de tout systéme de type (J) de générateurs de H*(G/Tm, Zp), et sa hauteur 
); | est une puissance de p, (1.4). Si p—2, o*(y) ne fait pas forcément partie 


ot dun systéme de type (J/), mais on vérifie facilement 4 V’aide de (1.4) que 
| dans Palgébre de cohomologie mod 2 d’un H-espace, les éléments de degré S 2 
p. | ont toujours comme hauteur une puissance de deux. 
‘ CoROLLAIRE 10.2. Le polyndme de Poincaré mod p de G/Tm est 
P,(G/Tm, t) = P,(G, t) (1 — #8) (1 — t) (1 — 
PROPOSITION 10.3. On conserve les hypothéses du lemme 10.1 et on 
suppose de plus que p est impair. Alors, 
ue 
H*(G/Tm, Zp) = A (a) @ Zy[y]/(y*) @ AP’. 


En effet, d’aprés (9.4) et le lemme, on a dans l’algébre spectrale de 
(G/Tm; Br, G, q) 


= = N(a) @Z,[y]/(y*) AP, - 
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et comme on est en caractéristique impaire, l’algébre extérieure AP’ se 
“remonte ” dans H*(G/Tm, Zp), et Yon peut appliquer (7. 4). 


PROPOSITION 10.4. On suppose p= 2, I invariant dans G, et on con- 
serve les hypothéses du lemme 10.1. Alors si m == 2 mod. 4: 


H*(G/Tm, Zp) = Z.[a]/(a**) @ AP’, (Da = 1), 
si m= 0 mod. 4 et si H'(G, Z,) = 0, 
H*(G/T'm,Z2) = A (a) @ Z2[y]/(y*) @ AP’, (Da = 1, Dy =2). 
D’aprés (9.4) et (10.1), on a dans V’algébre spectrale de (G/T'm, Br,,, G, q) 
= Z,(a)/(a*) @ AP’ si m=2 mod. 4, 
E, = /\(a) @Z.[y]/(y*) AP’ si m =0 mod. 4, 


(Da = 1, Dy =2); comme nous supposons invariant, on déduit 1’iso- 
morphie de H*(G/T'm,Z,) avec FE, de (7.5), (qui s’applique aussi dans le 
deuxiéme cas puisque nous y supposons Ht — 0). 


ProposiTion 10.5. On suppose que H*(G,Z.) = AP posséde un 
systéme simple de générateurs universellement transgressifs formant une base 
de P. Soit T, un sous-groupe d’ordre deux de G et soit s le premier degré pour 
lequel p*.(T2, G) est 40.7 Alors, dans Valgébre spectrale de (G/T2, By,, G), 
on a 

= = Z,[a]/(a*) AP’ 


(P’ sous-espace gradué de P, dim P’ —1, dim P’** = dim P**—1). ST, 
est invariant, s est une putssance de deuz. 


Nous laissons au lecteur la démonstration, qui est pratiquement identique 
a celle de (10.1), si l’on tient compte du fait que toute base de P est un 
systéme simple de générateurs de H*(G,Z.,) d’aprés (2. 16). 

On pourrait aussi étudier le cas ot m est divisible par 4, mais l’énoncé 
est plus compliqué car on doit envisager deux possibilités pour l’algébre 
spectrale, suivant que s est pair ou impair, et nous ne le reproduirons pas ici. 


De méme, si l’on cherche 4 étendre (10.4) au cas systéme simple de géné- 


rateurs, on se heurte 4 quelques complications et nous nous bornerons au cas | 


particulier suivant: 


PROPOSITION 10.6. On suppose que H*(G,Z.) posséde un systéme 2- 
semi-libre de générateurs universellement transgressifs J = (1 
Dz, < Dx; sii<j). Soit T. un sous-groupe invariant d’ordre deux de G et 


e 
P 
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soit s le premier degré pour lequel p*.(T2,G) est non nul. Alors J contient 
un élément de degré s—1, soit et H*(G/T2, Z2) =Z,[a]/(a*) @ V, 
(Da =1), ou V est isomorphe a la sous-algébre de H*(G,Z2) qui est en- 
gendrée par les x; d’indices 1 Aj et par 


Soient J’ le systéme simple associé 4 J et P le sous-espace sous-tendu par 
les éléments de J’; ces derniers sont universellement transgressifs et les 
formules (10. 11) et (10. 12) leur sont aussi applicables; par conséquent, tant 
que d, est nulle sur J, sur J’, (ce qui a lieu pour r < s), elle est identiquement 
nulle, d’oi H, = EF,. De plus d, n’étant pas nulle il doit exister z;eJ, non 
annulé par d;, unique d’aprés ’hypothése Dr, < Dz, quand a <b. Le noyau 
P’ de la restriction de d, 4 P admet visiblement une base formée des éléments 2, 
(t7), de et de leurs puissances non nulles d’exposants 2, 4, 8,- 
comme J est 2-semi-libre, le sous-espace AP’ est une sous-algébre isomorphe 
a Valgébre V décrite dans l’énoncé, et (10.5) montre que dans l’algébre 
spectrale de (G/T:, Bz,, G) on a 


By = 4,[a]/(a*) OV; 


mais le systeme formé par les x, (1347) et 2;* est évidemment 2-semi-libre, 
et comme Da 1, H*(G/T., est isomorphe 4 H, d’aprés (7.5), ce qui 
établit notre proposition. 


11. Groupes quotients des groupes classiques. Dans le No précédent, 
nous avons vu que |’étude de H*(G/I'm, Z,) se raméne essentiellement 4 celle 
de p*,(I'm, G@) lorsque G posséde un systéme simple de générateurs univer- 
sellement transgressifs; dans le cas ot G@ est classique, cette condition est 
toujours vérifiée et p*,(Tm, G) se caleule aisément, comme nous allons le voir 
ici lorsque [',, est un sous-groupe du centre de G. 


LEMME 11.1. Soient Zm le sous-groupe des éléments d’ordre ‘m d’un 
tore a une dimension T*, p un nombre premier divisant m. Alors p*y(Zm, T*) 
est biunivoque. 


En effet, T*/Z,, est aussi un cercle, H*(Bm, Z,) est une algébre de poly- 
némes 4 un générateur de degré deux, donc, vu (9. 4) 


t) Py(T*/Zm, t) (Bn, t), 


Palgébre spectrale mod. p de est triviale, et 
est done biunivoque. 


LEMME 11.2. Soient (Zm)" le sous-groupe des éléments d’ordre m d’un 


ef 
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tore n dimensions, et p un nombre premier divisant m. Alors 
p*n((Zm)", I") est biunivoque. 
C’est une conséquence évidente de (11.1). Remarquons encore que 
p*,((Zm)", T”) se décrit facilement: Si p = 2 et si m = 2 mod. 4, 
et l'image de p*p)((Zm)", 7") est la sous-alg¢bre des polynémes en les 
Si p= 3 ou bien si p—2 et si m=0 mod. 4: 

H* Zp) = A (b1,° +5 On) (Dbi=1, Dyi = 2), 
et l’on peut visiblement prendre comme deuxiéme facteur (qui n’est pas 
univoquement déterminé), image de p*p((Zm)", T”). 

Nous passons maintenant aux quotients des groupes classiques par des 
sous-groupes de leurs centres, (pour les notations, cf. 0.7). 


THEOREME 11.3. Soient n un entier > 0 et s = 2* la plus grande puis- 
sance de deux divisant n. Alors, (avec Da =1, Dx;=1): 

H* (PSp(n), = Z.[a]/(a**) @ A (2s, * * 

D’aprés (8.3), les hypothéses de (10.1) et (10.4) sont vérifiées, il nous 
suffit done de prouver que 


(11.31). 4s = 2* est le plus petit degré > 0 pour lequel p*.(T2, Sp(n)), 
est ~0, T. étant le centre de Sp(n). . 


Soient J” le tore maximal de Sp(n) que forment les matrices diagonales | 
4 coefficients complexes, (Z.)" le sous-groupe de ses éléments d’ordre 2; les 


inclusions : 
Tr, C (Z.)" C C Sp(n), 
montrent que 
p*2(T2, Sp(n)) = p*2(T2, (Z2)") p*2((Z2)", p*2(T", Sp(n)). 

L’homomorphisme p*.(T", Sp(n)) est biunivoque, et son image dans 

H*(B qn, Ze) = Un], (Du; = 2), 
est ensemble des fonction symétriques en w,7,- - -,Un?, ([3], § 29); par 
conséquent, vu (11.2), l'image de p*.((Z2)", Sp(n)) dans 

H* (Be,)n, Z2) = Z2[4,° tn), (Dr;=1), 


est l’ensemble des fonctions symétriques en +, tn‘. 


® Comme d’habitude, une variable surmontée d’un * doit étre omise. 
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Il reste 4 calculer p*.(T., (Z2)"). Or, on peut identifier Z, et (Z.)” 
aux premiers groupes d’homologie (entiére ou mod. 2) de leurs espaces classi- 
fiants de maniére a ce que l’inclusion T, C (Z.)” se transporte en la restriction 
de ps(Z2, (Z2)") a la dimension 1.° D’autre part (Z.)" est formé des matrices 
diagonales dont les coefficients valent + 1, et l’élément non nul de YL., est 
la matrice —1; si v, resp. *, Un sont des systémes de générateurs de 
H,(Br,,Z2), resp. H,(Biz,)n, Z2), Pimage de v par py, sera 1; 
(pour un choix convenable évident des v;), d’ou, z, resp. %,° +, 2, étant 
les bases duales des premiers groupes de cohomologie 


p*2(T2, (Z2)") (ti) = 2, (lStSn), 


(o; iéme fonction symétrique élémentaire) ; mais on sait que 


(11. 32) (4) =0mod.2 pour i < 2%, (ox) = 0 mod. 2, 
(n = 2*-n’, n’ impair), 
ce qui, joint aux propriétés déja établies de p*.((Z2)", Sp(m)), donne (11. 31). 
THEOREME 11.4. Sotent Ty, le sous-groupe d’ordre m du centre T, de 


SU(n), p un nombre premier divisant m et s = p* la plus grande puissance 
de p divisant n. Alors, avec Dx =1, Dy = 2, Duy =1: 


H* (SU Zp) = Zplyl/(y*) ® A (15 
pour p = 8, ow bien p= 2, m=0 mod. 4, et pour p = 2, m =0 mod. 4: 
H* (SU (n)/Tm, Z2) = A (as, * * 
D’aprés (8.3), (10.3), (10.4), il nous suffit de montrer que: 
(11.41) 2s = 2p* est le premier degré > 0 pour lequel p*p(T'm, SU(n) ) 
est non nul. 


SU(n) est totalement non homologue 4 zéro dans U(n), relativement a 
des coefficients quelconques, done p*,(SU(n),U(n)) est sur, ([3], Cor. a la 
Prop. 21.3), et puisque 


(I'm, U(n)) = p*p(Tm, SU(m) ) p*p(SU(n), U(n)) 


il suffit de faire voir que (11.41) vaut aussi pour p*,(Tm, U(n)), ce qui va 
sétablir exactement comme (11.31). On part des inclusions 


® Pour une discussion plus détaillée d’un cas analogue, cf. [4], No. 3. 
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T* étant le tore maximal des matrices diagonales de U(n), d’ot Von déduit 
p*p (I'm, U(m)) = p*p(T'm, (Zm)") p*p((Zm)", p*p(T", U(n)). 
p*,(T", U(n)) est biunivoque, et son image est l’ensemble des fonctions 
symétriques en Yn, ([3], § 29). Sip 2, m = 2 mod. 4, on démontre 
comme plus haut que l’image de - +, Un) par p*p(I'm, I") est 

Si maintenant p = 3, ou bien p = 2, m =0 mod. 4 

H* Zp) = An) Yn], (Dar—1, Dy, = 2), 


et Pimage de p*,((Zmn)",U(n)) est, vu (11.2), Vensemble des fonctions 
symétriques en les y; Enfin ’homomorphisme p*p(Tm, (Zm)" dans 


H* (Br, Zp) = @Z,[y], (Db = 1, Dy= 2), 


est de nouveau transposé de l’inclusion, applique les 0; sur b et les y; sur y, done 


n 
(Tm, (Zm)") Yn)) = 
et (11.41) se déduit des relations 


=0mod.p sii< p*, Omod.p, (n=p*-n’, (n’,p) =1). 


THEOREME 11.5. Sotent n un entier > 0, s = 2* la plus grande pus- 
sance de deux divisant 2n. Alors, 


H*(PSO(2n), Z2) =Z.[a]/(a*) ® V, (Da = 1), 


ou V est une algébre unitaire ayant un systéme simple de 2n — 2 générateurs 
(1SiS n—1, i~2#—1, liés par les relations 14:17, = ty 
st 20S 2n — 1, sinon. 


PSO(2n) est le quotient de SO(2n) par un sous-groupe T, a deux 
éléments; (8.8) montre les hypothéses de (10.6) sont réalisées et qu'il 
suffira de faire voir que: 


(11.51). s est le plus petit degré > 0 pour lequel p*2(T2, SO(2n)) est 
non nul. 


L’homomorphisme p*,(SO(2n),O(2n)), ob O(2n) est le groupe ortho- 
gonal complet a 2n variables, est sur, ([4], Prop. 8.1), il suffit done d’établir 
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(11.51) pour p*.(T2,O(2n)). Soit Q(2n) le sous-groupe des matrices 
diagonales de O(2n), il est isomorphe a (Z2)*", done 


H* (Bowen, * Len], (Dz; = 1); 
les inclusions T, C Q(2n) C O(2n) montrent que 
p*2(T2, O(2n)) = p*2(T2, Q(2n)) p*2(Q(2n), O(2n) ), 
p*.(Q(2n), O(2n)) applique sur Vensemble des fonctions 
symétriques en les ([4], Théoréme 5.1). Quant a p*.(T2,Q(2n)) cest 


de nouveau en dimension un le transposé de l’inclusion, et comme dans le cas 
de Sp(n), on aura: 


et (11.51) résulte des propriétés (11.32) des coefficients binomiaux. 


Remarques. 

11.6. Si » est impair, s est égal 4 deux, le théoréme montre que l’on 
obtient H*(PSO(2n),Z.) a partir de H*(SO(2n),Z.) simplement en y 
remplacant h, par un élément de carré nul; en particulier, PSO(2n) a aussi 
un systéme simple de 2n—41 générateurs de degrés 1,2,---,2n—1. On 
pourrait du reste déduire s = 2 du fait que le groupe fondamental de PSO(2n), 
qui est isomorphe au centre de Spin(2n), est isomorphe a Z,, (0.7). 

Par contre si n est pair, s est > 2, (ce qui si l’on veut correspond au fait 
que 7;(PSO(2n)) = Z.+ Z.), et H*(PSO(2n),Z.) a un systéme simple 
de 2n —2 générateurs. 


11.7. Nous n’avons étudié la cohomologie mod. p de G/T que dans le 
cas ol p divise m, le seul qui présente quelque intérét. Lorsque (p,m) —1, 
il suffit d’appliquer (9. 2). 


IV. Le groupe des spineurs. 


12. Cohomologie du groupe des spineurs (résultats). 


THEOREME 12.1. (a). Pour p impair, Spin(n) n’a pas de p-torsion et 
H*(Spin(n),Z,) est isomorphe a H* (SO(n), Z,). 


(b). Spin(n) a de la 2-torsion si et seulement sin = 1, et ses coefficients 
de torsion sont tous égaux a deua. 


(c). Soit s(n) Ventier tel que 22M-* <<nS2%™, L’algébre H*(Spin(n), 
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Z2) posséde un systéme simple de n—s(n) générateurs * Un-s(n)—1) U3 
le dernier est de degré 28 —1 et la suite des degrés Duy, Dus, Dun-s(ny)1 
s’obtient a partir de la suite 3,4,---,n—1 en y supprimant toutes les 


puissances de deux. Ces éléments vérifient les relations sutvantes: 
Sqi Pui) si iS Duy i+ Duy = Dug; = 0 sinon; =0 
qiuj= (5 st tS Duy t+ Duy = Dux; Sq'uj = 0 sinon; = 0. 


12.1(a) résulte de (8.6) et (9.3); donnons maintenant les valeurs des 
degrés Du;, Du pour n = 9. 


n s(n) >, Du 
3 2 3 
4 2 3 3 
5 3 3 j 
6 3 3, 5 v4 
3 3, 5, 6 7 
8 3 3, 5, 6, 7 v4 
9 4 3, 5, 6, 7 15 


13. Cohomologie mod. 2. Nous diviserons la démonstration de 12. 1(c) 
en trois parties: (i). Détermination d’un systéme simple de générateurs, (i1). 
Détermination des u; vérifiant les Sq‘ annoncés, (iii). Détermination d’un 
générateur wu de carré nul complétant le systéme des 1%. 


(i). Un systéme simple de générateurs. L’espace homogéne Von 
= SO(n) /SO(2) est aussi le quotient de Spin(n) par l’image réciproque 
de SO(2) dans Spin(n) ; cette derniére est connexe puisque Vy,»-2 est simple- 
ment connexe, c’est un revétement 4 deux feuillets de SO(2), done un cercle, 
d’ot une fibration (Spin(n), Vnn-2, $i, q) dont nous voulons étudier l’algébre 
spectrale sur Z,. On déduit immédiatement des propriétés de H* (Vn,n-2, Z2) 
rappelées au No. 8: 


13.1. L’idéal engendré par h, posséde, en tant qu’espace vectoriel sur 
Z2, une base formée des mondmes (41 < %), OU 
Yun des 1; au moins est une puissance de deux. 


13.2. Les n— s(n) —1 éléments h; dont les degrés ne sont pas des 
puissances de 2 constituent un systéme simple de générateurs d’une sous- 
algébre unitaire K de H*(Vu,n-2,Z2) qui, en tant qu’espace vectoriel, est un 
supplémentaire de Vidéal (he). 
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13.3. La hauteur de h, est égale 4 28-1; Vannulateur de he, soit Ann. 
ho, est Pidéal engendré par |’élément: 


h* = h. hs Rosm- = (Dh* == 28(n) 2) 


Yapplication y—y-h* est une application linéaire biunivoque de K sur 
Ann. ho. 


Nous pouvons maintenant passer a Valgébre spectrale sur Z, de 
(Spin(n), On a tout d’abord: 


E, = H* (Van-2, Z2) H*(S:, Z2), 
E, = Eo. 


Soit 2 V’élément non nul de H1(S,,Z.); comme Spin(n) est simplement 
connexe, on doit avoir #;°* —0 et x ne peut étre un d--cocycle, d’ou 


d,(1®@2) =h.@1, 


ce qui détermine complétement d2, comme on sait; les d2-cocycles forment 
la sous-algébre 


(Ez) = Ann. @ + H*(Van-2; Z2) @1, 


et les ds-cobords la sous-algébre: d.(H2) = (hz) @1. 

En tant qu’espace vectoriel, Ann. h, ®a-+ K @1 est un supplémentaire 
de d.(H,) dans C(F,) d’aprés (13.2); mais, vu (13.3) et (13.1), cette 
sous-algébre admet un systéme simple de générateurs formé des n — s(n) —1 
éléments h; dont le degré n’est pas une puissance de deux et d’un élément 
h* @az de degré égal 4 28 —1; comme elle est appliquée isomorphiquement 
sur H; par xs", on voit que #;=#,, done aussi ([3], Prop. 8.1) que 
H*(Spin(n),Z:) possede un systéme simple de n—s(n) générateurs dont 
les degrés sont égaux aux entiers compris entre 3 et n —1 et non puissances 
de deux et 4 28) — 1, ce qui établit (i). 


(ii). Détermination des uy. Soient (ui), (1Si<n—s(n)), les élé- 
ments g*(h;), (7 non puissance de 2), rangés par ordre de degrés croissants 
et soit u* un représentant dans H*(Vnn-»2,Z2) de h*®@z. Vu (8.7) et 
(8.11), Sq*u; a bien la valeur énoncée dans 12.1(c); de plus les w% sont 
représentés dans #, par les éléments x~*(hi), (¢ non puissance de deux), qui 
forment avec h* ® x un systéme simple de générateurs de Z., par conséquent 
([3], Prop. 8.1), les u; et u* constituent un systéme simple de générateurs 


de H*(Spin(n), Z2). 
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(iii). Le générateur u. Posons U = q*(H*(Van»2,Z2)) et a(n) 
= 28") 1, De (i) et (ii) on tire que U est isomorphe 4 H* (Vy,n-2, Z2) / (he) 
et contient tous les éléments de degrés < a(n) de H*(Spin(n),Z.), que 
H (Spin(n),Z.)/U*™ est de dimension 1, et que tout élément de degré 
>a(n) de H*(Spin(n),Z.) est décomposable. Un systéme de type (J) 
de générateurs de H*(Spin(n),Z.) comprendra done un systéme de type 
(M) de générateurs de U, soit J, complété par un élément de degré a(n) 
soit wu. 


Nous voulons montrer que la hauteur k de wu vaut 2. Comme (J, w) est 
2-semi-libre d’aprés (1.4), on a: 


(13.4) P.(Spin(n),t) =P.(U,t). (14 + 4 Daln)) 


mais nous savons déja que les u4, (resp. les wu; et u*), forment un systéme 
simple de générateurs de U, (resp. de H*(Spin(n), Z,), d’ou 


(13. 5) P,(Spin(n), t) = P.(U,t). (1+ 1), 


ce qui, joint a (13.4), donne bien k = 2. 

u étant ainsi de carré nul, et (J,u) étant 2-semi-libre, on voit que 
H*(Spin(n),Z.) est isomorphe 4 U@/A(u), done que (u,u) en est un 
systeme simple de générateurs. 


14. Cohomologie entiére. Nous savons déja que Spin(n) n’a pas de 
p-torsion pour p~ 2; d’autre part, 12.1(a), (c) et le No. 8 montrent que 
pour n< 7, Spin(n) a méme polynéme de Poincaré mod. 2 qu’en carac- 
téristique zéro, il n’a donc pas de 2-torsion; naturellement cela résulte aussi 
du No. 8 et des isomorphies classiques: 


Spin(3) = Sp(1), Spin(4) = Sp(1) X Sp(1), 
Spin(5) = Sp(2), Spin(6) = SU(4). 


Par contre, pour n=7, P.(Spin(n),¢) contient ¢ et ¢®, qui ne figurent 
pas dans P,(Spin(n),¢), et Spin(n) a bien de la 2-torsion. Pour terminer 
la démonstration de 12.1(b), (et de 12), il nous reste done 4 prouver que 
les coefficients de torsion de Tors. H*(Spin(n),Z) sont tous égaux a deux 
pour n= 7. 


14.1. Soit X un polyédre fini. On sait que l’homomorphisme 
Sq': H*(X, Z,) H*(X, Z2) 


il 
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n’est autre que l’homomorphisme de Bockstein attaché a la suite exacte de 
coefficients : 
2325272250 


suivi de la réduction mod. 2; il en résulte que dim. Sq'(H*(X, Z.)) est égale 
au nombre des coefficients de torsion de Tors. ,H***(X,Z) qui sont égaux a 
deux; d’autre part les formules 


H*(X, = H*(X, Z) + Tor(H**(X, Z), Zs) 
H*(X, Z)) = H*(X, Z) Z = H*(X, Z)/Tors. H*(X, Z) 


montrent que dim. H*(X,Z.) —dim.H*(X,Z,) est égal au double du 
nombre des coefficients de torsion de Tors.,H*(X,Z). Par conséquent: 


(14.11). Pour que les coefficients de torsion de Tors. xH*(X,Z) soient 
tous égaux a deux, il faut et il suffit que 


2- dim. Sq'(H*(X,Z.)) = dim. H*(X, Z,) — dim. H* (X, Zo) 
ow encore, puisque le premier membre est toujours = aw second, que: 
2-dim. Sq'(H*(X, Z.)) 2 dim. H*(X, — dim. H*(X, 
D’aprés 12. 1(a), (c), et le No. 8, ona 
dim. H* (Spin(n), = 
dim. H* (Spin(n), Zo) = 2° (b(n) = [n/2]), 
il nous suffira donc d’établir que 
(14. 12) 2-dim. Sq'(H* (Spin(n), = 27-8 — Quin), 
14.2. Nous reprenons les notations du No. 13; on a done 
H*(Spin(n), 22) =U @ A(u) = A(t, +, @ A(u), 


ol U = q*(H* (Vann, Z2)) est stable pour Sq‘; soit (2;) une base d’un 
supplémentaire dans U du noyau de Sq'; comme H'(Spin(n), Z.) =0, il 
est clair que Sq'(u) eU, done que les vecteurs = @ 1 
et les vecteurs 


Sq' @u) = Sq' (xi) @u+ @1 
forment un systéme libre; par suite 
dim. Sq*(H*(Spin(n), = 2° dim. Sq?(U) 
et (14.12) résultera de 


(14. 21) 4-dim. Sq?(U) = 27-8) — Qr(n), 


316 ARMAND BOREL. 


14.3. Nous avons vu que 
U = q* 22) = H* Z2)/(h2) 5 
on peut donc écrire U = A @ B avec 
A A(het), (2 Si <s(n)), pour n impair, 
A A (hot) @ A(hn-), (2 St <s(n)), pour n pair, 


hing), (5 S mi Sn — 2, m;,+ 1 pair, non puissance de 2). 
i i 
Nous noterons c(n) le nombre de facteurs A (hm,, Mmjs1) de B, done 


c(n) = b(n) —s(n) +1 (n impair), 
c(n) = b(n) — s(n) (n pair). 
Nous avons rappelé au No. 8 les Sq‘ de Vn,n-2; en particulier 
Sq' (hi) = his. pour impair S n— 2, Sq'(hi) = 0 sinon, 
si 20S n—1, sinon, 


ce qui montre que A et B sont des sous-algébres stables par Sq’, et en fait 
que Sq'(A) —0; par des calculs élémentaires, que nous ne reproduirons pas, 
on en déduit que (14.21) équivaut a 

(14. 31) 2-dim. Sq?(B) = 27 — 

14.4. Soit B,, (& impair = 5), la sous-algébre de B, stable pour Sq’, 
qui est engendrée par les h; d’indices = k; elle posséde un systéme simple de 
générateurs, formé d’un nombre pair, que nous noterons 2/(k), d’éléments h;; 
en particulier 

B, = B, 1(5) = c(n); l(k —2) =I(k) +1; 
Vinégalité (14.31) résultera done de: 
(14. 41) 2-dim. Sq'(B,) = 274%) — 21H), == 5, 7%,° 


que nous démontrerons par récurrence sur /(k), (done par récurrence descen- 


dante sur k). 


Cette inégalité est immédiate pour /(/) = 1, supposons-la vraie pour un 
indice k > 5, et soient, dans B,, Nx le noyau de Sq’, J, son image, S;, un 
supplémentaire de N;, et P; un supplémentaire de J, dans N;, (on a bien 
I, C N;, puisque Sqto Soit encore s l’indice tel que = +1, 
autrement dit tel que 


B,=A (hs, Nes1) By. 


gE. 
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L’espace Sq'(B,) contient: 

OL Hh+h 

Vo = + 18 we Sy} = Sy 

Vs = hes, Py = P, 

Vi= =h, + he @ v, v € Sy} = Sy. 
On vérifie facilement que des vecteurs 4,40, (vje Vi) sont linéairement 
indépendants, et comme d’autre part 

dim. S; = dim. J;, 
dim. B;, = dim. N;, + dim. J;, = dim. P; + 2: dim. i, 
on voit que 
dim. Sq'(B,) = dim. + dim. = 2+ dim. J; + dim. B,. 

Mais dim. B; = 2?" et par hypothése d’induction, 


2-dim. J, = 224) — 
d’ou finalement 
dim. Sq?B, = 224 — 4. 2210) 


2-dim. Sq'(B;) = Q21(k)+2 DUK) +1 921(s) 
ce. q. f. d. 


15. Eléments universellement transgressifs de H*(Spin(n), Z.). 


PROPOSITION 15.1. Dans les notations de 12.1, les éléments (uw), 
(1St<n—s(n)), sont universellement transgressifs. 


Soient « et 8 les projections canoniques: 


B 


Spin(n) —*— > SO(n) Van-o; 


mod. 2, B* est biunivoque, applique h; sur l’élément universellement trans- 
gressif de degré 1 de H* (SO(n), Z.), (voir [3], § 10, 23), soit 2; considérons 
dautre part le diagramme 


A 
Espin(n) E spincn) /T. 
id. 
B spin n) B spin(n) 


ou T, désigne le centre de Spin(n), et ob A, wu, v sont les projections canoniques ; 
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il est clair que v fait de E spin(ny/T2 un espace fibré principal de groupe struc- 
tural SO(n), dans lequel 2; est donc transgressif. Comme la restriction de A 
a une fibre est «, l’image réciproque par A* d’une cochaine de transgression 
pour 2; dans la fibration de droite sera une cochaine de transgression pour 
a*(a;) = a* o B¥ (hi) = q* (hi) (7 étant tel que Duj=7), dans la 
fibration de gauche, ce qui montre justement que wu; est universellement 
transgressif. 

Il reste 4 savoir si l’on peut compléter le systéme des uw par un générateur 
universellement transgressif ; nous verrons que cela n’est possible que si n S 9; 
plus précisément : 


Proposition 15.2. H*(Spin(n), Z.) a un systéme simple de générateurs 
universellement transgressifs si et seulement sin S 9. 


Compte tenu de 6.1 et de la Prop. 19.1 de [3], cela équivaut a: 


PROPOSITION 15.3. H*(Bspinin), Z2) est une algébre de polynémes si et 
seulement sin 9. 


Pour étudier H*(Bspinin), Z2), nous nous servirons de l’algébre spectrale 
sur Z, de (Bspin(n), Bsoin), Br,) qui correspond a la fibration (Spin(n), SO(n), 
C2), (cf. 0.6). 


Remarquons que si H#, est une algébre de polynémes, il en est évi- 
demment de méme pour H*(Bspinn), Z2) ; d’autre part, si H*(Bspinny, Z2) 
est une algébre de polynémes, alors 6.1 et 12.1 montrent’qu’elle posséde 
nm—s(n) générateurs de degrés Du; +1, (lLSi<n—s(n)), et Du+1, 
autrement dit que sa série de Poincaré est: 


t=n-3(n)-1 


(15. 4) (1 — 


la Prop. 15.3 sera done conséquence de la: 


Proposition 15.5. (H,) Valgébre spectrale sur Z, de (Bspin(n), 
Bsown), Br,). Alors pour n=9, Ey, est une algébre de polynémes et pour 
n>9, EH, a pour le degré total une série de Poincaré différente de (15.4). 


La démonstration de (15.5) sera donnée au No. 15.18, aprés une série 


de remarques préliminaires. 


15.6. Soit X un espace et soit a—=a,'d.- - -a, un élément de degré 
j de H*(X,Z.2), (ae H™(X,Z,)). Alors Sqi-ta est une somme de mondmes 
qui ont la forme 


e’est une conséquence évidente des propriétés classiques des Sq‘. 


ce 


A 


et 


pa 


| a 
| 
( 
3 et 
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15.7%. On sait que H*(Bsom,Z2) = Z2[wWe,- -,Wn], (Duw,—t), est 
une algébre de polyndmes en les classes caractéristiques réduites w,, dont les 
i-carrés sont donnés par les formules de Wu Wen Tsiin: 


avec les conventions: w, —0; sik>n; (0) ==], (5) =0 sia<b 
et b 40, (voir par ex. [4], No. 8). En particulier 
(15. 8) S qi = + Woy 5 
on en déduit que les éléments 1 définis par 

sit n’est pas de la forme 2/ + 1 
(15.9) 

= Sq = Sq?! 0 - 0 Sq? o Sq* (we) 


(0<j< s(n)), forment aussi un systéme algébriquement libre de généra- 
teurs de H*(Bsgin), Z2). 


Notations. I(k) idéal de H*(Bson), Z2) engendré par OU, 
ce qui revient au méme, par Wo,° - -, Wx. 


L(k), idéal engendré par les éléments v2),,, (07 < k). 
a(n) = 28) —15; d(n) = 28-1, 


15.10. Soit QelI(k), (resp. Qe L(k) et k< s(n) —1), de degré j. 
Alors Sqi*(Q) (resp. Sqi*(Q) e L(k + 1)). 


C’est une conséquence évidente de (15.6), (15.8) et (15.9). 
15.11. Ona voi,, = mod. 1(27"), (1Sj<s(n)). 


La démonstration se fait aisément par récurrence sur j, 4 l’aide de (15. 8) 
et (15.9), et est laissée au lecteur. 


15.12. On a Sq@™vainyie L(s(n)) pour nS 9. 
La formule Sq?v; = w; + w.ws montre que (15.12) est vérifiée pour n < 5 
et que sin = 5 
mod. L(2), ou encore L(3) ; 


par conséquent, d’aprés (15.8) et (15.10): 
Sqtvs = Ws + + + mod. L(3) 


| 

| 
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ce qui donne (15.12) pour 5 n < 9 et de plus, pour n= 9; 
Vg ==W, mod. L(3) ou encore wye L(4) ; 


on en tire 4 l’aide de (15. 10): 


= mod. L(4), 


en particulier, si n = 9: 
Sq*vy5 = we, mod. L(4), done Sq*v,e L(4). 
Notons encore en passant que pour n = 10, on a: 
= + mod. L(4 
(15. 18) q (4) 
Sq?vo = + V7Vi0 mod. (4). 


15.14. Pour n> 9, Sq@™vanyarf L(s(n)). D’aprés (15.11), on peut 
écrire 
Va(n)s1 = + + 
(Qe —1), R de degré +1). On en déduit 
vacnysr = SQ™™ (Wainyar + Q) 
+ v7 (BR) + (varny/2), 

d’ol, vu (15.8), (15.10) et n> 9, done s(n) = 4: 

S$ Va(nys1 = Wainy Wainy sr + mod.-I(d(n) — 2)) 

Va (ny +1 = Vain) Vainysa Va(n)-1Vainy +2 mod. I(d(n) — 2). 


Sq*™vainys1 est un polyndme en les v;, dans lequel le produit Va(ny+2 
figure ainsi avec le coefficient 1; il ne fait done pas partie de l’idéal L(s(n)) 
pour s(n) =4,c.q.f.d. 


15.15. Nous pouvons maintenant étudier l’algébre spectrale de la fibra- 
tion (Bspin(n), Bsoin),Br,) ; puisque Bson) est simplement connexe, on a 
E, = H*(Bsow, Z2) © Z.[2] (Dz = 1), 
ce qui peut s’écrire 
E, = Z,[v2| Z2[vs,° +, Un] A(z) @Z.[27] ; 
Bspin(n) étant simplement connexe, on a = 0 et n’est pas un d,-cocycle, 


ce qui implique: 
d.(1®z2z) =v,@1; 


évidemment 


d,(1@ a?) 


vu 


3 
q 
n 
a 
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et l’on voit, a Vaide de la régle de Kiinneth et de 5. 2, que: 


= Vn] ® 27]. 


Notons encore que, la transgression commutant aux Sq‘, les éléments 


a’, x4, 8,- - -, sont transgressifs et de plus 

® = 1), (1 = < s(n) ), 
doa(n)+1Ked(n) +1" (1 ) = Kod(n)+1— (Sq@™ van) +1 1). 


15.17%. A montrer: Pour 2S rS 2d(n) = 28, ona =E,,, 
nest pas de la forme 2/ + 1, et 


Era = H* Z.)/L(j +1) @ ] 
(0<j<s(n)), 


Nous avons déja calculé #;; supposons (15.17) établie pour F,, ot k < 2d(n) 
alors 
EL, =U®@YV, U = V = 


et V est une algébre de polynédmes engendrée par 2’, ot s est une puissance 
de deux. Si &2/+ 1, V est engendrée par un élément transgressif de 
degré ~k—1, done d,(1®V) —0, don, ici, d,=0 et = Ey; Bi 
maintenant = 2/-+ 1, (1Sj <s(n)), alors 


Ey, = = H* (Bsoqn, 42)/L(j) 
Ey, = H* (Bsow, Z2)/L(j) A(z”) @ Z,[4""], 
et, vu (15.16), il en résulte de nouveau par (5.2) et la régle de Kiinneth: 
= H* Z2)/L(j + 1) @ 


15.18. Démonstration de la Proposition 15. 5. Nous savons par (15. 17) 
que 
(15. 19) = Zo[V2,° +, Unj/L(s(n)) @ 


Soit n= 9. Les formules (15.12) et (15.16) montrent que deainys, est 
nulle sur Z,[2?4)**], done identiquement nulle sur comme 274) 
est un d,-cocycle pour tout r > 2d(n) + 1, les différentielles ultérieures seront 
aussi identiquement nulles, et #. = Eeainy41 est une algébre de polynémes. 


Si 2 > 9, doacnys1 n’est pas nulle sur 
Kea(n)41- (1 


vu (15.14), (15.16) et (15.19); par conséquent la série de Poincaré de 


4 
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et, a fortiori, celle de seront majorées par celle de qui est 
justement égale 4 la série (15.4), ce qui termine la démonstration de la 


Prop. 15. 5. 


15.20. Nous ne chercherons pas 4 déterminer complétement la structure 
de H*(Bspin(n), Z2), qui parait étre assez compliquée dans le cas général, et 
nous nous bornerons 4 décrire H*(Bspiniio), Z2). D’aprés (15.13), (15. 16) 
et (15.19), on a 


Ey, = +, (4) Z2[2"*] 
Ey, = Z2[ v4, Ve, Vz, Va, V10] @ A(2z**) @ Z,[2*?] 
(1 @ = (07 * Vio @ 1) 
d’ou, par un calcul facile, 
E13 = Z2[ V4, Vey V2) Vs) Vio] / (V7 * V10) @ 


(cela résulte aussi du reste du lemme 11.1 de [3] appliqué au cas du systéme 
simple, cf. [3], § 16). Par conséquent H,, = et 


dggkg3*(1 ® = 10) ® 
Par ailleurs 


= (Wet Wz + Ws + Ws + W3*W10)* W107 wz? * We" W103 


Ws, Ws, Wy sont dans L(4), (voir démonstration de 15.12), et les expressions 
de E,, et Hi, écrites au début de (15.20) montrent que L(4) et le produit 
= Wz" Wio font partie du noyau de ainsi ds, est nulle sur image 
de dans done identiquement nulle; les différentielles ultérieures, 
toujours nulles sur un élément de degré 32, seront alors aussi identiquement 
nulles, et = = Hy. L’algébre de polynémes Z,[2*?] peut évidemment 
se “remonter” de H, 4 H*(B spincio), Z2), on peut appliquer (7.4), ce qui 
donne finalement: 


PROPOSITION 15.21. L’algébre H*(B spina), Z2) est tsomorphe a 
We, Wz, Ws, (Wz *W10), (Dw; 
16. Sur l’homologie du groupe des spineurs. En combinant (2. 8), 
(12.1) et (15.2), on obtient: 


THEOREME 16.1. Pour pA 2, Valgébre H,,(Spin(n), K,) est une algébre 
extérieure, anticommutative, isomorphe a4 H*(Spin(n), K,). 


4 

( 

le 
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Si n9, H,(Spin(n),Z.) est une algébre extérieure commutative a 
n-—s(n) générateurs (v;), (1 Sisn—s(n)); le degré de est égal 
a 28") —1 et la suite des degrés Dv,,- - +, Dvn-s(n)-1 S’obtient en supprimant 
les puissances de deux dans la suite 3,4,---,n—1. 


Nous nous proposons maintenant de déterminer le produit de Pontrjagin 
dans H,,(Spin(10),Z.); a cet effet, nous aurons besoin de quelques pro- 
priétés de V’algébre spectrale universelle sur Z, de Spin(10), qui font l’objet 
du Lemme 16. 2. 


Notations. Il sera commode de mettre en indices les degrés des généra- 
teurs aussi désignerons-nous dorénavant le systéme simple de générateurs de 
H*(Spin(10),Z.) décrit dans le Théoréme 12.1 par ws, Us, Ue, Uz, Uo, U*i5, 
(Du; = 7). 

D est Valgébre des éléments décomposables de H*(Spin(10),Z.) et 
Di=DN Hi(Spin(10), Z.). 


LEMME 16.2. existe H'®(Spin(10), Z.), congru a u*,, mod. D ° 
tel que l’on att, dans V’algébre spectrale universelle sur Z, pour Spin(10): 


(a). ts est un d,-cocycle pour 2SrS9 et 

ok107 (1 @ ts) = k10? (Wi @ Ue) FO. 
(b). EB, = wz, Ws, Wi0, @ A (te, Ur, Us, Uis)- 
(c). @ 1) KO. | 


t | Nous utiliserons sans autre commentaire le fait que les uw, (t < 15), sont 
2 universellement transgressifs (15.1) et que leurs images par transgression 
dans l’algébre spectrale universelle sont 40 (puisque FZ, est triviale). Vu 
t (15.21), on a: 


E, = Wa, We, Wz, Ws; Wi0) @ A (Us, Us, Ue, Uz, Ug, U* 15). 
d’ou évidemment, d, — d,; — 0, FE, = E, et 
us) = xy? (ws @ 1). 
les quatre u; suivants sont des d,-cocycles, et, en ce qui concerne u*,;, on peut 
en tout cas écrire: 
h = dyx.?(1 @ = @ 0, +: - + .0,@D,), 
e *° Comme le systéme (u,) contient u,? lorsqu’il est = 0, on vérifie aisément que tout 


élément congru & wu mod. D forme avec les u, un systéme simple de.générateurs. 


| 
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(a,ew,® A(us), a Die A (us, Us, Uz, Us) +, bs linéairement indé- 
pendants); l’élément h est forcément un d,-cocycle, d’ot d,(a;) =—0 et 
a4,=w,@1, (1S1Ss); ainsi, élément 


= U* 15 + Us3° (b, bs) 


est un d,-cocycle pour 2 =r 4, congru a u*,; mod. D.%°. On en tire par 
un calcul direct facile, ou l’aide de (5. 2), que: 


E; => Z2[ We, W7, Ws, Wio; W32|/ Wi0) A (Us, Ue; U7; Ug; > 
par conséquent d; = 0, L; = EF, et 
(16. 21) dexe?(1 @ us) = ke" (We @ 1). 


Par un raisonnement en tous points semblable 4 celui qui nous a conduit 4 
u’;, et A H;, on en déduit l’existence de xe A (te, Uz, Us), de degré 15, tel que 
= + soit un d;-cocycle pour 2 =r 6, et 


(16. 22) = Wz, We, A (Ue, Uz, Us, 


il en résulte que 
(16.23) d;x7z7(1 ue) = @ 1), 


et que x;?(Wi9 @ ue) est un d;-cocycle, non nul, qui engendre H,*°*. Mais LF, 
est triviale, et d,H,1°* 0 pour tout r > 7, il doit done exister un indice 
t{=7 et un élément z tels que: 


(16. 24) 0 (Wio @ = (2), (z © ; 


cependant, H,.%—0 pour 0< p33, Végalité (16.24) n’est donc possible 
que pour ¢ = 10 et 
ge C 


E,° est, d’aprés (16. 22), de dimension 2, engendré par ts, et ; comme: 
= kz? (Wz @ Uy) 
est 0 d’aprés (16. 22), on doit avoir 
+ C Us, (ceZz). 


et z = U5 est un élément congru 4 u*,,; mod. D qui vérifie 16. 2(a) ; l’assertion 
16.2(b), résulte de (16.22); enfin, H,7%° est, vu (16.22), engendré par 
@ 1), 


ce qui prouve 16. 2(c). 


Cc 
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Notations. h* et h,, seront les homomorphismes induits en cohomologie 
et en homologie mod. 2 par le produit de Spin(10). 

{vi,,...,i,} Sera la base de H,(Spin(10), Z.), duale de la base {uj,- - us} 
de H* (Spin(10), (t 6). 


H,,(Spin(10), Z.) opére sur H*(Spin(10), Z.), ainsi que sur lalgébre 
spectrale universelle, (cf. No. 3), et nous noterons ¥,,._4, Popérateur défini 
par 

LEMME 16.3. On a 15), et 

h* (5) = U15 ® 1 1 ® U15 Ug Us. 


Pour uj, (i < 15), il suffit le remarquer qu’un élément universellement 
transgressif est aussi primitif, ({8], Prop. 20.1). 


En utilisant les propriétés des opérateurs #, énoncées au No. 13, et (16.2), 
on obtient : 


(1 Us" Vi,,.. ® U15) ) (r 9). 
(16. 31) (1 = (dyok10°(1 ® Urs)) (K10°(W10 ® Ue) ) Oe, 


(1 ® Uis5 = (W10 ® De) K10" ® 1) 0, 
done 
(16. 32) Urs" ~ 0. 


Par ailleurs, 16. 2(b) montre que H*(Spin(10), Z.) ne contient qu’un élément 
non nul de degré strictement positif et < 15 qui soit un d,-cocycle pour 
2=rs 9, c’est wy; les formules (16. 31) et (16. 32) entrainent par conséquent : 


= Us 
U5 = 0, (1, < 15, (th, ° tx) AF (6)). 


Si ’on remonte a la définition de #,, No. 3.1, on voit que cela signifie pré- 
cisément que h*(w,;) a la valeur annoncée dans (16.3). 


THEOREME 16.4. H,(Spin(10),Z.) posséde un systéme simple de 
géenérateurs Vs, Vs; Ve, Vz, Vo, Vis, (Dui vérifiant les relations: 


UV = 0, 15), U5 J, (t, 7) (6,9)), 
Vp = Ve + 
On tire tout d’abord de (16.3): 


(16.41). A* est symétrique (au sens de la définition 2.3) sur 
A (Us, Us, Ue, Uz, Uo). 
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(16.42). A*(a-u5) = h*(a)(Uis 1+ 18 + h* (a) + (uy @ ue), et 
le premier addende du deuxiéme membre est symétrique si h*(a) Vest. Il 
est clair que 


<h* (a) (up @ Ue), Vi, =O Si ty, 
<h* (a) (uy @ ue), 41 si (1,7) A (9, 6). 
ce qui, joint 4 (16.41) et (16.42) montre que 
—= % pour (i <j, (i,j) (6, 9)), 
et, par récurrence sur 7, que 
donc en particulier que (v;) est un systéme simple de générateurs. 
De (16.3) et (16.42) on tire que 


ne contient jamais de terme de la forme u;® uj, d’ow les égalités v4, vy 45 = 0. 
Enfin H'*(Spin(10),Z.) est engendré par w,,5 et par 


D* == H**(Spin(10),Z2) A (us, Us, Ue, Ur, Us) 5 
par définition des 4; on a <D**, v15> = 0, et, vu (16.3), on a 
KZ, Ve\/ Vo> = KX, Va Ve> (2 e D**) 
Ve> = Vis> = 1 


Ve \/ = 0 
ce qui implique 
Vo = Up Ve + 
Remarque. Nous ne pousserons pas plus-avant l’étude de H,(Spin(n), 
et nous bornerons 4 signaler qu’en s’appuyant sur les résultats des Nos. 15, 16, 
et sur un théoréme non encore publié de H. Cartan on peut voir que cette 
algébre n’est pas commutative pour n= 10. 


Principe de démonstration. Nous reprenons les notations de 12. 1(c) et 
soit {v;,,...,4,} une base de H,(Spin(n),Z.), duale de la base formée par les 
mondémes basiques associés au systéme simple (wi,u), (l1<i<n—s(n), 
correspondant 4 wm, (1S1< n—s(n)), et @U. Les u, étant primitifs, 
on voit tout d’abord aisément que 4, \V = 0 (1 <i < n—=s(n)); si main- 
tenant H,(Spin(n),Z,) était commutative, l’application diagonale en ferait 
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une algébre de Hopf, 4 laquelle on pourrait appliquer les résultats du No. 1; 
les éléments v;, (1 <<i<n—s(n)), feraient alors partie d’un systéme de 
type M lequel le dernier générateur v* = vin) mod. D serait de carré nul, 
(méme argument que dans 14(iii)), et ainsi H,(Spin(n),Z.) serait iso- 
morphe & A(v1,° *, Un-s(n))- Mais alors, d’aprés H. Cartan, H*(B spin(n), 22) 
aurait méme série de Poincaré qu’une algébre de polynémes & n—s(n) 
variables de degrés Du; + 1, Du+ 1, ce qui contredit (15.5) pour n= 10. 


V. Les deux premiers groupes exceptionnels. 


17. Le groupe G.. On sait que le groupe compact G, peut étre envisagé 
comme le groupe des automorphismes des octaves de Cayley; il est de rang 2 
et posséde 14 paraméetres. 


LEMME 17.1. G, contient un sous-groupe isomorphe a SU(2), tel que 
Vespace homogéne G./SU(2) soit homéomorphe a la varvété de Stiefel V;,. des 
2-repéres orthonormauz de R’. 


Soient (¢), (t©Z;), une base des octaves purement imaginaires, les 
systémes quaternioniens étant les triplets ¢i+1, Soient f; et deux 
octaves purement imaginaires orthonormaux, relativement a la métrique 
usuelle; posons et soit f; orthonormal a fi, fo, fs; On voit sans 
difficulté que 


fs =fae'fs, o=fs' fs, fr =fatfs, 


forment avec les f;, (1 [i 4), un systéme libre et que la correspondance 
&—fi, (te Z,) définit un automorphisme des octaves, (pour plus de détails, 
voir N. Jacobson, Duke Mathematical Journal, 5 (1939), pp. 776-783). 
Cela montre: 


(i) G, est transitif sur les systémes f;,f, de deux octaves purement 
imaginaires orthonormaux, autrement dit sur V;,». 


(ii). Les éléments de G, qui laissent f; et f, fixes sont en correspondance 
biunivoque avec les vecteurs unitaires f; de l’espace 4 4 dimensions orthogonal 
a f1, fe, fs, c’est a dire avec les points de S;; le groupe d’isotropie relativement 
a V,. est donc homéomorphe a S;, il est bien isomorphe 4 SU(2). 


THEOREME 17.2. H*(G.,Z) posséde un systéme de 2 générateurs hs, hiy 
=1), jouissant des propriétés suivantes: 


(a). = = Ris = 0. 
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(b). H*(G.,Z) est la somme directe faible des sous-groupes cycliques 
engendrés par les éléments d’ordre infini hg, hii, et par les éléments 
d’ordre deux h;°*. 


Compte tenu de la structure de H*(V;,.,Z), rappelée au No. 8, ce 
théoréme résulte visiblement du: 


THEOREME 17.3. (a). H*(S3 X V72,Z) est Valgebre graduée associée 
a H*(G.,Z) convenablement filtrée. 


(b). Pour p> 2, H* (G2, Z,) = A (#3, 211), (Dx; = 1) ; les purssances 
réduites y sont triviales si et = 


(c). H*(G.,Z.) posséde un systéme simple de générateurs universelle- 
ment transgressifs 13, 25,2, (Dx; = 7), liés par les relations: 


= 25, Le, — U3, Sqta; —0 sinon. 


Dans la suite spectrale de la fibration (G2,V;,2,S;) on a pour T=Z 


ou 
E, H*(V;2,T) H*(S;,T), 


(@ sur ©), LH, = pour des raisons de degrés évidentes, ce qui établit 
17. 3(a). 

Considérons maintenant la fibration Spin(?)/G.—S,, (voir [1]), et 
soit 7 Vinclusion d’une fibre. D’aprés le No. 8, 12.1(a) et.17.3(a), on a 
pour les polyndmes de Poincaré: 


P,(S;, t) Pp(Ge, t) = P,(Spin(7), t), (p nul ou premier), 


et ainsi G, est totalement non homologue a zéro mod. p dans Spin(7),!? en 
particulier, H*(G@.,Z,) admettra comme systéme simple de générateurs les 
éléments i*(u;), (notations de 12.1(c), (1Si3)), dot les Sq‘ de 
17. 3(c) ; ’élément de plus petit degré positif 1*(u,) étant forcément 
universellement transgressif, il en sera de méme pour 


ce qui termine la démonstration de 17. 3(c). 


Enfin, mod. p impair, H*(Spin(7),Z,) s’identifie canoniquement 4 
H*(SO(7),Z,), posséde trois générateurs b,,b.,b,; de degrés 3,7, 11 et 


72 Pour un choix convenable de générateurs universellement transgressifs 2. 

*2T] Vest aussi relativement 4 la cohomologie enti¢re, comme on peut le voir en 
montrant que l’algébre spectrale sur Z de la fibration (Spin(7),S7z,G2) est triviale; 
cela résultera aussi de la remarque du No 23. 
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H*(G., Zp) est Valgeébre engendrée par 1*(b,) et (bs), et 17.3(b) résulte 
de 8. 9. 


THEOREME 17.4. Pour p¥2, H,,(G2,Z,) est une algébre extérieure 
anticommutative, isomorphe & H* (Gz, Zp). 

H,,(G2,Z2) est une algébre extérieure commutative, engendrée par 3 
éléments de degrés 3, 5, 6. 


C’est une conséquence de (2.8) et (17.3). 


18. Groupes opérant sur un espace fibré principal. Nous aurons 
besoin au No 20 de quelques remarques relatives 4 une situation souvent 
considérée par J. Leray (voir par ex. [14]) et aussi dans [3], aussi pour 
plus de clarté, voulons-nous énoncer ici, sous des hypothéses générales, les 
propriétés en question. 

Soient (/, B, G,q) un espace fibré principal connexe, de groupe struc- 
tural G, compact pour fixer les idées, U un sous-groupe fermé de G et NV un 
sous-groupe fermé du normalisateur de U dans G. Les éléments de N définis- 
sent des homéomorphismes de # compatibles avec la fibration (#, #/U, U,s), 
(i.e. ils envoient fibres sur fibres) ; par conséquent N opére sur les algébres 
de cohomologie de ces espaces et sur l’algébre spectrale de cette fibration. 
Supposons dorénavant U connexe. On a alors les propriétés suivantes: 


(a). Par Vintermédiatire des translations, N/N OU opére sur 
H*(E/U,Y), H*(£,Y), 


en commutant a s* et en respectant la filtration de H*(EH,T), (T, anneau de 
coefficients). 


(b). N/N OU opere sur Valgébre spectrale (E,) de E/U, U, s) pour 
r= il commuté avec Visomorphisme canonique de sur HX(E/U,T); 
sur identifié canoniquement a H4(U,T), il opére par Vintermédiaire des 
automorphismes turn, (ue U,neN). 


(c). Si T est un corps, tl existe un isomorphisme d’espaces vectoriels 
gradués de E, sur H*(E,T) commutant a N/N U."4 


Ces propriétés, qui sont bien naturelles, se démontrent aisément si l’on 


** Cela signifie qu’A tout élément de V/N M U correspond un endomorphisme (de la 
structure d’algébre bigraduée) de H,, (r=2), commutant a d, et & K,,", cette corre- 
spondance étant homomorphique. N opére sur toute l’algébre spectrale, mais NN U 
n’opére trivialement que pour 7 => 2, d’ow cette restriction. 

% Pour Ex il s’agit bien entendu du degré total. 
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remonte a la définition et aux principales propriétés de l’algébre spectrale des 
espaces fibrés (résumées par exemple dans [3], § 4), et si l’on utilise pour 
définir cette derniére des couvertures fines de HY et de H/U sur lesquelles N 
opére, (par exemple les cochaines d’Alexander-Spanier). 

N agit aussi sur la fibration (H/U, F/G, G/U, q) en laissant chaque fibre 
invariante, et sur G/U envisagé comme espace des classes de restes & gauche, 
il opére par les translations 4 droite; ces opérations jouissent des propriétés 
suivantes, analogues aux précédentes: 


(d). N/NOU opére sur H*(E,T) en respectant la filtration, opére 
trivialement sur H*(E/G,T), et commute a q*. 


(e). Supposons G connexe: Alors N/N ON U opére sur Valgebre spectrale 
(r= trivialement sur E.”°, et commute avec Vtsomorphisme cano- 
nique de E,°% sur H4(G/U,T). 


(f). Si T est un corps il existe un isomorphisme d’espaces vectoriels 
gradués de E,, sur H*(E/U,T) commutant a N/N N U.** 


19. Homologie et cohomologie de F,, (résultats). F, est de rang 4, 
a 52 paramétres, il agit transitivement sur le “ plan projectif des octaves ” W, 
variété 4 16 dimensions, et le groupe d’isotropie est isomorphe 4 Spin(9), 
(voir [1]) ; dans W le complément d’une droite projective, ” e. d’une sphere 
8 dimensions), est homéomorphe a 


(19. 1) H*(W,Z) =Z[x]/(2*), (Dz = 8). 
THEOREME 19.2. (a). Fy, n’a pas de p-torsion pour p+ 2,3 et 
H*(F,,Zy) = (25) Lis, Tos), (Dx, =i). 
(b). KF, posséde de la 3-torsion et 
H*(F,,Z;) = A (23, £7, 11, Lis), (Dz = 8, Dx; =1). 


(c). Fy, a de la 2-torsion; H*(F,,Z.) posséde un systéme simple 
de générateurs universellement transgressifs et H*(F 4, Z2) 
= H*(G. X Sis X S23,Z2) au sens du cup-produit et, pour D = 22, aw sens 
des 


(d). H*(F.,Z) est additivement et multiplicativement isomorphe au 
produit tensoriel U @ H* (Gz X Si5,Z), ow U est un anneau unitaire gradué 
dans lequel = U** = Z, U® = U*-U* = U** = Z,, Ut = 0 sinon; ses coeffi- 
cients de torsion sont donc égaur a 2, 3, ou 6. 


é 


d 


i 
4 0 
Se 
Sx 
au 
80] 


é 


GROUPES DE LIE COMPACTS CONNEXES. 


Remarques. 


19.3. Les isomorphismes 19.2(a) et (b) sont bien entendu valables 
pour la structure additive et le cup-produit; nous obtiendrons également les 
renseignements suivants au sujet des puissances réduites: 


(a). P = 21, Pz 


(b). = et (notations de 19.2(b)), provient 
de x; par Vhomomorphisme de Bockstein (pour un choix convenable de %, x). 


19.4. Pour p= 5, on peut supposer les 2; universellement transgressifs ; 
on a done toujours 
P pia; = Up,j' (dp,;* Zp, k= 1) 
et en particulier P,* est identiquement nulle sur H*(F,,Z,) pour p= 13, 


19.5. Nous ignorons si l’isomorphisme de 19. 2(c) est valable pour les 
i-carrés en général; en fait, comme nous le verrons, la seule question qui reste 
a trancher est de savoir si Sq*z,, est nul ou égal a 23. 


THEOREME 19.6. Pour p2,3 H,(Fs,Z,) est une algébre extérieure 
anticommutative, isomorphe H*(F4, Zp). 


H,,(Fs, 22) est une algébre extérieure commutative engendrée par 5 
éléments de degrés 3, 5, 6, 15, 23. 


Cela résulte de (2.8) et (19.2). Le reste de ce travail est consacré & la 
démonstration de 19. 2, 19.3 et 19. 5. 


20. L’homomorphisme H*(F,/Spin(9),Z) > H*(F,,Z). Nous 
choisissons une fois pour toutes une suite de sous-groupes emboités 
F, > Spin(9) D Spin(8) D T 


ou T est un tore 4 quatre dimensions, maximal dans chacun des trois autres 
groupes. Nous considérerons les projections naturelles 


F, F,/Spin(8) F,/Spin(9) = W et y= Boa 


Soient Ng le normalisateur de dans G et ®(G) (G=F,, 
Spin(9), Spin(8)). Ce dernier est le groupe de Weyl de G, qui s’identifie 
au groupe des automorphismes de 7 (ou du revétement universel de 7’), qui 
sont induits par les automorphismes intérieurs de @ laissant T invariant. 
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LEMME 20.1. &(Spin(8)) est invariant dans ®(F,) et ®(F,)/®(Spin(8)) 
est isomorphe au groupe o; des permutations de 3 objets; Np, fait partie du 
normalisateur de Spin(8). 


Bien que nous ne puissions guére indiquer de référence précise, ce lemme 
est connu et résulte trivialement de propriétés classiques de F,, et nous en 
donnerons la démonstration sans entrer dans tous les détails.'® 

Pour un choix convenable des coordonnées x; dans le revétement universel 
R* de T, les racines de F, s’écrivent, en omettant le facteur 24 /— 1: 


(20. 2) (i<j), 


ot (20.2) est l'ensemble des racines de Spin(8). Soit S*, la transformation 
contragrédiente de la symétrie 4 ’hyperplan 0 de la formule 


= B— 2(@, B)° (a, «)-*a, (a, 8 formes linéaires sur R*), 


ou (, ) est le produit scalaire défini par la métrique,’® on déduit immé- 
diatement que le systéme (20.2) est invariant par les symétries S*,, ou ¢ 
parcourt les racines de F,; comme le groupe de Weyl d’un groupe de Lie 
compact connexe G est engendré par les symétries aux hyperplans obtenus en 
égalant 4 zéro les racines de G, il en résulte que @(Spin(8)) est invariant 
dans ®(F,). 

@(F,) et 6(Spin(8)) sont d’ordres respectifs 192. 6 et 192; leur quotient F 
est bien d’ordre 6; on vérifie directement, ce que nous ne ferons pas, qu’il est 
isomorphe a o; (en fait au groupe des permutations de trois des sommets d’un | 
simplexe fondamental de Spin(8)); on peut du reste s’épargner cette 
verification en s’appuyant sur des résultats connus: En effet, nous avons 
vu que ®(F,)/®(Spin(8)) est un sous-groupe d’ordre 6 du quotient 
®’ (Spin(8))/@(Spin(8)), ot &’(Spin(8)) est le groupe de Cartan de 
Spin(8), c’est & dire le groupe de tous les automorphismes de F* laissant le 
systéme des racines (20. 2) invariant, et l’on sait que ce quotient est isomorphe 


a 


15 Pour tous les résultats de la théorie des groupes de Lie utilisés ici, voir par 
exemple E. Cartan, loc. cit., 0.7, F. Gantmacher, Ree. Math. Moscou N.S. 5 (1939), 
101-144, E, Stiefel, Comm. Math. Helv. 14 (1941-42), pp. 350-380. 

16 Au sens de la métrique définie par la somme des carrés des racines de F,. 

17Ce n’est qu’une autre maniére de dire que la quotient du groupe des automor- 
phismes de Spin(8) par le sous-groupe des automorphismes intérieurs est isomorphe 4 
73, (principe de trialité) ; en somme, (20.1) montre que tous ces automorphismes sont 
induits par des automorphismes intérieurs de F,. 
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Enfin, les automorphismes t > n-t-n-? (te T, n © Np,) laissant le systéme 
(20.2) invariant, il est clair que Np, est dans le normalisateur de Spin(8). 


LEMME 20.4. Aw point de vue additif, 
H*(F,/Spin(8),Z) = H*(W, Z) @ H*(S;,Z). 
En effet, Spin(9)/Spin(8) = S,, le terme 
= H*(W, Z) H* (Ss, Z) 


de la suite spectrale de la fibration (F,/Spin(8),W,S.) ne contient que des 
éléments de degrés pairs et est done égal a F,. 


LEMME 20.5. La représentation de Spin(8) dans 
H®(F,/Spin(8),Z) induite par les translations a droite est fidéle. 


I] suffira évidemment de montrer que la représentation correspondante 
dans H*(F,/Spin(8), 2) est fidéle (R corps des réels). 

Relativement 4 R, Spin(8)/T est totalement non homologue 4 zéro dans 
la fibration (F,/T, F,/Spin(8), Spin(8)/T), ([3], Prop. 26.1 et Cor. a la 
Prop. 18.3), done 


E, = FE, = H*(F,/Spin(8), R) H* (Spin(8), 


(ce qui du reste résulte aussi du fait que H. ne contient que des éléments de 
degrés pairs d’aprés la formule de Hirsch). 

Par ailleurs, si 7 est un tore maximal d’un groupe de Lie compact 
connexe (7, la représentation naturelle de ®(G) = N7/T dans H*(G/T, R) est 
équivalente & la représentation réguliére d’aprés J. Leray ([14], Prop. 11.1, 
ou [3], Lemme 27.1). En appliquant les remarques du No. 18 au cas 


on voit done que FL, (resp. 1 @ H*(Spin(8)/T, R)), est espace de la repré- 
sentation régulicre de @(F,), (resp. @(Spin(8))), et que &(Spin(8)) agit 
trivialement sur H*(F,/Spin(8),R)@1. Il s’ensuit évidemment que la 
représentation de o; = ©(F,)/®(Spin(8)) induite dans ce dernier espace 
est la représentation réguliére, et les 2 sous-espaces invariants de dimension 
® H®(F/Spin(8), Rk), H'*(FSpin(8),R) sont espaces de la représentation 
irréductible non triviale de os, qui est fidéle comme on sait. 


LEMME 20.6. On a a*(3x) pour tout xe H*(F,/Spin(8), Z). 


@(Spin(8)) opére trivialement sur H*(F,,Z) (puisque ces opérations 
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sont induites par des translations homotopes 4 lidentité), fidélement sur 
H*(F,/Spin(8),Z) et commute & a*; soit v un élément d’ordre 3 de o,, 
et soit A l’automorphisme de H*(F,/Spin(8), R) ou de H*(F,/Spin(8), Z) 
qui lui correspond ; cet espace étant de dimension 2 et la représentation étant 
fidéle, ’automorphisme A n’a pas de valeur propre réelle, et l’égalité 


(A?+A+1):(A—1) =A?—1=0 
entraine 


A*+A+1=0 
a*(A?(x) + +2) =0 


et le lemme, puisque 
a*(A(x)) =A(a*(x)) = 


PROPOSITION 20.7. L’image de y*: H*(W,Z) > H®(F.,Z) est égale 
& H*®(F,,Z) ; elle est soit nulle, soit tsomorphe a Z;. 


On obtient la seconde assertion en appliquant (20.6) 4 B*(z), ol 
engendre H*(W,Z). Pour établir la premiére il suffit de remarquer que dans 
Valgébre spectrale sur Z de la fibration (F.,, W, Spin(9), y), on a, vu (12.1): 


8H, = H®(W, Z) 
d’ot 
= et y*(H*(W, Z)) = H8(F,,Z).. 


21. Cohomologie mod. p de F,, (p42). Nous voulons démontrer ici 
19.2(a), (b) et 19.3, en déterminant l’algébre spectrale mod.p de 
(F.,W,Spin(9),y). Ona 


E, =H*(W, Z,) H*(Spin(9), Zp) = A (B1y ba, Bs, bs) 


(Dz=8, =—4i1—1, universellement transgressif). Evidemment, 
FE, 

Soit p~ 2,3; on tire de (20.7) que y*(3x) = 0, ce qui n’est possible 
que si 


dsxs?(1@ = xg” @ 1), (COZ, 0); 
les ds-cocycles forment donc la sous-algébre 


H*(W, Zp) A (baybsy bz) + 2? @ bz ® A (bi, bs, Bs) 
et 
d; (Ez) = (x) ® bs, b,) 


z 
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ainsi, H, est l’algébre extérieure engendrée par les éléments x,?(x? @ bz) et 
di), (t= 1, 3,4), de degrés 23, 3, 11, 15. Les différentielles d,, 
(r= 9) sont nulles sur les deux premiers; d’autre part on voit que 


et les différentielles d, (7 = 2) seront done toutes nulles sur les éléments 
transgressifs 1 @ b, et 1 @ b,; cela montre que d, —0 (r= 9), d’ou (compte 
tenu de la Prop. 8.1 de [3]): 


(435 11, L15, Hy = Hy = H* (Fi, 
Les polynémes de Poincaré de F, en caractéristique zéro et mod. p (p = 5), 
sont done égaux, d’ou l’absence de p-torsion et 19. 2(a). 


De plus, on peut prendre comme générateurs 23, %1, 2,5 les éléments 
univoquement déterminés tels que 


= (j = 1, 3, 4, 7 inclusion de Spin(9) dans F,), 
et les formules 


= 21, P23 = 15, 
résultent de (8.9). 


Soit maintenant p—3. L’algébre H*(W, Z) ne contenant pas d’élément 
non nul de degré 4 ou 12, les éléments universellement transgressifs bi, bs 
sont des cocycles pour toutes les différentielles, autrement dit font partie de 
Pimage de i*; il en sera alors de méme pour 


P et bs — 


(voir (8.9)), Spin(9) est totalement non homologue 4 zéro mod. 3 dans F,, 
d’ou, compte tenu de (7. 4): 


H* (F,, Z;) = =F, = H*(W, Z;) H* (Spin(9), 


ce qui établit 19. 2(b) ; on voit en outre que l’on peut prendre comme systéme 
3-semi-libre de générateurs de H*(F,,Z;) des éléments 2; tels que 


= y*(z), = b,, 
P 3123, = P 510115 


Enfin, y*(H*®(F,,Z)) est 40 done (20.7) d’ordre 3, d’ot le fait que pour 
un certain multiple 2’, de z, on a 


8* (27) = 7's, (27° 2's) 


(8*, homomorphisme de Bockstein), ce qui termine la démonstration de 19. 3. 
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Remarque. Nous venons de voir que H*(F,,Z) 40, ce qui, vu (20.7) 
donne: 
(21. 1) H®(F,,Z) =2Z;. 


22. Cohomologie mod. 2 de F,. H*(Spin(9),Z.) posséde un systéme 
simple de générateurs universellement transgressifs de degrés 3, 5, 6, 7, 15, 
(15. 2), et d’autre part y*(H*(W, Z.)) =0 d’aprés (20.5); on pourrait en 
déduire comme précédemment la structure de l’algébre spectrale sur Z, de la 
fibration (F,,W, Spin(9),y) et le fait que H*(F,,Z.) a un systéme simple 
de générateurs de degrés 3, 5, 6, 15, 23. Cependant, pour obtenir quelques 
précisions supplémentaires sur les 1-carrés et la transgression, nous retrou- 
verons et compléterons ce résultat par une méthode légérement différente, qui 
détermine aussi la cohomologie mod. 2 de Spin(9), mais utilise celle de G,. 
Nous choisissons une fois pour toutes une suite de sous-groupes emboités: 


F, Spin(9) DSpin(7) D 
tels que Spin(9)/Spin(?) = S,, et Spin(?)/G. =S,, (voir [1]), et soient 


A, » les projections naturelles 


F, F,/Spin(9) —W 
LeMME 22.1. H*(Spin(9)/G.,Z) = H*(S; X Sis,Z) = (uz, Us), 
(Du; = 1). 
En effet, l’algébre spectrale sur Z de la fibration 
(Spin(9)/G., Spin(9)/Spin(7), Spin(?)/G2) = (Spin(9)/G., S10, S:) 
est visiblement triviale. 


LEMME 22.2. Hi(Fs/G.,Z) =0 pour0 <i< 7, 1=9; A* est un tso- 
morphisme de H*(F./G.,Z) sur H*(F.,Z); en stat vu (21.1), 
H® (F4/G2, Z) = Z;. 


La premiére assertion résulte du fait que dans l’algébre spectrale sur Z 
de la ‘fibration (F./G.,W, Spin(9)/G:,) le terme EF, n’a aucun élément 
non nul de degré total i, pur0<<i<7,1=—9. 

On en déduit, & V’aide de (17.2), que dans l’algébre spectrale sur Z de 
(F 4, Fs/G2, Ge, 


(p+q—=—%q>0); 
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ce qui implique 


done que A* est un isomorphisme sur en dimension 8. 
LEMME 22.3. Soit p38. Alors H*(F,/G2,Z,) = H* (Sis X S2s, Zp) 5 
en particulier, F,/G. n'a pas de p-torsion. 
Dans l’algébre spectrale mod. p de W, Spin(9)/G2, on a 
= H*(W, Z,) ® H*(Spin(9)/G2, = Zp[x]/(x*) @ A(uz, Us) 
(Dz = 8), d’ot 
E, = Es; dgks"(1@ = 0; 
mais le lemme précédent montre que 
H?(F,/G., Z,) = H®(F./G2, Z) @ Z, = 0 
ce qui n’est possible que si 
dgks?(1 @ uz) = 1) ; 
on en déduit immédiatement que 
Ee = Ey = N(x? uz, U5) 
et le lemme. 


PROPOSITION 22.4. Le groupe G, est totalement non homologue a zéro 
mod. 2 dans F,; H*(F,, Z.) est isomorphe & H* (G2 X S15 X S23, Z2) pour la 
structure additive et le cup-produit, et, au moins pour D S 22, au sens des 
i-carrés. 


Nous désignerons par i l’inclusion de G, dans F, et par 43, Ys, Ye les 
générateurs universellement transgressifs de H* (G2, 

étant nul d’aprés (22.2), image par transgression dans 
F, de y, est nulle, autrement dit y; fait partie de l’image de 7*, et il ne sera 
de méme pour 


Ys" 


Ainsi, G, est totalement non homologue a zéro mod. 2 dans F,, et dans l’algébre 
spectrale de G2,), on a, vu (22.3): 


Ey = = H*(Fs/G2, Z2) @ H* (G2, Z2) = Y2s) A (Ys; Ys) 


(Dy, == 7), et ([3], Prop. 8.1), H*(F.,Z2) posséde un systéme simple de 
générateurs 23, tels que 


5, 
en 
la 
le 
es 
u- 
ui 
), 

Z 
de 
5 
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= Yi, (1 = 3, 5,6), 


(22. 5) 
A* (yi) = (1 = 15, 23). 


En fait, pour des raisons de degrés évidentes, les éléments 23, 2;,2_ sont 
univoquement déterminés par 22. 5, et engendrent une sous-algébre isomorphe 
a H*(G.,Z.), ce qui, compte tenu de (7.4) et de (17.3), donne 


H* = = H* (G6, Sis X Sos, 


Sq?r3 = 25, Sq'ts Xe, — Xo, Sqiz; —0 sinon, [6), et, joint 
aux égalités 
Sq*tis = A* (Sq‘yis) = 0 (0<tS%), 


= A* (Sqtyos) = 0 (t>0), 


démontre (22.4), (19.5), et (19.2(c)), mis & part cependant le fait que 
les 2; sont universellement transgressifs, qui fait l’objet de la Proposition 22. 7. 


LemME 22.6. Les générateurs et de H*(F4/G2, sont trans- 
gressifs dans la fibration (Be,, Br, Fs/G2, p(G2, Fs) ). 


Dans l’algébre spectrale mod. 2 de cette fibration, on a évidemment 


= 0, (q 0, 15, 23, r= 2), 


re =~ 


Y:5 est done transgressif et de plus 

= (p+q< 15), 
en particulier p*.(G., F,) est un isomorphisme sur pour D < 15; mais d’aprés 
17.3 et la Prop. 19.1 de [3]: 


H* (Be, Z2) = Z2[Vs, Ve, V7] (Dy=1), | 
d’ou 


ainsi, nous voyons que 


d,(E,%?8) C = 0, (r = 2), 


et est transgressif. 


Proposition 22.7%. Les générateurs x; (i = 3, 5, 6, 15, 23), de H*(F,, Z2) 
définis par (22.5) sont universellement transgressifs. 


C 
Cela est clair pour 2s, qui a le plus petit degré > 0, done pour 2; = Sq*a E 
et Considérons maintenant le diagramme 


| 
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By, Be, 


p(G:, F,) 
id. 


By, By, 
ou v est la projection naturelle: 


Er, => Ee, Be, = E¢,/G:. 


la restriction de v 4 une fibre G, est A et comme 415, Y23 sont transgressifs dans 
la fibration de droite, les éléments 2 (t= 15,23) seront trans- 
gressifs dans la fibration de gauche, c’est & dire universellement transgressifs. 


23. Cohomologie entiére de F,. 


PROPOSITION 23.1. H*(F/G2,Z) est additivement et multiplicative- 
ment isomorphe au produit U ® H*(S,;,Z), ou U est une algébre unitatre 
graduée définie par = = Z, U8? = U?-U? = U" = Z,, Ut =0 sinon. 

On a vu (22.2) que A*: H*®(F,/G., Z) H®(F,, Z) est un isomorphisme 
sur, et comme H*(F,,Z;) = H°(Fy,Z) @Z; est engendré par un élément 
de carré non nul, il en sera de méme pour H*(F,,Z) et H*(F./G2,Z) ; 
cela va nous permettre de construire l’algébre spectrale sur Z de la fibration 
(F;/G., W, Spin(9)/G2,). On a, compte tenu de (22.1), 


E, = H*(W, Z) © H*(Spin(9)/G2, Z) = Z[r]/(x*) @ A(uz, tis) = Es 
dgxg”(1 @ uz) = @ 1), 
les ds-cocycles forment done la sous-algébre 
2? @ uz; ® A(us) + @ A 


et les ds-cobords forment 
(32) A(u5), 


ce qui donne 
E, = H*(W, Z)/(3z) @ A(urs) + (2? uz) @ A(uis) = @ Aus). 


Comme F£,,1%° 0, il faut que disxis?(1@ 15) soit nul, ce qui entraine 
E,=£E,. Ainsi, E.. est formé d’un groupe Z en dimensions 0, 15, 23, 38 et 
d'un groupe Z, en dimensions 8, 16, 23, 31, ce qui montre déja 
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Hi(F,/G.,Z) =*E, pour 1 23 
H*® (F4/G.,Z) =Z ou Z+Z;; 
mais par dualité de Poincaré, puisque dim. F,/G, = 38, 
Tors. (F,/G., Z) = Tors. H'*(F,/G2, Z) = Tors. = Z; 


ce qui prouve que H**(F,/G.,Z) =Z-+-Z, =*E,, done que H* Z) 
est additivement isomorphe & F,,; que cet isomorphisme soit multiplicatif 
résulte alors directement des faits suivants: (a) H, est Valgébre graduée 
associ¢e 4 H*(F,/G.,Z) convenablement filtrée, (b) »* est un isomorphisme 
de H*(W,Z)/(3xr) dans H*(F./G2,Z), (c) des représentants quelconques 
dans H*(F,/G.,Z) de u,; et 2? ® u, sont forcément de carré nul. 


PROPOSITION 23.2. Dans les notations de (22.1), H*(Fs,Z) est addi- 
tivement et multiplicativement isomorphe a U H*(G. X S45, Z). 


Tors. H*(G.,Z) ne contenant que des éléments d’ordre 2, le résultat 
précédent montre que 


Tor. (H* (F.,/G2, Z), H* (G2, Z)) =0; 
dans l’algébre spectrale sur Z de la fibration (F., F,/G2,G2,) on a donc 
(23.3) E, = H*(F,/G,.,Z) H*(G., Z) =U A(us) H*(G,, Z). 


Le plus petit degré-base > 0 est 8, les différentielles d, (2=r=7), sont 
done nulles; par ailleurs H*(G.,Z) est engendré par deux éléments 93, yu: 
d’aprés (17.2); les différentielles d, (r= 8), sont évidemment nulles sur 
ys), de plus elles annulent aussi x,?(1 © y,,)car = 0; ainsi finale- 
ment d,—0 pour tout r= 2, FH, est isomorphe 4 F, et ses coefficients de 
torsion sont égaux a 2, 3, ou 6; en explicitant (23.3) on obtient: 


= UE, = =0 
= = Z,, 


par conséquent : 


(23. 4) H®(F,,Z) =H" (F,Z)=0;  H(F,,Z) 
Soit Hi(F,,Z) tel que 
(2’;) = 9;, (j = 3, 11, inclusion de G, dans F,), 


y; étant le générateur de H*(G.,Z) mentionné dans (17.2). Les égalités 
(23.4) montrent que 


340 


GROUPES DE LIE COMPACTS CONNEXES. 341 


et aussi que 2’,,” = 0 car si cet élément était non nul, il devrait étre d’ordre 
deux; 2’, et z’,, engendrent done une sous-algébre isomorphe 4 H*(G2,Z), 
et ’isomorphie additive et multiplicative de H*(F,,Z) avec 


U H*(6, x Sis, Z) 
résulte de (7. 4). 


Remarque. Nous avons vu que dans V’algébre spectrale sur Z de la 
fibration F,/G2, G2, d) 


Ee = = H*(F,/G,, Z) ® H*(G,, Z) 


par conséquent, G, est totalement non homologue 4 zéro dans F, relativement 
4 la cohomologie entiére. 


THE INSTITUTE FOR ADVANCED STUDY. 
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ON A PROBLEM OF PARTITIONS ILI.* 


By ALFreD Braver and B. M. SEELBINDER. 


A number of years ago, the first of the authors studied together with 
I. Schur the following problem of Frobenius: Given k relatively prime positive 


integers find bounds -,a,) such that the linear 
Diophantine equation 


always has solutions in positive integers 21, 2, for n > F(di, do,* dx). 
In a paper appearing in vol. 64 (1942), pp. 299-312 of this JouRNAL, 


the following results were obtained: Let (1,d2),° dx 
= @2,° * *, =1 be the greatest common divisors. Then 

is such a bound F'(a,, a2,- - +,a,). For k =2 this bound becomes T = a,a, 


and it is the best possible bound. But T is not, in general, the best bound. 
For instance, if a, d2—=m-+1, a4,—m-+2 are three consecutive 
integers and m is even, then 7(m, m + 2, m + 1) = m(m + 2)/2 + 2(m + 1) 
= (m? + 6m + 4) /2 is the best possible bound, while , 
min {7 (m,m + 1, m + 2), m + 2,m+1),T(m +1, m+ 2,m)} 
=m? + 2m +2 
does not give the best bound for odd m. Here (m? + 5m + 4)/2 is the best 
bound, since for the case of & consecutive integers _— 
U(m,m + 1, m + 2,---,m+k—1) 
— [(m + k*—3)/(k—1)]m + (#—k—2)/2 
was shown to be the best bound.’ Moreover, it was proved that T is the best 
bound if the integers 2«/dx are representable in the form 


(2) Ax/ dx ( AyYxv) /Ax-1 


* Received August 14, 1953. 
* Loe. cit. Theorem 7, p. 310; on p. 301, 1. 6, the factor m is missing in the formula 
for U. On p. 306, 1. 20, the summation should extend to « instead of k. 
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with yx,=0 for x =3,4,---,k. For k=3 the condition is not only 
sufficient but also necessary in order that 7 be the best bound. It was stated 
(loc. cit., p. 301) that the condition (2) is not necessarily for k > 3, but this 
is not correct. 

It will be proved in this paper that the condition (2) is necessary for 
every & in order that T be the best bound. Moreover, if (2) is not satisfied, 
then T* = T —min(q, +, ax) = +, is a smaller bound 
for the solvability of (1) in positive integers. First we prove the following: 


Lemma. Let +, a, be relatively prime positive integers. Then 
every integer m is representable in at least one of the forms 


(3) m= + dete +° ++ > 0 for «x —1,2,---,k), 
(4) m=T — — > — (te 20 for —1,2,:--,k). 


Proof. Since (dx, —1, the congruence a,z,==m (mod dx-,) has a 
solution such that 0 < 2, dy,. We set m= aye, + Next con- 
sider the congruence 
(5) (mod dy2). 


Because dy. = (dy, * *,@x-2) and = * = (dx-2, 
we can divide (5) by d,_, and obtain 


(6) = Jr» (mod 


Now (dx-1/dx-1, = 1, hence (6) and (5) have a solution 2, such 
that 0 < S Hence we may set m = + + 
We continue in this way and obtain by induction 


(7) M = + + + digi, 
where 


If gi > 0, then it follows from (7) that m is representable in form (3), since 
d, = a,. Assume now that 9,0. Let g, =—h. Adding and subtracting 


T in equation (7) we obtain 


th 


nu 
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or, replacing d, by a, 
m= T— a,h (d1/dz 22) a3(d./ds — 23) Ay dy, 


Since all the parentheses are non-negative by (8), it follows that m is 
representable in the form (4). 


THEOREM 1. Jf at least one of the integers ax/de (x =3,4,: 
is not representable in the form 


K-1 
(9) ax/ ax => = AyYxy/An-1 with 0, 
p=1 


then -,a,) is not the smallest bound. 


Proof. For k =3 this was proved loc. cit. Let us assume that the 
theorem is true for /—1. Let x be the smallest integer for which (9) does 
not hold. We have to consider two cases. 


The casex<k. Set 
(10) T'« = T (a;/dx, ax/dx) 
+ + Axdx+/ dx) 


It follows from the inductive assumption that 7, is not the smallest bound. 
Hence the equation 


has a positive solution 2,,%2,- + -,2. On the other hand, by (10), 


(12) T= T (a, de,- +, ax) 
= + Ans des + + Ay. dy1/ Ax. 
Hence 
T = 0,0, + aor, + 


by (11). Since are positive integers, T is not 
the best bound. 


The case xk. Since a, cannot be represented in the form (9), the 
number 7',.; — a; cannot be written in the form 


y=1 


k-1 
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Therefore, by the Lemma, 7, — ax, can be represented in the form 7, — a, 
integers. Hence, by (12), 

Choosing x; = 2d,_, we obtain the representation for T. Hence T is not the 


best bound. 


THEOREM 2. If T is not the best bound, then the best bound is less 
than or equal to T +, a,) = T*. 


Proof. We can formulate the Lemma in the following way: If for an 
integer m the number 7 —™m cannot be represented in the form 


(13) T — m = + + + = 0), 
then m can be represented in the form (3). Let r be any of the numbers 
1, 2,° We set 7—r=R. Since 

r< min(q,, ax), 


r= T—R cannot be represented in the form (13) and it follows that R 
can be represented in the form (3). Hence all of the numbers greater than 
T* can be represented in the form (3). 
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FOOTNOTE TO A RECENT PAPER.* 


By ANDRE WEIL. 


In a recent paper, L. Carlitz [1] has given formulas for the number of 
solutions of special pairs of equations F =a, G = b, where F, G are quadratic 
forms over a field & with gq elements, g being odd. More complete results 
may be obtained more briefly by the same method. Observe first that, if N 
is the number of solutions of / =a, G =), and if N’, N” are the numbers of 
solutions of F = G = 0 and of F — at? = G— bt? = 0, respectively, where ¢ 
is a variable not occurring in F, G, then N = (¢q—1)7*N” —N’; s0 it is 
enough to consider the homogeneous case F = G = 0, where F’, G are quadratic 
forms in any number of variables 2, 2%,- - *,2%,, which may of course be 
assumed to be linearly independent. By det(F), if F = > a,a,z;, we under- 
stand the determinant of the coefficients aj; We put 


$(u,v) = det(uF + vG) ; 


this is a form of degree n + 1 in u, v, which may be identically 0. 

Let y be any non-trivial character of the additive group of the elements 
in k; let x be the character of order 2 of the multiplicative group of non-zero 
elements in k, i. e. x(x) = 1 or —1 according as z is or is not a square in k, 
and x(0) 0. Then the Gaussian sum g = > x(x)y(2) satisfies g? = x(— 1)q. 
Carlitz’s method depends upon the following well-known and elementary fact: 
if the quadratic form H(2,-+-,%,) is equivalent over k to the form 


> cyi?, where the c; are ~0 and 0S then 
i=0 


so that in particular, if det(H) 0 and therefore m =n, the left-hand side 
has the value x[det(H) ]g”*?. 


Now, calling N the number of solutions of F = G = 0, we observe with 
Carlitz that we have 


& )y(vG) = 9? (Ly(uF + vG)), 


* Received November 27, 1953. 
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and evaluate the last sum by the above formula. For u =v = 0, we get 
For $(u, v) we get x[¢(u, v) ]g"**. Let now bpu — apv, for 1 =p Sr, 
be all the essentially distinct linear forms with coefficients in k which divide 
¢(u, v) ; two linear forms which differ only by a constant factor are not to be 
reckoned as essentially different. If ¢ is identically 0 we have r—q +1, 
otherwise r= n+ 1. Let mp + 1 be the rank of Hp = apF -- bpG; let Hp be 
equivalent over k to p> Cpiyi', Where 0 [1S mp, and put Cp = CpoCp1° * * Com). 


Then we get 
N= + + 
p 


where § and the 7p are defined by 


S= xlo(u,v)l, 
(u,v) 40 to 
Clearly the latter sum is 0 if mp is even, and has the value (¢ —1) x(Cp) if 
mp is odd. In the former sum, the restriction ¢(u, v) #0 may be removed, 
as x(0) = 0; replacing u,v in it by ut, vt changes S into x(t"*')S, so that § 
is 0 if n is even. If n is odd, it is clear that we have S = N — q?, where N 
is the number of solutions of t? = ¢(u, v) and is related in an obvious manner 
to the number of points on the hyperelliptic curve y? = ¢(2,1). Specializing 
this to the case considered by Carlitz, one gets all his results immediately. 
For a non-singular variety V in a projective space over a field k with q 
elements, I have defined ([2], p. 507) a zeta-function Z(U) by the formula 


Dlog Z(U) N,U", (D =d/dU), 


where N, is the number of points on V with coordinates in the extension 
k, of degree v of &k. Now, for the variety defined by the pair of quadratic 
equations F — G —0 to be non-singular, it is necessary and sufficient that 
(u,v) should not be identically 0 and should have no multiple linear factors 
in any extension of k (i.e., that its discriminant should be #0). If that 
is so, then, with the same notations as above, mp cannot have any other value 
than n—1. If therefore n is odd, the Tp are all 0. The curve y* = ¢(zg, 1) 
has here the genus (n —1)/2, and its zeta-function is known to be of the 


form 


Z(U) — TT (1— aU) 
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where the a, are of absolute value q%. Then the above formulas give, for 
the zeta-function Z(U) of the variety F = G = 0, the value 


n-2 n-1 


this seems to suggest some relationship between the intrinsic properties of 
the variety F — G=—0 and those of the curve y7—¢(2,1). On the other 
hand, one finds, when m is even: 


n-2 
Z(U) II (1 q’U)- II (1 (—- 1) 
h=0 


where the d, and e, are as follows. The d) are the degrees of all the irreducible 
factors @\(u, v) into which (u,v) splits up over the field &; and, if ky is 
the extension of & of degree A, and byw—ayv is a linear factor of (wu, v) 
in ky, e is 1 or —1 according as a,F + b)G@ is or is not equivalent over ky 
to a sum of n squares. 

The same method still yields interesting results if one applies it to any 
number of homogeneous quadratic equations -:-—F,—0, provided 
the quadratic forms F’,,- --,F, contain only the squares of the variables 
(this is the assumption made by Carlitz in the case r= 2, unnecessarily as 
we have shown). It gives a relation between the number of solutions of 
that system and the numbers of solutions of certain equations of the type 
where the right-hand side is a product of an even 
number of linear forms. In the case of a non-singular variety defined by 
three quadratic equations of the above form, one gets in this manner a fairly 
simple formula for the zeta-function. 

The above results provide additional evidence concerning the conjectures 
on the zeta-functions of non-singular varieties formulated in my article [2]; 
for a comparison with the topological properties of varieties defined by pairs 
of quadratic equations F = G0 over complex numbers, results of Hirze- 
bruch (still unpublished) are now available, which include the calculation of 
the Betti numbers of all non-singular complete intersections of hypersurfaces 
in a projective space. Since somewhat misleading statements have crept into 
the recent literature on this subject, it seems worthwhile to emphasize that 
all the evidence on hand (including some which is still unpublished) fully 
confirms those conjectures, not only as to the rationality of the zeta-functions 
but also as to the existence of the functional equation and the relationship 
with Betti numbers in the classical case. 

Naturally, one may also attach a function Z(U), defined by the same 


h=0 p=1 
J 
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formula as above, to varieties with singular points, to varieties in affine 
spaces, etc., and it seems very likely that all functions defined in this manner 
are still rational ; at least this is so in all the examples known to me at present 
where they can be calculated explicitly. However, the most trivial examples 
(e.g., that of a quadratic cone in a 3-dimensional projective space) show 
that such functions need not satisfy a functional equation of the kind which 
is typical for zeta-functions, so that it is not altogether proper to call them by 
that name. 


UNIVERSITY OF CHICAGO. 
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ON THE LOCAL BEHAVIOR OF SOLUTIONS OF NON- 
PARABOLIC PARTIAL DIFFERENTIAL EQUATIONS. 


II. The Uniqueness of the Green Singularity.* 


By Puitip HARTMAN and AUREL WINTNER. 


1. Under the assumption that the coefficients of the partial differential 
equation 
(1) Ure + Uyy + + y) Uy + y)u=0 
are analytic functions of (z,y) for small 2? + y?, Bécher [1], pp. 462-464, 


has shown that if u—wu(,y) is a C? (hence, analytic) solution of (1) for 
small positive x? + y* and satisfies 


(2) u(x, y) 
as + y? 0, then 
(3) u(x, y) ~ Const. log (x? + y?)-? (Const. > 0). 


Bécher’s method depends on the use of the Green function belonging to the 
adjoint of (1). Thus, while his methods are applicable if d, e, f, instead of 
being analytic, are sufficiently differentiable, they cannot be used if it is only 
supposed that d, e, f are continuous. It will be shown, however, that his 
assertion is true under the latter assumption on d, e, f. The proof will 
depend on a device of Carleman [2], used in [5] for the study of local 
asymptotic properties of solutions of (1). 

It should be pointed out that the proof below is limited to elliptic partial 
differential equations in two independent variables, while Bécher’s arguments 
and assertions can be generalized to the case of equations involving n indepen- 
dent variables. 

The main theorem to be proved is as follows: 


(*) Let a(z,y), b(x,y), y), where ac-—b? > 0, be functions of 
class on 


(4) PR’, 
normalized by a(0, 0) = 1, b(0, 0) = 0, c(0,0) = 1. Let d(a, y), y), f(z, y) 


* Received December 23, 1953. 
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be continuous functions on (4). Let u—u(ax,y) be a function of class C? 
satisfying 

(5) + + + duz + euy + fu=0 

or, more generally, let u(x, y) be of class C1! on0 <a? +y?S and satisfy | 


(6) J (Bue + cry) dz — (atte + buy) dy = (duty + -+ fu) dedy 
J E 


for every simply connected set E bounded by a piecewise smooth Jordan curve 
Finally, as 2? + let (2) hold. Then (3) 
holds and, in fact, as x? + y?—> 0, 


(7) lim (x + iy) (Uy + tuz) = + ic, exists and c; — 0, 0. ( 
The assertion (7) means that, if z—rcos¢, y=—rsin ¢, 
= cos + Uy(x, y) = cor sin d + o(r*), 

as r—>0, uniformly in ¢. Thus, the quadrature 

R a 


y) =— (uz cos + uy sin ¢)dr + cos ¢, sin 
shows that (3) holds, as r—>0, with Const. = $c.( 0). 

Remark. It will be clear from the proof that (*) remains valid if the 
expression du, + eu, + fu in (5) or (6) is replaced by a continuous function 
of x, Y, Uz, Uy Which is majorized by a constant multiple of | wz | + | uy | for 
(x,y) on (4). 


2. Theorem (*) complements the following results on solutions 4 
=1u(z,y) of (5) or (6): Let the coefficients a, b, c, d, e, f satisfy the con- 
ditions of (*) and let w—vu(z,y) be a solution of (6) of class C* on the 
disk (4) satisfying 


(8m) u(x, y) =o0(1r™), asr—> 0, 
for some fixed integer m =1,2,---. Then 
(9m) | ty | = o(r™*) 


and, in fact, (7) can be replaced by the statement that 
(10) lim (x + ty)-™(uy + tuz) exists (r— 0); 


[5], pp. 449-450. Also, if w—u(z,y) is a solution of class C1 on 
0<2?+ and satisfies (9,,) for some integer m = 0, — 1, —2,--°, 


| ?P 
f 
i a( 
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a 
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then (10,,) holds. This last assertion was proved in [5], pp. 452-453, for 
m = 0 under the assumption that a=c=1, )=0 and f=0. It is easily 
seen that the proof (loc. cit.) remains valid for m = — 1, — 2,--- ; the argu- 
ments in [5], pp. 462-464, show that there is no loss of generality in assuming 
a=c=1, b=0 and the last part of the proof of (*) below shows that the 
same is true of the assumption f = 0. 

Curiously, the assertion that (10m) is a consequence of (9m) ts false for 
every m (—0,+1,---) if the assumption that a, b, c in (6) are of class 
C1 is weakened to the hypothesis that a, b, c are just continuous (even when 
d=e=f=0). In order to see this, consider (for small 2? +- y?) the binary 
elliptic system 
(11) Ve = buy + cuy, Vy = — Az — Duy, 
where 

9 
¢= (1—2°F/r) (1— F)4 
and F=—F(r) —1/logr or F(r) according as r= (z?+ = 0. 
Clearly, (11) is the Cauchy-Riemann system (Beltrami) belonging to the 
positive definite metric 


ds? (1 — dx* — 2 /r*)dady + (1— y°F’/r*) dy’. 


The system (11) for the unknowns uw, v is equivalent to the integro-differential 
equation 


(13) f (buz + cuy)dx — (auz + bu,)dy = 0 
J 
for wu alone. 


It was shown in [3], Appendix, that (11) has no solution u—u(z, y), 
v=v(z, y) of class C* in (4), with arbitrarily small R, for which the Jacobian 
A(u, v)/0(x,y) is not 0 at (x,y) = (0,0); the system (11) was first con- 
sidered by Lavrentieff [7], p. 420, in a similar connection. 

Since F(r) is continuous for 0=r<1 and F(0) —0, the functions 
a, b, c are continuous on (4) when R <1. Define p= p(r) by 

R 
—exp(— f F(r))4dr) or p(r) =0 according as r=0; 


so that, by the definition of F(r), 
(14) p(r) ~ const. r/(— log r)4, as r—> 0, _ (const. > 0) 


J 

4 

4 
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and p(r) has a continuous derivative given by 

(15) dp/dr =r"*p(r)(1— F(r) or dp/dr = 0 according as r= 0. 
For an integer k ~0, put 

(16;.) u(x, y) = p*(r) cos v(x, y) =— p*(r) sin kd, 


where t—rcos¢, y=rsing. It follows readily from (15) that if k —1, 
2,- - +, then (16) is a solution of (11) of class C1 on (4) (correspondingly, 
u(z,y) is a solution of (13)) and that if k —-—1,—2,---, then (16) isa 
solution of (11) of class Ct on 0 < + y? R* (correspondingly, u(z, y) 
is a solution of (13)). It is clear from (14) that the function u(z,y) in 
(16,) satisfies (8) if It is seen from 


Uz = kr“ p*[ (1 — F)4 cos ¢ cos kp + sin ¢ sin kd], 
Uy = kr p*[ (1 — F)4 sin ¢ cos kp — cos sin kd] 


and from (14), that (9;) also holds; in fact, both wu, and w, are majorized, 
as r—> 0, by constant multiples of *1/(— log r)#. But (10;,) does not hold. 
This proves the italicized statement for m= +1,+2,°:-. 

In order to prove the statement for m = 0, let F(r) be replaced by its 
negative, that is, in (12) and in the definition of p(r), let F(r) be —1/logr 
or 0 according asr=0. Then (14) is replaced by 


(14 bis) p(r) ~ const. r(— log r)3, as r > 0, * (const. > 0) 


while the formula in (15) for dp/dr remains valid when r>0. The func- 
tions (16;) are of class C1 and satisfy (11) on 0<2?+ y?= R? (corre- 
spondingly, u(z, y) is a solution of (13)); they are continuous at the origin 
if k= 1 (but are not of class C* on (4) unless k >1). The above formulae 
for uz, Uy are valid and show that u, and u, are majorized by constant multiples 
of r*1(— log r)#* and are, therefore, o(r*) as r—>0, if k—=1. Thus, the 
function w—u(z,y) in (16,) when &=—1 satisfies the condition (9m) 
corresponding to m0. On the other hand, (10,) does not hold. This 
completes the proof of the last italicized statement. 

It remains an open question whether or not (*) is true if the assumption 
of the C'-character of a, 6, c is weakened to the hypothesis that a, b, c are 
just continuous. (Note that, in the above examples, wu —— log p(r) is @ 
solution of (13) on 0 < 2?+ 9? < R? satisfying (2), and that (3) and (7) 
hold.) If (*) is false under the weaker assumption, there remains the ques- 
tion whether or not the “infinity” of wu is independent of the solution # 
satisfying (2). 
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8. Proor or (*): THE case f =0. Since ac—b?>0anda>0,c>0, 
standard procedures involving the use of conformal coordinates (Lichtenstein) 
show that there is no loss of generality in supposing that 


(17) a(z,y)=1, b(2,y)=0, = 1; 


cf. [5], pp. 460-464. Following Bécher [1], pp. 462-464, the proof of (*) 
will first be given for the case 


(18) f=0. 


The general case will be deduced from the case (18). The identities (17) 
and (18) reduce (6) to 


(19) f uydxz — u,dy -f f (duy + euz)dxdy. 

J 

It follows from Corollary 1 of Theorem 1 in [5], pp. 449-450, that the 
zeros of the gradient of w, that is, the zeros of the complex-valued function 


cannot cluster at any point (2, yo.) (0,0). Note that the condition 
U(%, Yo) = 0 of Theorem 1 of [5] need not be satisfied, since the integrand 
on the right of (19) is a homogeneous linear combination of wu, and wy 
(so that (19) is satisfied by the function u(z, y) —wu(%, Yo)). Thus the 
zeros of w are enumerable, at most. Hence (2) shows that there exist 
arbitrarily large constants c with the property that u(z,y) —c implies 
w(x,y) 0. It follows that, if such a number c is sufficiently large and if 
«=e, >0 is sufficiently small, the set of points z—2r-+ ty satisfying 
u(z,y) —=c is on the open ring «2? < 2?+ y? < R? and, consequently, con- 
sists of one or more disjoint Jordan curves of class C*. One of these Jordan 
curves, say L = JL,, surrounds the origin, by (2). On the other hand, 
u== u(x, y) cannot have a local maximum or minimum at any point (2p, yo) 
# (0,0); ef., e.g., [5], pp. 449-450. Hence, the locus u(2z, y) =c consists 
of the Jordan curve L = L, only. 
Clearly, u(x, y) —c on L means that, if z—«-+ iy, the expression 


wdz = (uydx — u,dy) + + uydy) 


reduces to — tds + idu — — u,ds, where u, is the derivative of u in the 
direction of the inward normal to L and s is the arc-length on J. The assump- 
tion (2) implies that u, = 0 on L. Thus, on 


(21) — wdz = u,ds, where up, = 0. 


(20) W = Uy + Wz, 
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Let S denote the ring-shaped domain bounded by LZ and the circle 
|z2|—=R. If (19) is applied to the domain EF = S, then 


(22) Unds = — f (uydx — u,dy) + Wdzdy, 


lel=R 
where 


(23) W = duz + ety. 

Let K denote a constant satisfying 

(24) je|SK on |2|SR. 
Then, by (20) and (23), 

(25) on |z|SR. 


Hence (22) gives 


(26) J unds <I, + KD, 
L 
where 
(27) n= f | wdz| and D, = (f | w | dady. 
lel=R “8 


The identity (19) implies that, in terms of the notations £=é-+ ty, 
w(é, 


(28) 


(z — ¢)- (z — £)"*Wdady 


if £ is in 8; cf. formula (17,) in [5], p. 455. Hence, (21) and (25) show § 
that 


L 
where 
(30) | wdz| and | w| dedy. 
|jz|=R 


Let (29) be integrated with respect to dfdy over 8. Then the inequality i 


(31) Sf < <2) 


80 
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shows that 27D, = 4rRI, + tok f Unds +4rKRD,. If this is combined 
with (26), it follows that L 


(32) (2 — 8KR)D, = 8RI,. 
Let R be so small that 
(33) 4KR <1. 


Since J, does not depend on the inner boundary ZL of 8S, it is seen from (27) 
and (32) that w is absolutely integrable over | z| < R, 


(34) D <e, where D= ff | w | dedy. 
lelSR 
Since the level curve L = L,: u(z, y) =c shrinks to the origin as c 00, 
an immediate consequence of (22) and (34) is 


(35) lim f Unds = 2ae, 
L 


where 2rc, = — (uydz — + f f Wadrdy. 


lel=R 
The relation (34) will now be improved to 
(36) w(f) as £0. 


To this end, let z = z, be a point of 8. If (29) is multiplied by | Z oni |-*d&dy 
and the result integrated over S, it follows from the inequality 


and from (31) that 


2rD,. = 8rRI, | —€ Unds 8rKRD., 
L 


if z, is renamed ¢. In view of (33), this implies that 


const. (I, + f | unds) ; 
L 


so that, by (29), 
| w(£)| S (14+ K const.) (22 + |z—£ unds). 
L 


Since u, = 0, there exists a point z= <* on L with the property that the 


= 
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last line integral is | z* —£ |-1 times the line integral on the left of (26), 
Hence (26) gives 


f S | KD,), 
L 


and so 
| w(£)| S (14+ K const.) + | 2* (1, + KD,)}. 


If the constant c defining the level curve 1. = L,: u(a,y) =c tends to o, 
then 2* tends to 0 and D, to the finite limit D; cf. (34). Hence the last 
formula line implies (36). 

The estimate (36) and the definition (30) of D, show that D, 
= O(log |¢|-*) as Hence (21), (25) and (28) give 


(37) = 0(1) + + O(log | 
L 


as £—0. Since u,=0, applications of the mean value theorem to the 
real and imaginary parts of the last line integral show that there exist two 
points + 2° = 2° + on L such that the last integral can be 
written in the form 


L L 
If the constant c defining L = L,: u(r, y) =c tends to o, then both points 
Zo, 2° of L tend to 0 and so, by (35), 


(38) f (2 — unds — (c—>0). 
L 


Since w(¢) and the constants implicit in the O-terms in (37) are independent F 


of c, it follows from (37) and (38) that, as £0, 
w(t) = + O(log | g!-*). 


Consequently, the limit in (7) exists. It is purely imaginary, since the 
number cz, defined in (35), is real. 


It remains to show that c,40. If c,—0, then (9) holds with m —0, 


and so (10) holds with m0. In other words, uz and u, tend to (finite) 


limits as (z,y)— (0,0); in particular, uw, and wu, are bounded near ff 


(x,y) = (0,0). But this contradicts the assumption (2). Consequently, 
C20. This proves (*) in the case (18). 
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4, Proor or (*): THE GENERAL CASE. For the general case, the 
following lemma will be needed: 


Lemma. If R> 0 is sufficiently small, then, under the assumptions of 
(*) on the functions a, b, c, d, e, f, there exists a function u=U(a2,y) of 
class C' on (4) satisfying 
(39) U(0,0) #0 


and the relation (6) for every domain E bounded by a piecewise smooth 
Jordan curve J in (4). 


This lemma is contained in [8]. Another method of proof of this lemma 
will be indicated. It can be supposed, without loss of generality, that (17) 
holds, so that (6) reduces to 


(40) uydx — u,dy = ff (duz + euy + fu) drdy. 
‘J E 
By standard methods of successive approximations (Korn, Lichtenstein), it 
can be shown that if R > 0 is sufficiently small and if u(# cos ¢, R sin d) are 
arbitrarily assigned boundary values of class C* on x? + y? = R?, then the 
boundary value problem associated with the integro-differential equation (40) 
has a (unique) solution wu = u(za,y) of class C' on (4). Furthermore, there 
exists a constant C with the property that if the 0-th approximation is the 
harmonic function w(2,y) on 2?-+ y? < R? which assumes the assigned 
boundary values on 2 + y? = R?, then | u(x, y) —uo(a,y)| SCR on (4); 
the constant C is independent of R and the boundary values u(R cos ¢, FR sin ¢), 
if (without loss of generality) the latter do not exceed 1 in absolute value. 
In particular, if the boundary values are identically 1 (that is, w(rcos ¢, 
Rsin ) =1), then w(x, y) =1 and so | u(a,y) —1| SCR on (4). If 
U(a,y) is identified with the corresponding solution u(z,y) and if R>0 
is sufficiently small, then it is seen that (39) holds. 
The general case of (*) can be deduced from its special case (18) by the 
variation of constants, employed by Bécher for the same purpose. Suppose 
that (6) is of the form (40). For small 2? + y?0, define v = v(z, y) by 


(41) u(x, y) = U(x, y)v(2,y), 


where u = u(z, y) is the given solution of (40) satisfying (2) and u = U(za, y) 
is a solution of (40), as supplied by the Lemma. Then (40) and (41) imply 
that v(2, y) satisfies 


(42) f vydz — v,dy = ff (d*v, + e*v,) dady, 


E 


y 
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if Z is any simply connected domain bounded by a piecewise smooth Jordan 
curve J in a sufficiently small punctured disk 0 < 2+ y*? Se and d*, e* 
are the continuous functions d* = d+ 2U,/U, e* =e-+2U,/U. This is 
clear if U, v are of class C? on 0 << 2 + y? =@ and otherwise can be deduced 
from the form of the Green’s identity given in the lemma of [6], p. 761. 

Since (2) and (39) imply that | v(2z, y)| > as 2? + y’— 0, it follows 
from (41) and the case (18) of (*) that, as 7* + y*?— 0, 


(43) lim(x + iy) (v, + wz) =C, + iC, exists and C, = 0, C, 
and, therefore, that 
(44) v(x, y) ~ const. log (x? + y?)"* (const. ~ 0). 


Since U is of class C* on (4) and since (39) holds, the assertion (7) follows 
from (41), (43) and (44). This completes the proof of (*). 


5. Theorem (*) is, in a certain sense, a uniqueness theorem. ‘The 
corresponding existence theorem is the following: 


(**) Let a, b, c, d, e, f satisfy the conditions of (*). If R>0 1% 
sufficiently small, then there exists a function u=u(az,y) of class C* on 
0<2?+y= R?’, satisfying (2) and the relation (6) for every simply 
connected region E bounded by a piecewise smooth Jordan curve J in 
PR’. 


When the coefficients of (5) are analytic, the assertion of (**) is con- 
tained in a result proved by Hilbert around 1900 (before the development of 
his theory of integral equations) ; cf. [9], p. 570 and [4], pp. 100-101. 

Theorem (**) is contained in a result of Lichtenstein [8] dealing with 
the existence of a Green’s function for the integro-differential equation (6). 
His proof depends on the theory of integral equations. The assertion (**) 
can also be proved by the method of successive approximations, after an 
application of a variation of constants, as follows: 

Let (6) be of the form (40) and define a new dependent variable 
v=v(z,y) by 

u=v log(z? + y’). 


Then the identity (40) is equivalent to the identity 


(45) f vydz — v,dy = ff (d*v, + e*v, + f*v) dzxdy, 
J 


E 


] 
[ 
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where 
d* = d + + y°)log(2* + y’)}, e* =e + 4y/{(2? + y*)log(a? + y’)}, 
f* =f + 2(ad + ye)/{(2 + y*)log(z? + y’)}. 


The fact that d*, e*, f* are majorized by Const./{(a? + y?)4 log(a? + y?)"*} 
makes possible the application of the standard method of successive approxi- 
mations to (45), proving that if R > 0 is sufficiently small and if v(R cos 4, 
Rsin ¢) are arbitrarily assigned boundary values of class C' on 2? + y? = R’, 
then the boundary value problem associated with (45) has a solution 
v=v(z,y) which is continuous on (4) and of class C1 on 0 << 2?+y?S R? 
(in fact, such that v,/log(a*-+ y’) and v,/log(z?-+ y*) are bounded as 
z’+y*—0). Finally, as in the proof the Lemma above, v(0,0) #0 if R 
is sufficiently small and the boundary conditions are suitably chosen, for 
example, if v(R cos ¢, R sin ¢) =1. 


THE JOHNS HOPKINS UNIVERSITY. 


REFERENCES. 


[1] M. Bécher, “Singular points of functions which satisfy partial differential equa- 
tions of the elliptic type,” Bulletin of the American Mathematical Society, 
vol. 9 (1903), pp. 455-465. : 

[2] T. Carleman, “ Sur les systémes linéaires aux dérivées partielles du premier ordre & 
deux variables,” Comptes Rendus, vol. 197 (1933), pp. 471-474. 

[3] S. 8S. Chern, P. Hartman and A. Wintner, “On isothermic coordinates,” to appear 
in the Commentarii Mathematici Helvetici. 

[4] J. Hadamard, “ Le probléme de Cauchy et les équations aux dérivées partielles 
linéaires hyperboliques,” Paris, 1932. 

[5] P. Hartman and A. Wintner, “On the local behavior of solutions of non-parabolic 
partial differential equations,’ American Journal of Mathematics, vol. 75 
(1953), pp. 449-476. 

and A. Wintner, “On the fundamental equations of differential geometry,” 
American Journal of Mathematics, vol. 72 (1950), pp. 757-774. 


[6] 


[7] M. Lavrentieff, “Sur une classe de représentations continues,” Recueil Mathé- 
matique, vol. 42 (1935), pp. 407-424. 

[8] L. Lichtenstein, “ Neue Untersuchungen in der Theorie der linearen partiellen 
Differentialgleichungen zweiter Ordnung vom elliptischen Typus,” Bulletin 
de l’Académie Polonaise des Sciences et des Lettres (1931), pp. 571-598. 

[9] A. Sommerfeld, Article II A7e (1900) in the Bncyklopidie der mathematischen 

Wissenschaften. 


361 


SOME APPLICATIONS OF THE LOCAL UNIFORMIZATION 
THEOREM.* 


By Serce LAne. 


1. Introduction. Let K be a function field over a constant field k. 
We shall always assume that K/k is regular: & is algebraically closed in K 
and K/k is separably generated. If K —k(a,- + -,%,) =k(x), we may 
view (x) as the generic point (over &) of a variety V, which is then called 
a model of K. If p is a place of K/k, i.e. a place which is an isomorphism, 
or identity, on &, and if we put %—2;(p) ~o, then (#) is a point of V 
and we say that () is at the center of p. 

Zariski’s Theorem of Local Uniformization [10] states that given a place 
p of K/k (characteristic zero) it is always possible to find a model of the 
field for which the point at the center of p is non singular. 

In the classical case where k& is the complex numbers, this implies that 
the place has a certain neighborhood. 

It is our purpose to show that it is not the algebraic closure of the 
complex numbers which is essential for the existence of this neighborhood, 
but rather its topological completeness. 

More precisely: Suppose that the constant field & of K-is complete under 
a valuation, and of characteristic zero. We say that a place of K/k is [ 
rational over k if it is k-valued. Let 9 consist of all rational places of K/k, | 
and assume that {Yt is not empty. Mt may be called the rational Riemann 
manifold of K/k, but as we shall deal only with rational places, we shall 
call it the Riemann manifold. We topologize Mt (as it is done classically) 7 
by giving it the weakest topology such that the map c—>2(p) ek, is con- 7 
tinuous for each z in K. : 

One can then prove that § contains infinitely many places, and in fact F 
enough places to distinguish between elements of K. If k& is locally compact, 
then 2 is a compact space, and in the case where K has dimension 1 over k 
(the case of curves) then Mt is locally homeomorphic to & itself. Our proofs 
are closely related to Chevalley’s [2]. 

Kawada [4] has also considered the abstract manifold in the case of 
curves, but restricted himself to the case where k is algebraically closed. In 


* Received March 16, 1953. 
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that case, the complex numbers are the only locally compact field. If however 
we do not assume & algebraically closed, then we can include the real numbers, 
and p-adic fields. In fact, in the case of curves, we can also treat the power 
series over finite fields because the uniformization theorem is true irrespective 
of the characteristic. 

If & is the real numbers, then Qt consists of a finite number of com- 
ponents, each homeomorphic to a circle. It is a classical theorem of Harnack 
[3] that the number of these components is at most equal to g + 1, where g 
is the genus of K. We shall give a simple new proof of this theorem, based 
on the Riemann-Roch Theorem. 


2. The Riemann manifold. We let & be a field complete under a rank 
one valuation, and do not make any restrictions on the characteristic. We 
view the value group as a subgroup of the positive reals. If aek, we let 
|a| denote its value. 

We suppose that K/k is a function field with the properties stated in 
the introduction. We assume that there exists at least one rational place 
of K/k, and let Yt be the set of all such places. For each ze K, we let T, 
be the “ k-sphere ” &,. obtained from & by the formal adjunction of 0, and 
topologized in the obvious way: Neighborhoods of oo are the outsides of circles 
around 0. We let [=—J]T, be the Cartesian product taken over all ze K, 


with the product topology. 

Let pe Mt. Then p maps K into k,. Let ® be the map of Mt into © 
defined by and denote by MY. Then is 
obviously 1-1. 

We topologize 9’ as a subspace of I, and give Mt the weakest topology 
so that ® is continuous. In other words, the open sets of M consist of the 
sets 6-1(U) where U is open in Di’ 

The following theorem and its proof are identical with Chevalley’s [2], 
except for the fact that only the completeness of & is used, and not its 
algebraic closure. 


THEOREM 1. i’ is closed in I. 


Proof. Let w=(-+-:,Uz,:*-) be in the closure of MY. Let 
o={xeK | u,4«}. We shall prove that o is a valuation ring (or v-ring) 
whose induced place is such that z(p) = up. 


We first note that k C 0, because a(p) —<a for all ack, and all pe M. 
Let z and y lie in 0. Then uz, uy. Since wu is a limit point of 
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elements in Mv’, there exists a prime p e Mt such that x(p), y(p), and (x + y)(p) 
are arbitrarily close to uz, Uy, and Us respectively. For such p, we see 
that 2(p) and y(p) eo, whence («+ y)(p) =2(p) +y(p) Hence 
a-+yeo. Similarly, c—y and zy lie in 0, which is therefore a ring. 

Furthermore, the map x—u, is a homomorphism of o into & and is 
identity on k. This follows from a continuity argument as above. 

Finally, 0 is a v-ring. Suppose Then u;=o. Let 
There exists a prime pe Mt such that x(p) is close to uz and y(p) is close 
to u,. But «(p) close to uz means | x(p)! is very large. Hence | y(p)| is 
very small. Hence uy —0,s0 yeo. Thus o is a v-ring. 

Since x— u, is a homomorphism of o into k, and since 0 is a v-ring, 
it follows that x — u, is a place of K, which is rational because u,ek,. Our 
theorem is thereby proved. 

In particular, if & is locally compact, then &, is the ordinary one-point 
compactification, and k,, is therefore compact. By Tychonoff’s theorem, 
ris compact. Since MW’ is closed in T, it is also compact. We obtain there- 
fore the following. 


Corotuary. k is locally compact, then Wt 1s compact. 


We shall now investigate more closely the special one-dimensional case. 
For the rest of this section, we suppose that K/k has transcendence degree 1, 
and is regular. For the convenience of the reader, we reproduce a proof of 
the local uniformization theorem. 

Let p be a prime of K, whose residue class field is separable over k. 
Let ze K be a separating variable, such that ordyz 1. (It is well known 
that such an z can always be found. See for instance [1], Ch. 1%, 4.) 
If p is the rational prime induced by p on k(x), then p is unramified in K. 
Hence there exists a generator y of K over k(x) which is integral at p and 
whose local p-different is not =0 (mod). This means: If f(z, y) =0 is 
the equation for x and y over k, and if we put T=-2(p), y=y/(p) then 
fy(@, 9) ~0. Hence (#,%) is non-singular at the center of p as was to be 
shown. 

We recall one more useful’ theorem from the theory of curves. 

Let K =k(z,y) be the function field of a curve, and let f(z, y) =0 
be its equation over k. Let (z,%) be a non-singular point on the curve, 80 
that f(z, 7) =0 but Then (#,%) is at the center of exactly 
one place of K/k. In other words, there exists one and only place p of K/k 
such that Z—-2(p), 7—y(p). Furthermore, the residue class field K of K 
under p is naturally isomorphic to k(Z, 7). 
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For a proof, see Artin [1], Ch. 16, 2. 

In particular, if (,%) is a non-singular rational point (that is if its 
coordinates lie in &), then the place above it is also rational. 

Let p be a given prime of Qt. We shall find a neighborhood for p. 
Let z be a local uniformizing parameter at p, separating for K/k. In other 
words, assume ord, « = 1, and K/k(x) is separable. According to the local 
uniformizing theorem, we can find a model K = k(2,y) such that the point 
(, 7) at the center of p is non-singular: f,(z, 7) By multiplying x and 
y by a constant aek having sufficiently small value, we may assume that 
éand gj are integral in k. This will not affect the non-singularity of the point. 
We may also assume that f has integral coefficients by clearing denominators 
if necessary. These preparations are necessary in order to be able to apply 
the Newton approximation method later. (Cf. [5].) They apply of course 
only if the valuation of & is non-archimedean. 

Split the polynomial /(%, Y) in the algebraic closure & of k. Then y is a 
root of multiplicity 1, so we have f(#, Y) = (Y — 9)(Y — y,)%-- > (Y — yr). 
We recall that the roots of an algebraic equation are continuous functions of 
the coefficients ([1], Ch. 2, 6). Given a neighborhood M of ¥ in & not con- 
taining ¥, there exists a neighborhood N, of in k such that for all 
values e NV,, the equation f(é, Y) —0 has exactly one root in M. 

Let M= Mk. There exists a neighborhood N C N, of a such that 
for any value ée N, we can refine ¥ to a root » of f(é,Y) with ye M. This 
can be done with the Newton approximation method, as given in [5]. Since 
the partial derivative is continuous, we also have f,(é,4) ~0. Hence there 
exists a unique rational place q of K such that x(q) = é and y(q) =7. This 
proves that the map q—> x(q) is 1-1 for all q such that (x(q), y(q)) lies in 
the neighborhood (NV, 1/) of (%,%). By what has been said above, this 
latter condition is equivalent to the single condition that «(q) e N. 

The map is continuous. We may select N and M to be closed “ discs ” 
around ¢ and ¥ respectively, in which case the set of primes q mapping z and 
y into (N, M) is closed. 

If & is locally compact, this set of primes is compact. Hence the map 
is a homeomorphism. Summarizing, we have 


THEorEM 2. If & is locally compact, and if K/k is a regular extension 
of dimension one, then Mt is locally homeomorphic with k. Given a prime 
peM, and a separating local wniformizing parameter xe K at p, the map 
q—>2x(q) gives a homeomorphism of a certain neighborhood of p with a disc 
of center 0 in k. 
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If k —=R is the real numbers, then Qt is the union of a finite number 
of components, which have classically been called the real branches of the 
curve. It is obvious (and in fact can easily be proved) that each component 
of M is homeomorphic to a circle. (One uses the facts that it is compact, 
connected, and locally homeomorphic to real segments.) Harnack [3] has 
proved that the number of these branches cannot exceed g + 1, where g is 
the genus of the curve. We give below a proof depending on the Riemann- 
Roch theorem. 

We begin by a remark. Let B be one of the branches, and let re K bea 
function having no pole on B. We assert that x cannot have only a single 
zero on B. In other words, z has either no zero on B, or has at least two 
zeros, counting multiplicities. Indeed, if x has only a single zero p, then 
x(q) > 0 and x(q’) < 0 for q and q’ in a small neighborhood of p. Naively 
speaking, « changes sign as it runs through p. But B— {p} is connected. 
Since z gives a continuous function of B — {p}, it must therefore adopt the 
value 0 somewhere in B — {p}, contradiction. 

Now suppose there are g-+2 branches. Select one place on each 
branch, and let p,,- - -, Pgs2 be these places. Let q be a place of K of degree 2 
(a complex place). Then q does not lie on any one of the real branches. 


The divisor - Pgs2/q%t? has degree g + 2—2(g+1)=——g. By the 
Riemann-Roch theorem, there exists a function xe K which is multiple of 
this divisor. Thus x has g more zeros in addition to $,,- - -, Pg.2, and has 


no pole on any of the real branches. This implies that’ on at least two 
branches, x will have only a single zero, a contradiction. 

Harnack proved his theorem by taking a model of the curve, and 
assuming that the singularities were not too horrible. The classical proof 
is obtained by viewing the real branches as lying on the complex Riemann 
surface, by cutting the latter along these real branches, and using a sym- 
metry argument: the real branches consist precisely of those places on the 
surface which are invariant under the complex conjugate automorphism. 

Our proof is of course much more algebraic. In fact, it is very likely 
that one can define branches for a curve over an arbitrary real closed field, 
and that one can prove in that case also that there are at most g + 1 branches. 
As we have not been able to carry out all the details of this idea, we shall 
leave it as it stands for the moment. 

If one takes a model for the curve, K —k(z,y), then it is extremely 
difficult, if not impossible, to give a rigorous definition of a branch in the 
(z,y) plane without leaving the plane. For instance, we want to say that 
a lemniscate is one branch, but that two tangent circles are two branches. 
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The correct way of defining a branch in the (2, y) plane is to let it consist 
of the projection on this plane of one of the components B of Yt. This 
projection will be the set of all points (z(p),y(p)) with pe B. We note 
that this projection on the plane may not be connected (as in the case of a 
hyperbola, where the connection is at infinity), and may also intersect the 
projection of another branch. On the real manifold {, however, two branches 
can never intersect. 


3. Function fields over complete fields. In this section, we return to 
the general case, but assume that & has characteristic zero. It is again com- 
plete under a rank one valuation. We assume that & is contained in a 
universal domain of infinite degree of transcendence over k, and in general 
follow the conventions of Weil [8]. However, we do not distinguish between 
fields and abstract fields, as the context will always make our meaning clear. 

We say that an extension H/F of a field F is regular if every finitely 
generated intermediate field HZ’ with F C EH’ C # is regular over F. 

A subfield &, of & will be called admissible if it is dense in k, if k/k, is 
regular, and if &/k, has infinite degree of transcendence. 

Our main result is concerned with function fields over admissible fields. 


THEOREM 3. Let k, be admissible. Let K/k, be a finitely generated 
extension, regular over k,. There exists a rational place of K/k, tf and only 
if there exists an isomorphism of K into k which is identity on k,. If this is 
the case, then there exists infinitely many rational places. In fact, -given a 
finite set of quantities (z) = (4,° *,2%n) with ze K, and a finite set of 
polynomials g;(Z) ek,[Z]| such that g;(z) #0, there exists a rational place p 
of K/k, such that all 2; 2;:(p) are finite and such that g;(zZ) ~09. 


Proof. Suppose that there exists a rational place p of K/k,. By the 
local uniformization theorem, we can find a model of K whose center at p is 
non-singular. By a sufficiently general projection, we may assume that this 
model V is a non-singular hypersurface: K = -, y) where (2, y) 
satisfy the irreducible equation f(z,y) over are alge- 
braically independent over &,, and since V is non-singular, f,(Z, 9) ~ 0. 


By multiplying if necessary the elements 2;, y by an element ae k, having 
sufficiently small value, we may assume that <j, 9 are integral in k,. Namely 
we have (ax)(p) =a-2(p) because k, is fixed under p. Such a trans- 
formation does not affect the non-singularity of the point. In addition we 
May assume that f has integral coefficients, by clearing its denominators if 
necessary. 
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We can now apply the Newton approximation method as in [5]. Select 
& very close to 2, in k, but algebraically independent over k,. 
Our assumptions on & allow us to do this. Then y can be refined to a root + 
of f(é, Y) in k, and it is clear that &,(é, 7) is isomorphic to This 
proves the first part of our theorem. We note that we have a tremendous 
amount of freedom in the isomorphism of K into k. 

Assume now that k, C K C k. We must show that there exists rational 
places of K/k,. This will be done by an induction on the transcendence 
degree of K over k,. If this transcendence degree is 1, let K = k, (2, y), and 
let f(x,y) =0 be the irreducible equation for z and y over hy. Then 
fy(z, y) 0 because (2, y) is generic. As before we may assume that 2 and 
y are integral in & and that f has integral coefficients. We then use the 
Newton method in the reverse direction. We can select € in /, arbitrarily 
close to xz so that y can be refined to a root of f(é, ¥). This root will lie in k, 
and will be algebraic over &,, hence in k,. This procedure yields infinitely 
many points on the curve f(z, y) = 0, and almost all of them will be non- 
singular. We had pointed out that a non-singular point of a curve lies at 
the center of exactly one place and that the residue class field of this place 
is then generated by the coordinates of the point over the constant field. 
Since the coordinates obtained in our proof are rational, we have proved 
what we wanted. 

Our construction shows in addition that any finite set of quantities (2) 
will not be mapped into 0 or o by almost all the places. In particular, we 
can find places such that none of the quantities g;(z) are mapped into 0 or o. 
Furthermore, if we view (z) as a generic point, infinitely many of the places 
just constructed will induce specializations of (z) into k, which are arbi- 
trarily close to (z) itself. For a proof and an application of this, see [5]. 

It is now easy to complete the induction. Let k, C FE C K, where K/E 
has dimension 1. Let EF, be the algebraic closure of # in k. Then F£, is 
admissible. Let 1 = KE;. Then L may be viewed as a function field in one 
variable over the constant field #,. There exists a place p of Z rational over 
E,, and p induces a place on K/E which is E,-valued. Since K/E has 
dimension 1, the image K of K under p is finitely generated over H,, and 
consequently over k,. Since K has smaller dimension over k, than K we 
conclude by induction that there exists a place q of K which is rational over k,, 
and satisfying all the extra algebraic requirements that we desire. The com- 
posite of the places p and q is a rational place of K over k, satisfying all our 
conditions. This concludes our proof. 


Theorem 2 can be used to study function fields over admissible fields /; 
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which are interesting in themselves, such as the maximal unramified exten- 
sion of a p-adic field, or the convergent power series over a valuated field. It 
can also be extended to function fields over & itself. The fields &, then play 
the role of fields of definition. This is done as follows. 

Let K/k be a function field, and assume that there exists one rational 
place p of K/k. Let V be a hypersurface which is a model of K and is non- 
singular at the center of p. Let K — k(x) where (x) is a generic point of V. 
Let k, be an admissible subfield of & which contains a field of definition for 
V (i.e. all the coefficients of the irreducible polynomial f(X) vanishing on 
(c)), and all the values x;(p). Such a field may be obtained by adjoining 
to an admissible field the finite set of quantities just mentioned, and taking 
the relative algebraic closure in k. 

Let K, =k,(x). By a procedure similar to the one carried out in the 
first part of the proof of Theorem 3, there exists an isomorphism of K, into 
k which is identity on k,. Hence we can find infinitely many places q of K, 
which are rational over k,. All that remains to be shown is that such places 
extend in a natural way to rational places of K over k. 

The field K, is linearly disjoint from & over k, by Weil [8], Ch. I, 6, 
Th. 3. Our statement now follows from the following general lemma. 


LemMaA. Let K, L be two extensions of a field F and suppose that they 
are linearly disjoint over F. Let q be a rational place of K/F. Then q 
ettends uniquely to a rational place of KL/L. 


Proof. Wet Kq be the ring of elements of K which are finite under q. 
The specialization on Ky induced by q extends uniquely to a specialization of 
the ring L[ Kg] generated over L by the elements of Ky. This is well known 
for a finite set of quantities, and the generalization to an infinite set is 
immediate. 

Extend this specialization to a place q of KZ over L. We contend that 
it is uniquely determined, and that it is L-valued. 

An element of KZ can be written 


with a;,b;e K, and LZ, and with the 8; linearly independent over F. 
Say 6; has maximal value under q, or equivalently under Q. Divide 
numerator and denominator by b,. We get 


where each a;/b, and 6;/b, is finite under Q, and lies in F. Since O is identity 


v4 


1 
) 

I- 
E 
is 
ne & 
er 
as 
ad 
we 
ur 
ky 


370 SERGE LANG. 


on L, and since the ; are linearly independent over F, the denominator is 
now finite under Q, and is not =0 (mod). The numerator is finite, and 
clearly both are Z-valued. This concludes our proof. 


We may summarize the preceding results: 


TueEorEeM 4. Let K/k be a function field over a field k complete under 
a valuation, and of characteristic zero. Assume that there exists one rational 
place of K/k. Then there exists infinitely many such places. In fact, given 
a finite set of algebraic conditions as in Theorem 3, we can find a rational 
place satisfying these conditions. 


In order to take care of the algebraic conditions on a finite set of elements 
(z) in K, we must be sure that all the elements of this set will lie in the 
field K,. As they can be expressed as rational functions of (x) with a finite 
number of coefficients from k, all we need to do is to take a field k, con- 
taining these coefficients. 

We shall now make a few additional remarks on function fields over [ 
complete fields, which admit rational places. 

If we work over the complete field & itself, there is in general no natural [ 
extension of & in which we can imbed the function field K having a rational | 
place, as we could do if we worked over an admissible subfield. However, 
there is an interesting special case where such an extension exists. 

Namely, let & be complete under a discrete valuation, whose residue class 
field is algebraically closed. Then & can be represented as a power series field 
or a field of Witt vectors [9] with components in a field £ isomorphic to the § 
residue class field. Denote such a complete field by F(£). Let & be an alge- F 
braically closed field containing f and assume §8/f has infinite degree of 
transcendence. Then F'(£) is contained naturally in F(R). It turns out that 
F(&) plays towards F(£) the same role that & played towards k, in Theorem 4. 
In fact, one can replace k and k, by F(®) and F(£) everywhere in Theorem 4 
without losing the validity of the theorem. The proof is entirely similar, and 
we shall not reproduce the details of the argument (cf. [5]). 

The results which have just been obtained may be interpreted as follows: 
A complete field may be viewed as a sort of universal domain for those func- 
tion fields over it which admit a rational place. 

The admissible subfields behave as the natural fields of definition for 
varieties defined over k. However, the restriction to admissible subfields was 
partly done for convenience. For instance, if a function field K over the 
rational numbers Q has a rational place, it is clear from our theorems that 
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there exists an isomorphism of K/Q into each p-adic field, and into the real 
numbers. 


4, Additional remarks. It is perhaps worth while to note the analogy 
of Theorem 3 with the isomorphism theorem for real function fields K over 
real closed fields P, as proved in [6]. In the latter case, the internal criterion 
of reality suffices to yield the existence of a real, zero-dimensional place, and 
to prove that K can be mapped isomorphically into every real closed field of 
sufficiently high transcendence degree over P. In the case of complete fields, 
no such internal criterion seems to be available: One has to assume a priori 
the existence of at least one rational place in order to get enough of them. 
It would of course be of great interest to find such an internal criterion, if 
it exists. 


Addendum.* We take this opportunity to give another proof of the 
theorem for real function fields, following as closely as possible the proof of 
Theorem 3. 


THEOREM 5. Let K bea real function field over a real closed constant 
field P. Let K, 2; 40 (7 =1,---,m) bea finite set of functions of K. 
Then there exists a rational place p of K/P such that each 2;(p) is finite 
and 0. 


Proof. By essentially the same inductive step used in Theorem 3, and 
in the corresponding theorem of [6], we can assume that K/P has transcen- 
dence degree 1. It suffices then to prove that for a suitable model of K, we 
can find infinitely many points on the curve with coordinates in P. We 
proceed to do this. We suppose that K is ordered, and distinguish two cases. 


Suppose first the ordering of K/P is non archimedean. Then the 
canonical place p of K/P is rational over P. Let P(x,y) be a non-singu_.r 
model of K at the center of p. Then f(%, 7) —0 but f,(%, 7) ~0 and hence 
for small § in P f(Z, 9 + 8) = df, (#, 7) + where is much smaller than 8. 
This means that f(z, Y) changes sign at 8 and y—8. For all éeP 
sufficiently close to Z, f(g, Y) will likewise change sign at 7 + 8 and y —8. 
It will therefore have a zero in P, and we get in this way infinitely many 
zeros of f(X, Y), as desired. 

If the ordering of K/P is archimedean, we take any model K = P(z, y). 


* Received April 9, 1953. 
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Then f,(z,y) #0 because (x,y) is generic. The same procedure as above 
can now be applied to get the rational points in this case. This proves our 
theorem. 

The isomorphism theorem can also be obtained in a similar way: 


THEOREM 6. Let K/P bea real function field of transcendence degree r. 
Let T be a real closed field containing P, and having transcendence degree =r 
over P. Then there is an isomorphism of K into T which is identity on P. 


Proof. Let K=—P(2,,---+,2,,y) where (2z,y) satisfies over P an 
irreducible equation f(z,y) =0, fy(z,y) #0. Using Theorem 5 we can 
find a rational place p of K/P such that the point (%, ¥) at the center of p 
is non-singular: f(Z,y¥) but f,(Z,y) ~0. This means that f(z, Y) 
changes sign in P around y. For a point (€) with coordinates in T 
sufficiently close to those of (%), the polynomial f(g, Y) will change sign 
in TI. We may clearly select (€) algebraically independent over P. If 
f(é,») =0 for ne T, we see that P(é,) is isomorphic to P(z, y). 


The proofs of Theorems 3 and 5 are essentially alike, differing only 
in that the existence of rational points on a curve in the neighborhood of a 
given non-singular point obtained in one case by the Newton method, is 
obtained in the case of real closed fields by using the intermediate value 
theorem. This procedure avoids Sturm’s Theorem, and uses instead a more 
naive property of real closed fields. It also has the advantage of unifying 
the local techniques used in both cases, so that the ordinary real numbers 
appear as a common field for both cases. It must be remarked finally that 
although the full force of the local uniformization theorem was used in 
Theorem 3, only the special case of curves was needed to handle the real 
fields. This means that the proofs of Theorems 5 and 6 remain elementary. 


An interesting possibility for generalization lies in the following direction: 
How far do the results concerning specializations remain valid if one replaces 
the local fields by the quotient fields of complete local domains? The trouble 
arising here is partly due to the fact that the topology on the local domains 
has no natural extension to its quotient field, and consequently makes the 
handling of specializations over such fields more difficult. In particular, we 
know of no analogue to the Newton approximation method. 

One might for instance make the following conjecture: Let 0, be the 
ring of convergent power series in several variables over the complex numbers, 
and let o be the ring of formal power series. Let &, and k be the respective 
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quotient fields. Let (7) = (%,-**,%n) be a generic point over k,, with 
components in 0. Does there exist a specialization of (x) into 0;? 

It is actually possible to prove the following result. Let 0, —£{t} be 
the ring of power series in several variables, over an algebraically closed field f 
whose cardinality is greater than denumerable. Let & be an algebraically 
closed field containing f, and let » = R{t} be the power series over R. Then 
0, Co ina natural way. Let F, and F be the respective quotient fields. 

Let R{t} be a finite set of power series, and put (x) = 
View (a) as a generic point over F',. Then (x) has a specialization (Z) over 
F,, with Z; 0). 

Indeed, let f;(X) be a basis for the ideal in F,[X] vanishing on (2). 
By clearing denominators, we may assume that all coefficients of f; lie in 0,. 

Write each variable Y; as a power series X;(¢) with indeterminate 
coefficients: > where ranges over all monomials ¢,“:: - - ¢,“* in 


Then we can = > fin(é)u where f[é]. That all f; vanish 


on (x) implies that the denumerable system of equations fj,(€) =O has a 
solution in ®, and hence that every finite subsystem is consistent. By the 
Hilbert Nullstellensatz in infinite dimensional space [7], it follows that the 
system also has a solution in f, and such a solution yields a specialization of 
(z) into 03. 

By an analogous argument, we can also prove the following statement: 
Let m be the maximal ideal of 0,. Let f(X) be a polynomial with coefficients 
in 0,. The necessary and sufficient condition that f has a zero in 0, is that 
f=0 (mod m?’) is solvable in o, for each positive integer v. 

The problem is of course to relax the cardinality assumption, but we see 
no way of doing this at present. 


THE INSTITUTE FOR “ADVANCED STUDIES. 
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ON THE EXISTENCE OF CHARACTERS IN TOPOLOGICAL 
GROUPS.* 


By M. Cotiar? and R. RIcaBARRA. 


1. Introduction. In this section we will explain the idea of the paper 
and state the main theorems. The proofs will be given in Section 4. 

Let G = {z, 8, denote an abelian group, e the unit element, 
x(a) a character of G, Q = {f(x)} the set of all uniformly continuous func- 
tions on G, and P = {¢(«)} the set of all positive definite functions $(2z) e Q. 

Let us recall that ¢(2) is said to be positive definite if 


2 *) = 0, 


for any system of n complex numbers -, An and n elements Sn€ G. 
By a character we mean a continuous function such that 


| x(x) | =1, x(s2t) =x(s)x(t)- 


If x(x) is not continuous we will say that it is a non-continuous character. 
Any character, as well as any function of the form ¢ = > Aix: with all A; => 0, 
is positive definite. ; 

G is said to have sufficiently many characters if given an element s ~e 
there is a character x(x) such that x(s) #x(e) =1. 

The aim of this paper is to characterize the class of topological groups 
with sufficiently many characters. More precisely: a topological group is 
defined by the group operation zy and the system {0} of its open sets; we 
want to give a condition on the group operation and the system {0} which 
is necessary and sufficient for the existence of sufficiently many characters 
on G. 

Observe that by definition of a topological group, given an integer n > 0 
and an element se, there is an open set 0 such that ec 0, s= 0? 
Let us say that a subset B C G is a big set if G can be covered by a finite 
number of translations of B:@ C s,BU---Us,B, %¢G. For example if 
@ is a compact group then any open set is a big one. We will prove the 
following lemma. 


* Received January 23, 1953. 
* Fellow of the John Simon Guggenheim Foundation. 
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Lemma. In any topological group G, given se, there is a big 
neighborhood 0 of e such that ee 0 and 0°=0-0. 

We will prove that the lemma is true for 0" with n= 6, if and only 
if G possesses sufficiently many characters. More precisely: 


THEOREM 1. For any topological group G and any element se G, s Ae, 
the following conditions are equivalent: 


(1) There is a character x(x) such that x(s) ~x(e) —1. 
(2) There is a big neighborhood 0 such that ee 0 and sé 0°. 


Evidently the set 0° may be replaced by any 0",n=6. By the lemma, 
Theorem 1 is no longer true if 0® is replaced by 0", n=2. It would be 
interesting to know if 0° can be replaced by 0", n = 3, or to determine the 
minimum value for n. 

The proof of Theorem 1 is based on another theorem (Theorem 2 below) 
which we are going to state. We will recall first some known facts from the 
classical theory. A fundamental result is the following ([1], [2]): 


(A) In any locally compact abelian group there exist sufficiently many 
characters. 


More precise information is provided by the Tauberian theorems ([3], 
[4]). Let us say that a function f(z) belongs to the w*-closure of the set 
of functions A = {g(x)}, if given e > 0 and a compact subset C C G there is 
a function g(x) A such that | —g(«x)| <« for any On the set 
of characters G* = {x(x)}, the w*-closure defines a topology known as the 
Pontrjagin topology of G*. We will denote by S*[A] the w®*-closure of the 


n 
set of all functions of the form > Aigi(siw), where s;e G, ge A, and A; are 


complex numbers. Finally we define the w°-spectrum o°[A] of the set A, 
as the set of all characters x(x) such that xe S*[A]. If A is composed of a 
single function f(z) then we shall write S°[A] = S¢[f], and o°[A] =o°[f] 
= spectrum of f(z). 

The tauberian theorems give us the following properties of the 
w*-spectrum of f(z). 

(B) If G ts locally compact and feQ, then the spectrum o[f] is a 
non empty set. 

(B’) If @ is locally compact and V = {x(x)} is a set of characters 


such that o°[f] C V and such that V (as a subset of G*) is open in the 
w°-topology, then f C S*[V]. 
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The hypothesis that V is open in the Pontrjagin topology is essential. 
In other words, in general the relation f C S°[o*[f]] is not true. Godement 
({5]) proved that this relation is true in the following case: 


(C) Let G be locally compact. If feQ is integrable with respect to 
the Haar measure, or if f is positive definite, then f C S8*[o°[f]]. 


It is easy to see that (A) is a corollary of (C) as well as of (B’). 
The proofs of (A), (B), (C), use Haar measure, the theory of convolution 
and the theory of Banach Algebras or Hilbert spaces. Nevertheless in the 
statements of these theorems appear only ordinary notions concerning con- 
tinuous functions. It is natural therefore to pose the following problem: 
find a direct proof of (B), (B’) and (C), and generalize these theorems for 
not necessarily locally compact groups. Theorem 2 below gives us a solution 
for the case of proposition (C). We did not succeed in finding a similar 
solution for (B’). 

In order to state Theorem 2 it is necessary to introduce some definitions. 
Theorem (C) refers to functions integrable with respect to Haar measure 
and to the w°-spectrum of f. But neither the Haar measure nor the w°-topology 
can be used in general topological groups. Therefore first of all we have to 
define a type of spectrum and a measure which will fit for any group. 


Definition 1. We shall say that a function f(xr),xeG, belongs to the 
w“-closure of the set A = {g(x)}, if for any given « > 0, there is a subset 
A’C A such that: 


(1) all the functions of A’ are uniformly equi-continuous on G, 


(2) given 2,---,2,eG there is a function g(x) eA’ such that 
f(x) —g(ai)| for i=1,- -,n. 


We shall say that f(x) belongs to the w-closure of A if only the condition 
(2) is satisfied. 

We shall denote by S“[A] the w%-closure of the set {> Aigi(six)}, A, 
of all linear combinations of translations of functions g;(z) eA. While 
S¢[S*[A]] = S¢[A] is true, S“[S«[A]] — S#[A] is not generally true. For 
this reason we define the o%-spectrum o“[A] as the set of all characters (2) 
such that x(x) e S“[A] U S*[S“[A]] U U---. 

If A is composed of a single function f(z) then we write S“[f] and 
o“[f] instead of S*[A] and o“[A]. 


Definition 2. A function fe Q will be said regular if f C S*“[o*[f]]. 
The set of regular functions is denoted by R. 
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An immediate consequence of the above definitions is the following 
proposition. 

Proposition. (a) If fre S*[A], n=1,2,---, and >f (2) 
uniformly on G, then f(x) eS*[A]. (b) If fa C S*[f], n—1,2,°--, 
feQ, and each f,(x) is regular, and fx—>f uniformly on G, then f(z) 
is also regular. 


Now, if @ is not locally compact or locally bounded, it cannot have a 
Haar measure (that is a measure which is o-additive, invariant, finite on 
bounded sets and positive on open sets). But in G@ there always exist 
measures »(A), »(G) ~0, with the following properties: 


(1) pw is finitely additive: (> A;) no hile for every n disjoint 
measurable sets. 

(2) wis invariant: p(sA4) —p(A) if se G, and p(A) = p(A*). 

(3) lim § | F(sy) —F(y)| duly) =0, 


for any uniformly continuous and integrable function f(z).? 


(4) If f(x) 20 is p-integrable and « > 0, there is a set C C G such 


that <o and f |\fl|du<e 
G-c 
Such measures yw will be called G-measures. We introduce now the three 
following classes Ry, R,, R, of continuous functions. 
Definition 3. Let f(x) e©Q. We shall write fe Ro, if 


f(x) = f k(y*)du(y), 
where ¢ € P, » is a G-measure and k(y) and | k(y)| are -integrable functions. 
We shall write fe R, (fe R.) if there is a G-measure » and a sequence 
{k,(x)} such that f is y-integrable (f is positive definite), each k,(r) 20 
is w-integrable and k, ¢ P, and 


S f(xy) kn(y*) du(y) > f(z) 


uniformly on G. 


THEOREM 2. Let G be an arbitrary abelian topological group. Then 
any function fe R,U R, U R, ts regular, that is: Ry C R,R, C R,R, CR. 
Moreover, the spectrum o“[f] of f is a o-compact set (in the w*-topology). 


? When the integral is extended over the whole space @ we will write f f dp. 
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It is easy to see that if G is locally compact then Theorem 2 furnishes 
a more precise form of (C). In fact, we first observe that by a known property 
of equi-continuous functions on compact sets and by definition of w*%-closure 


it is clear that 
S*[A] CSA]; o*[f] Co[f]; 


S“[o*[f]] C S*[o°[f]]. 


Hence the relation f C S“[o“[f]] implies the relation fC S*[o°[f]]. There- 
fore from Theorem 2 it follows that (C) is true for any function 
fCRUR,U 

We next observe that if G is a locally compact group, » its Haar measure, 
f(z) © Q, and if we take k,(x) = 0 such that eP, ff kn(x)dy(r) = 1, 
and k,(x) = 0 outside of a small neighborhood of e, then 


f(xy) kn(y™) du (y) > f(z) 
uniformly on G. Thus, in the case of a locally compact group the class R, 
contains all the integrable functions fe Q, and R, contains all the positive 
definite functions. Hence (C) is a corollary of Theorem 2. Moreover, 
Theorem 2 gives us a further information about the spectrum of the function. 


2. Elementary observations on characters. Let G be an abelian top- 
ological group. The simplest transformations of functions f(z),zeG, are 
the translations: 7,f(r) —f(s*r),seG. It is easy to verify that if a 
bounded function fe@ satisfies T,f(x) —f(x)f(s*), for any se G, then 


| f(x) is a character or f==0, and conversely. If we denote f(sz)f(s) by 


T*f(x) the above condition may be written Tsf—f. Hence if Q’ is the 
set of the functions fe Q such that | f(v)| <1, then each T* transforms Q’ 
into itself and the characters are the non zero fixed points of Q’ under all 
the transformations 7*. Thus the problem of finding characters is related 
to the problem of finding fixed points but with the essential condition that 
the fixed point must be 0.° In order to get the last condition we will 
consider instead of the whole set Q’ the subset Q’ P. Since the T* do not 
transform P into P it will be necessary to take certain linear combinations 
of the T* and proceed as follows. 


* If fe Q’, consider all the functions f, of the form f,(#) = 7172. . . T*kf(@), and 
let C be the set of all the convex combination > Af; of such functions. The w-closure 0 
of @ is a compact (in the w-topology) convex set of equi-continuous functions and T° 
transforms © into itself. By Tychonoff’s fixed-point theorem there is a fixed point 
in @. If we could be sure that this point is ~0, we would obtain the Wiener 
tauberian theorem (B). Thus Wiener’s theorem may be regarded as a kind of a “ non- 
zero” fixed-point theorem. 


| 
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For any se G,0 [6S 2z, a= 0, we define the following transformations 
9 9° ‘ 
(1) T (a, 8, 0)f (x) —a*[2f (x) + (sx) + (sx) ], 
(1a) T(s,6)f() =T(1, 8, = 2f (x) + (sx) + (sz), 
so that T (a, s, 0)f(x) =a°T(s, 6)f(z), 
(1b) T (s,0)f(e) = 2f(e) + ef (s) + e°f(s*), 
(1c) T'(s, 6) (Af) =aT'(s, 6) f(z). 
It is easy to verify that T'(a,s, 6) transforms P into P. Moreover, the ; 
non zero fixed points are the characters. More precisely: 
Lemma 1. Let f(e) 1 and seG. If there exist two values of 8, I 
6, and 6; 462 (modz/2), and corresponding two real numbers 4, 
such that 
(2) f(x) = 8, f(x) = T (a, 8, 62) f () ( 
for any xe G, then the relations 
(3) f(xs) =f(x)f(s);  f(ws*) = f(z) f(s”) 
hold for any xe G. ( 
Proof. Suppose that f(a) satisfies the condition f(z) = T(a, s,6)f(z). § ;, 
Since f(e¢) =1, we deduce that a0, and this condition may be written “ 
(2a) 2f (x) + ef (sx) + af (x). ( 
For (2a) gives a? e**f(s) + and replacing this F 
value of a~* in (2a) we obtain (: 
— F(x) f(s) + [f —f(@)f (st) = 0. 
If this is true for two values of 6 ($mod-/2), then each bracket of the 
last expression must be zero and we obtain (3). This proves the lemma — 
Now we will drop the condition f(e) =1. (4 
LemMA 2. Suppose that | f(x)| 1, and that the following condition § Wi 
is satisfied: For any s, 0, such that T(s,0)f(e) 0 there is a real number § tal 
a0 such that f(x) =T(a,s,6)f(x). Then f(x) =f(e)x(x) where x(z) 
is a (not necessarily continuous) character. de 


Proof. Suppose first that for any s e G there are two values 0,, 02, 6, FA: 
(mod 7/2), such that T'(s, &)f(e) 40, for i—=1,2. Then by hypothesis we 
we will have f(x) = T (aj, s, 0:)f(x), i=1,2. From f(e) =aT(s, &)f(é) 


ns 
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we deduce f(e) =c40, and we may write f(x)/c == T (a, s, &) (f/c) (x), 
i=1,2. By Lemma 1 this implies f(x) /c = x(x), f(x) = =f (e)x(2) 
and in this case Lemma 2 is true. Suppose now that for some soe @ it is 
not possible to find two values of 6 (+mod 2/2) such that 
T' (80, 0) f(e) = 2f(e) + (so) + (so) AO. 

Then it must be that f(e) =f(so) =f (so?) =0. We shall show that this 
implies f(x) =0. In fact, suppose that on the contrary, for some seG 
we have f(s) #0. Then we can find a 6 such that T(s, 6)f(e) 40, and by 
hypothesis we will have f(z) for some For 
this gives f(e) = a’T'(s, 6) f(e) ~0, and we arrive at a contradiction. There- 
fore f(x) =0, and we may write f(r) =f(e)-1—f(e)x(z). This proves 
Lemma 2. 

Luma 3. For any transformation T(a,s,0) with a0, there is a 
transformation T*(a,s,6) of the form 
(4) T*(a,s, 

= (b? + f(x) + (sx) + bb, 


and two numbers X>0, » >0,A+p—1, such that 
(4a) f(x) =AT (a, s, 6) f(x) + pT*(a,8, 6)f(t), 


holds for any f(x). Besides X% may be chosen to be any number which 
satisfies the inequalities 


(4b) <1, AS 
For each T(a,s,0) there are many such T*(a,s,6) satisfying (4), (4a), 


(4b), and any of them will be called an adjoint transformation of T (a, s, 6), 
and denoted by T*. 


Proof. Condition (4a) will certainly be satisfied if 2a? + p»(b? + D,?) 
=1 and Aa? — »bb, = 0. Thus we must have 
(4c) (6+ b:)? =p, (b —b,)? = [1 — 4ra’*], 
wih A>0, »>0, A+~—1, and 0 < 4Aa? <1. It is then sufficient to 
take O << AS 407, AX <1, This proves Lemma 3. 


Remark. T(a,s,0) and T*(a,s,6) both transform P into P, for any 
determination of the adjoint transformation T*. 


3. T-invariant sets. It is well known that a function f(z),zeG, 
belongs to the w-closure (Definition 1, Section 1) of the set A = {g(z)}, if 
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and only if there is a directed subset {9g,(x)} of A such that lima ga(v) = f (2), 
for any zeG. By a theorem of Tychonoff any w-closed and bounded set 
A = {g(a)} is w-compact (we say that A is bounded if | g(z)| S M for some 
constant M and any geA,xeG@). For such sets we have the following 
theorem of Minkowski-Krein-Milman: If A = {g(zx) } is w-compact and convex 
(that is A, > 0, SA: —1, imply Awgie A), then any ge A belongs to 
the w-closure of the convex combinations of the extreme points of A. That is, 
g belongs to the w-closure of the set of the functions of the form > Agi, Ai > 0, 
> A; = 1, where each g; is an extreme point of A. Let us recall that g,¢ A, 
is said to be an extreme point of A, if 


Jo=AN ADO, Med, g2eA, 


imply go = 9: = 92. 

Definition 4. We shall say that the set A = {f(x)} is T-invariant if the 
following condition is satisfied: If f e A, and s, 6, are such that Ts, 6)f(e) 40, 
then there is a number a0, such that T(a, s, 0)f(x) eA, and such that 
at least for one determination of T* we have also T*(a, s, 6)f(x) © A. 


Definition 4a. We shall say that A = {f(x)} is strongly T-invariant 
if the following condition is satisfied: if fe A and s,6@, are such that 
T (s, 0)f(e) #0, then there exists a number a > 0 such that: 


0<m<@SM- (| 
(m, M, independent of fe A and of s,6), and 
T (a, s,6)f(x) A, T* (a, s,0)f(z) A, 
for any determination of T*. 


Lemma 2 combined with the Krein-Milman theorem yields at once the 
following method for obtaining characters. 


Proposition 1. Let A= {f(x)} bea set of functions which is conves, 


w-closed, T-invariant and bounded: | f(r)| SC for all fe A. Then each 


extreme point fo of A is of the form fo(x) =fo(e)x(x), where x(x) is a (not : 


necessarily continuous) character. Hence any fe A belongs to the w-closure 
of the convex combinations > SA =1, 420, where each yi(2) 
is a character and cixi(x) eA. In particular if f(e) 20 for any fe A, then 
any feA ts positive definite. 


Of course, if A is composed of continuous functions then the extreme 
points of A are continuous characters. 
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Proof. Let fo be an extreme point of A. If T(s, 6)f.(e) #0, then by 
the 7-invariance of A we have T(a,s,6)fo(z) eA, T*(a,s,0)foe A, a? > 0, 
and by definition of 7%, 


fo =AT (4, 8, fo + wT™* (a, 8, 8) fo; A+p=1, ADD, p>d. 


Since fy is an extreme point of A, this implies f)(x) = T (a, 8, 0)fo(%), a? > 0. 
Hence by Lemma 2 we have = fo(e)x(x). In particular if fy(e) 2 0, 
then fo(x) =fo(e)x(x) e P. The rest of the proposition is a consequence of 
the Tychonoff and Krein-Milman theorems. 


The following theorem simplifies the application of Proposition 1. 


Proposition 2. If A= ({f(x)} is strongly T-invariant, then the w- 
closure A of A is T-invariant. 


Proof. Let fe A and | T(s,@)f(e)| >0. Then there is a directed set 
{fa(z) } C A which converges to f(x): fa(x) > f(x) for each ae G, and since 
T(s,0)fa(x) > T(s, 6)f(x), we may suppose that 


(5) | T(s, 0)fa(e)| >y > 0 for all fa. 
By hypothesis, for any f, we can find a number dg > 0 such that 
(6) My", T* (ag, 8,0)foeAr 


for any determination of 7*. By Lemma 3 we can take all the A, =A, and 
all the we =p, and 


(6a) fa = AT (aa, 8, 9) fa + (da, 8, 9) fa; 
pel—a, 0<8S1. 


From (6) it follows that the directed set {ag”}, da? M:y, y > 0, has a 
limit point a7, 0< m-Sa®?=M:y. For this the function T (a, s, 0)f (2) 
belongs to the w-closure of the set {T7'(dq,s,6)fa}, and from (6a) and 
(4c) it follows that at least one 7*(a,s,0)f belongs to the w-closure of 
{T* (aq, 8,0)fa}. Therefore, from (6) it follows that T(a,s,0)f(x) and at 
least one T*(a,s,6)f(x) belong to A, and since a> 0, this proves the T- 
invariance of A, and hence Proposition 2 is proved. 

Now, let f(z) be a function such that f(e) 1. Consider all the 
functions g(x) of the form =T7,T.- -Tnf(x), g(e) =1, where 
is any transformation of the form T'(a,s,6) or T*(a,s,0). We will denote 
by A[f'] the set of all convex combinations of the form 3} Agi, 4 > 0, HA — 1, 
Where the g; are of the above type. 
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Proposition 3. A function (x), ¢(¢€) =1, belongs to P if and only 
if A[@] satisfies the two following conditions: 


(1) |f(x)|S1 for any fe ald]; 

(2) T(s,6)f(e) for any s, 0, and fe 

Furthermore, in this case A[¢] is strongly T-invariant with m =}, 
M 1. 


Proof. If fe P then also T(a, s, 0)f P, T*(a, s, e P, | f(x)| S 20. 
Hence if ¢e P then feP, for any fe Ald], and f(e) =1 by definition of 
A[¢]. Therefore | f(x)|Sf(e)—1, and T(s,6)f(e) 20. Conversely, 
suppose that any fe A[d¢] satisfies (1) and (2). If T(s,0)f(e) 
then by (2) ¢ > 0, hence if a? c~ then T(a,s, 0)f(e) =1, and therefore 
T (a, s,0)f(x) eA[d]. From f(e) = T(a,s,6)f(e) =1, and from 


f(e) AT (a, 6)f(e) (a, 8, 6) f(e) 


it follows that 7T*(a,s,0)f(e) and consequently T*(a, s,0)f A[d]. 
Since | f(x)| and a? = T(s, 6)f(e) = 2f(e) + e?#*f(s) + e?F(s) <4, 
it follows that A[¢] is strongly T-invariant with m = 4, By Proposi- 
tion 2, A—A[¢] is T-invariant, and being w-closed, convex and bounded, 
by Proposition 1, ¢ belongs to the w-closure of the functions } Aci, } Ai = 1, 
= 0, where cixi fie A[d] and x: —character. Since c;—=fi(e) 1, all 
the functions } A,c;x; are positive definite and therefore ¢e P. 

Remark. The same argument proves that condition (1) alone is necessary 
and sufficient for ¢ to be a linear combination of positive definite functions. 
An extreme case of condition (1) is when A[¢] is formed by a single function. 
Thus, if A[¢] is contained in a “ circle ” of “ centre” ¢ and “ radius r <0,” 
then ¢ is a linear combination of positive definite functions; if the “ radius” 
r is 0, then ¢ is a character. 


1. If P, = 1, then each extreme point of A=A[¢] 
is a (not necessarily continuous) character. 

It is important to observe that even if ¢ is continuous and _ positive 
definite, ¢(e) — 1, we cannot be sure that the extreme points of A[¢] are 


continuous functions. This difficulty will be avoided in the next section by f 


submitting the set A[¢] to a convolution with a continuous kernel. 


4, Proof of Theorems 1 and 2. In this section we prove the results ; 


stated in Section 1. 


Proof of the Lemma. Let se, and take an open set U such that 
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eCU, s&€U*?, U=U-. Let {B} be the class of all sets B with the 
following property: U C B, B=B", s&B*. It is easy to see that Zorn’s 
lemma is applicable here so that there is a set 0¢ {B} which is maximal 
with respect to the above property, in other words: Be {B}, 0 C B imply 
0=B. We shall show that 0 is a big set. 

Let If the set 0U cannot belong to {B}, because 
of the maximal property of 0, hence we must have 


a")? 0? U 20U U 2? Uo. 


Since s€ 0? we have only the following four possibilities: se 20, or se 270, 
or s= 27, or Hence for any element te G we have only three 
possibilities: xe 0 U s0 U or a? = 8, or Let C be the set of 
all ce G such that x* —s, and C”’ the set of xz such that v?—s". Then 
GC 

Take a fixed element and let Then if yeC, it 
follows that y?=e. If yeCo, yeO, then as above we will see that 
se(O0UyUy")? =0?U yOUy"0. Hence or yes*0. Thus 
C,C0U s0 and C C 20 U U aos10. Similarly 


C’ C U 2,'80 U . 
and we obtain 
GC {0U s0 U U {20 U U 280} U {290 U 2’s0 U 


Hence G is covered by nine translations of 0, hence 0 is a big set. Besides, 
eC U C 0 is interior to 0 and s€0*. This proves the lemma. 
Now we will prove Theorem 1 in the following more complete form. 


THEOREM 1 bis. Let G be an abelian topological group and se G,s~e. 
Then the following three conditions are equivalent: 


(a) There is a character x(x) such that x(s) Ax(e) —1. 
(b) There is a big open set 0 such that 0, 0°, 
(c) There is a function fe Ry such that f(s) Af(e). 


Proof. (a) implies (b): Since x(s) ~1, | x(s)| 1, we can find an 
are 1 of the unit circle | z| 1 such that lel and x(s) €/*, and 1 is sym- 
metric with respect to 1. It is easy to see, since x is a continuous homo- 
morphism of G@ into the unit circle, that 0 = y1(1) satisfies condition (b). 


Moreover, the same argument shows that (a) implies (b) with any exponent 
0" n=1. 


8 
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(b) implies (c): It was proved by Banach and v. Neumann that there 
exists a finitely additive measure »(A) defined on all the subsets A C G 
(so that there is no question about measurability), and such that: »(@) =1, 
p(A) 20, w(sA) —p(A) for any se G.* yw may be taken to satisfy also 


p(A) =p(A-*) because the measure 2-*{u(A) + »(A*)} has this property. 


Let 0 be a big open set such that ee 0, s€ 0° and define 
p(x) = for(y*) du(y), 


where f4 denotes the characteristic function of the set A. From the invariance 
of pu, it follows easily that ¢ is a positive definite function (this is very well 
known if » is Haar measure, but the proof does not use the o-additivity of the 
measure and applies to any finitely additive invariant measure). 
Let k(x) be a uniformly continuous function equal to 1 if xe 0 and to | 


zero if x€0*, and 0O=k(r) =1. Such a function exists by a well known | , 
argument of Urysohn adapted by Kakutani for topological groups. Since f ; 
#(G) =1 and all the bounded functions are yu-integrable, the function f f 
defined by f(z) = f k(zy)¢(y)du(y) belongs to the class Ry. We shall | j 
prove that f(s) Af(e). 
Since 0 is a big set, a finite number of translations of 0 covers G, hence | 
by the invariance of » we must have »(0) > 0, »(0*) >0. From the defini- [ q 
tion of ¢ it follows that ¢(z) if and iff q 
ze G—0*. Therefore, since k(x) —0 for re G—0?, f(x) vanishes out- 
side of 0°. In particular f(s) But for te0 we have 
$(x) = : (0%), hence 
f(e) = k(y)o(y)du(y) = {u(0) }?: (0?) > 0. th 
Thus, fe Ro and f(s) ~f(e). tn 
(c) implies (a): This follows immediately from Theorem 2. In fact by f 
this theorem f eR, may be uniformly approximated on any finite set by af 
function of the form SAxi(x). Since f(s) Af(e), one at least of the : 
characters x; must satisfy yi(s) ~yi(e), and this proves Theorem 1. , 
pai 
Thus, it only remains to prove Theorem 2. | 
Proof of Theorem 2. (1) Ry C R: Let 
f(x) = f o(y*)du(y) = f o(zy)k(y*) du(y) Th 


be a function belonging to the class R), where ¢e P, (e) —1, and k(y) 


«A more general theorem is given in Banach’s book [6], page 28. 


t by 
ny ae 
the 
| uniformly on G, and f is p-integrable, k,eP and is p-integrable. By the 
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is p-integrable and k(y) 20. Consider the set A[¢] = {h()} and for 
each he define 


(7) =k*h(x) = f k(xy)h(y*)du(y) = f 


and let K = {g(x) =k*h(x)}. Since each he A[¢] is positive definite 
we have | h(x)| <1, | T(a,s,0)h(x)| ST (a, s, 0)h(e), and since 


(7a) T (a, s,0)9(x) = f T(a, 8, 6)h(ay)k(y*) du(y) 
we obtain 

(8) | 9(z)| Sf | k(y)| = 6, 

(8a) | T (a, s,0)g(e)| S| T(a,s, 0)h(e)|-C. 


By Proposition 3, T(s,@)h(e) AO implies T(a,s,0)h(x) e with 
Sa’ S {T(s, 6)h(e)}-*. Hence from (8a) we derive that K is strongly 


T-invariant with M=C. Therefore K—w-closure of K is T- 


invariant (Proposition 2), and being convex and bounded (by (8), it follows 


| from Proposition 1 that fe K C K belongs to the w-closure of the functions 
| DAcxi(x) where %>0, SA—1, cixs(x) yi —character. But now 
' we can see that these characters x; are continuous functions, because from 


| 9(tz) —g(x)|S f | k(ty) —k(y)| dp(y) it follows that the functions 


| geK are uniformly equi-continuous on G, and hence the limit functions 
» geK are also uniformly equi-continuous. Furthermore, on account of this 


equi-continuity the w-closure of K becomes the w*-closure. From the defini- 


F tion of f(x) and ge XK it is clear that ge S*[f], hence K C S*[f] and all 
| the above characters x; belong to S“[f], hence to o*[f]. Thus f belongs to 
the w-closure of the functions > (x), xe o“[f], and by the equi-con- 
| tinuity it follows that fe S“[o*[f]]. Hence fe R. 


(2) R, C R: Let fe R,, so that 


f(z) fn(&) lim S f(y) kn (ey) du(y), 


part (1) of the theorem, f,¢R for each n. Let us fix n and consider the 


set of all functions of the form Af (ryi)kn(yi) where 


| Aken (yi)| Sf +. 
This is a set of equi-continuous functions, for 
| (cys) kn (ys) (2’yi) en (ys) | 
== sup | f (29) — | + 
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From the definition of the integral and the property (4) of G-measures 
it follows easily that f, belongs to the w*-closure of this set, hence by the 
equi-continuity also to the w%-closure. Hence fn S«[f]. Since fre fraof 
and f,e S*[f], it follows from the Proposition of Section 1 that feR. 
Similarly, it is proved that R, C R, and this proves Theorem 2. 


Remark. Let R; be the set of functions f ¢ Q such that 
f(x) =lim f f (vy) kn(y)du(y) 


uniformly on G, where k, = 0 belongs to P and is p-integrable, and f is a 
linear combination of positive definite functions. Modifying the above proof 
we can prove that R; C R. 
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ON THE TOTAL RELATIVE STRENGTH OF SOME HAUSDORFF 
METHODS EQUIVALENT TO IDENTITY.* 


By 8. K. Basv. 


1. Introduction. Given a sequence {s,} and a number « (the numbers 
with which we are concerned here are all supposed to be real, unless the 
contrary is stated expressly), let S, denote the matrix of the transformation 


(1) On = Sy + (1— (So +51 +° + Sn)/(m +1) 
(n = 0,1, -) 


(2) aE + 


where H is the identity matrix and M the arithmetic mean matrix (for 
definitions of H and M see [6], p. 105). It has been proved by Mercer that 
for ¢ > 0, Sa (or the method defined by S,) is equivalent to identity ([11], 
pp. 207-212—for a < 0, this equivalence no longer holds) ; in symbol S, = E, 
i.e. in (1), s,—>/ implies o, —/ and conversely (unless otherwise stated, / is 
finite). Since then different proofs have been given of this important limit 
theorem by different writers and various generalizations (for reference see 
[16], p. 14; also see [10]) have been made. However, the case of infinite 
limits in the above theorem (i.e. when 1] = -+0) seems to have been dis- 
cussed only by Mercer himself. It is natural to ask as to what happens when 
one replaces M in (2) by some suitable regular matrix. We shall be concerned 
here with a special case of this general question, when M in (2) is replaced 
by a Hausdorff matrix (H-matrix) of a special kind (for an H-matrix and 
its properties see [6], Chap. XI; note that M itself is an H-matrix), taking 
into account the possibility of both finite and infinite limits, the case of finite 
limits being discussed in § 3 (where our results are obtained mainly by a direct 
appeal to Pitt’s General Mercerian Theorem; see [12], p. 517) and that of 
infinite limits in § 4. In § 5 we consider the question of total relative strength 
of two matrices LJ, and Us, 0 << B < a, where 


Us = + (1—a)H, 


* Received July 11, 1953; revised November 9, 1953. 
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H being a suitable H-matrix and (Jz is similarly defined. (For the notion 
of total relative strength see [4], p. 51. When a method of summability 4 
is stronger than another B, we use the symbol A > B and when this is not 
true we write A PD B. In case A is totally stronger than B, we express it by 
A t.s. B. The symbol A n.t.s. B will denote that A is stronger but not totally 
stronger than B) The author wishes to consider similar problems for con- 
tinuous Hausdorff transformations in a subsequent note. 


2. The Hausdorff transformation of a sequence {s,} is defined by 


n 
(3) On = yy Sx (n =0,1,2,---), 
&=0 


where = px, = A" — (121), and {yn} is a given 
factor sequence. In case (3) is a regular transformation {pn}, called a 
regular moment sequence, has the representation 


1 
0 


where x(w), termed the regular mass function, is a function of bounded 
variation in the closed interval < 0,1 >. If H denotes a regular H-method 
defined by {u,»}, we use the symbol H/ ~ pn». To give prominence to the mass 
function x(u) defining H sometimes we use the symbol H ~ x (these symbols 
are due to Rogosinski; see [13], pp. 166-167). The transformation (3), 
when regular, is said to be totally regular if s,—>0o implies on ->00. (For 
conditions of total regularity of the transformation (3) see [3], Lemma 1. 
The necessary and sufficient condition of total regularity of the transformation 
(3), in terms of the mass function, is that x(x) 20 and f in < 0,15; 
see [3], proof of Lemma 1. Also cf. [9], Theorem VI and [2], p. 13. Such 
a mass function will be called a totally regular mass function). 

By taking different suitable moment functions (for the definition and 
properties of moment functions see [7], pp. 95, 102-103) we can have different 
regular H-matrices. Thus p(1) —1 (t20) gives the identity matrix F. 
For the matrix M, p(t) =1/(¢+ 1); for H*, the Holder matrix of order 
k >—1, w(t) =1/(t+1)*; and for the Euler’s matrix F, of order «21 
(for the Euler’s matrix we have used here the notation as in [3]), w(t) =«* 
(for the corresponding mass functions see [13], p. 167). These matrices are 
all totally regular. 


3. Suppose now that M in (2) is replaced by a totally regular H-matrix 7 


H ~ pn corresponding to the mass function ¢(t) which is of regular bounded 


t. 


( 
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os variation (r.b.v.) in <0,1> (for the definition of a function of r.b. v. 
A see [13], p. 176). We then consider the method (J, defined by 

ot 

(4) (1—«)H. 


ly The following theorem will be proved. 
THEOREM 1. (i) Fora>4, 
(ii) For 0<aSh, 


provided that the moment function w(t) corresponding to H satisfies the 
additional condition: 


(A) When considered as a function of a complex variable z =x + ty, 
Ru(z) 20 for 


(iii) If $(t) ts continuous at t = 1, then fora<0, Uz E (in fact, 
Ua E irrespective of the continuity of ¢(t) at t=1) but ED Us. 
We require the following lemmas. 


Lema 1. Two regular H-methods with non-vanishing moment sequences 
cannot be totally stronger than each other unless they are identical (See [3], 
Lemma 2). 


we 


| LeMMA 2. If is the moment function of a regular H-matrit H ~ x 
such that x(t) is of r.b. v. in < 0,1 > and 


|u(z)|2ad>0 (x= 0), 
 f then ED H. (See [12], p. 517 or [13], p. 178). 
Lemma 3. With the notations of Lemma 2, 


(This is a known result. See, for instance [13], p. 175) 


Proof of Theorem 1. Let p(t) and a(t) respectively be the moment 
functions of H and the regular matrix Ja~d» and let x(t) be the mass 
function of the latter. 


* We have preferred this restriction for simplicity’s sake. We might have imposed 
a less restrictive condition on u(t). It may be noted here that in order to extend the 
: result (i) to the case 0 << a < }, some suitable additional condition on u(t) is essential. 
F See [8], p. 906, Theorem 5 and the proof thereof. Also ef. [13], p- 188, § 5.1. 
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x(1) =1. The transformation defined by (4) is 


On = aS, + (1— (n = 0, 1, 2,-- -) 
k=0 
for which 
(5) An = (1 — &) pn. 


Since H is totally regular, A"*y,=0 (0SkSn). Hence 
| | > * om 1, 
k=0 k=0 


Consequently, Uj, = EF for a >4. (See [10], Theorem 1 and Cor. 3. We 
have deduced (i) directly from some results in [10]. We could have made 
use of Lemma 2 as well for the purpose. The case for « > } also follows 
easily from [8], Theorem 5 or from [1], Theorems 3. 2, 4.1. It is easy to 
give an independent proof of (i)). This proves (i). 

To prove (ii) we consider the moment function a(¢) = «+ (1— @) p(t). 
We have 


a(t + iy) + iy) =a + (1—a)(u + iv), say, 
where, by hypothesis, u—=u(2,y) 20 for r20. Then 
+ iy)| =| (2+ 1—a)u} |, 
Hence, when 0 < « < 3, 
|a(e+iy)| (1—a)uza>0 for 


Therefore, by Lemma 2, U, =~ HF for 0< a4}. 
In order to prove (iii) put e—=—£f (B>0). Then by Lemma 3, 
since ¢(¢) is now continuous at ¢ = 1, 


(6) a(t) >x(1) —x(1— 0) =1— (1+ £)4(1) =—8. 


Hence there is a number + > 0, such that a(t) <0 for t2r. Since a(t) 


is continuous for t=0 with a(0) —1, it follows that a(t.) =0 for some Ff 


Zo > 0. 
Consequently, Ua (See [13], Remark (iii) on p. 178). 


Two particular cases of Theorem (i). 


(i) Put H =c,H' + +--+ ++ ¢)H?, 


Then x(¢) is of r.b.v. in < 0,1 > and x(t) = (1— a) g(t) (OSt <1), 


| 
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n 
where ¢- = 0 (r=—1,2,3,---,p) and Then 


r=1 


aH + (1—«)(¢,H'+ SE. 


The corresponding result for continuous Hausdorff transformations follows 
easily from a general result due to Rogosinski (See [14], p. 359, Theorem 12. 
The definition of equivalence as used by Rogosinski is, however, somewhat 
different from ours). 


(ii) It is easily seen that, p being a positive integer, 
S.? = + (1— «)M]? 
is totally regular forO << «<1. For, S,? has the form 


p 
where c, = 0 (r = 1, 2,3,---,p), and Sc,—1. Hence «# + (1—2)Sy? = 
r=1 


The corresponding result for continuous Hausdorff transformations also 
follows easily from a more general result due to Fuchs and Rogosinski (See 
[5], p. 209, Theorem 5). 

In what follows when the index of summability (this nomenclature is 
due to Rogosinski and Fuchs; see [5], p. 29) for the matrix LJ, is such that 
0<a=3, the moment function p(t) is supposed to have satisfied the 
additional condition (A) stated in Theorem 1. Such a moment function we 
shall call where necessary a restricted moment function, and (Jq corresponding 
to a restricted moment function will be called a restricted matriz. 


4. We shall next prove the following Theorem. 
THEOREM 2. (i) For0<a¢<1, Uats. EF but En.ts. Ua. 


(ii) If @>1, Ets. but Uants. 


(ili) For a<0, 6(t) being continuous at t = 1, 
Ua 


We require the following lemma. 


Lemma 4. w(t) being totally monotone for t= 0, u(t) (n = 1, 2, 3,---) 
are also totally monotone. 
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Proof. If u(t) (=u) and v(t) are totally monotone for ¢= 0, then 
by Leibniz Theorem on successsive differentiation 


n 
(Uv) n = 
r=0 


so that wv is also totally monotone for {= 0. In particular, {u(t) }? = u?(t)P 
is totally monotone for ¢20. The total montony of then follows by 
induction. 


Proof of Theorem 2. To prove (i) we notice from (4) that, for 0 << a <1) 
Ua is the sum of two totally regular matrices with positive coefficients and 
as such is totally regular, as it is regular. Clearly, the moment sequence {a,}f 
as given by (5) is non-vanishing and consequently, by Lemma 1, Ua" can-f 
not be totally regular. 


For the proof of (ii) we observe that, H being totally regular, $(t) ip 
non-decreasing in <0,1>. Hence for ¢= 0, 


Accordingly, b(t) being the moment function for a7, as « > 1, b(t) = 1/a(t) 
= > Un(t), where vn(t) =1/a-(1—1/a)"u"(t) (n=0,1,2,---). Also 


n=0 
for ¢ > 0, provided term-by-term differentiation is permissible, 


(7) (—1) M(t) = 0 0,1,2,---), 


as U,(t) (n=—1,2,3,-- -) are totally monotone by Lemma 4. 

We have still to justify the term-by-term differentiation in (7). If w 
consider »(t) as a function of a complex variable then p(t) is regular fa 
xz > 0 and, therefore, v,(t) (n 2,-- -) are also regular for > 0. 

Again, for z > 0, 


Hence | v,(t)| << (1—1/a)" (n=0,1,2,---). Therefore, by Weierstrass 
M-test the series > vn(t) is uniformly convergent for R(t) > 0 and as sud 
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| can be differentiated term-by-term as often as we please (see, for instance, 
| [15], p. 95). The term-by-term differentiation in (7%) is thus justified. 
| Also since b(0) = b(-+ 0), it follows from (7) that b(¢) is totally monotone 
for 0. (The total monotony of b(t) also follows easily from a known 
Theorem. See [17], p. 151, Cor. 7.) Hence J,” is totally regular and so 
UJ_ cannot be totally regular. The latter result also follows from the fact 
that, for > 1, x(t) =— (a—1)¢(t) (0 <<t <1), 80 that x(z) <0 
' for some ¢ and since x(0) —0, x(t) cannot be non-decreasing in <0,1>. 
> The proof of (ii) is now complete. 

In (6) we have already noticed that x(1) —y(1—0) =— 8B < 0; so 
that in a left-hand neighborhood of t = 1, is not a non-decreasing func- 
' tion. Consequently, UJ, cannot be totally regular and (iii) follows. 


5. Let Us=fBE + (1—8)H, so that Ug is a matrix obtained from 

(4) by replacing the index of summability « by 8, the matrix H remaining 
the same. Then Us. =H = Us for «4,8 >0. (In such cases of comparison 
| of the strength or total strength of two matrices LJ,, Us, since the same 


8) matrix H is to serve in both cases, whenever one of the matrices is restricted, 


the other is also supposed to be so.) Our object is to compare the total 
| relative strength of the matrices (J, and Ug for 0 << 8 <a. In this connec- 
| tion the folloiwng theorems will be proved. 


THEOREM 3. For1<B< a, Usts. Ua but Uan.ts. Us. 

THEOREM 4. Jf O=B<1l<a, Usts. Ua but for 0<B<1l<z, 
n.t.s. Us. 

THEOREM 5. For0<B<a<1, Uan.ts. Us. 


6. Jf 0O=B<a<1, there are some for which 
Ust.s. Ua, and other H for which Ugn.t.s. Ua. 


Proof of Theorems 3, 4,5. Let {b,} and {M,} respectively be the moment 
sequences of Jg and Us/Ua. Then 


My = = (8 —1)/(a—1) + (a—8)/(%#—1) -1/a. 


Now, for a > 1, {1/an} is totally monotone by Theorem 2(ii). Hence for 
1<B<a, M, is totally monotone; consequently, Ugts.U,_ and so 
Uan.t.s. Ug. This proves Theorem 3. 

For O=B<1<z, 


My = (a—8B)/(#—1) (1—8)/(a—1) > 0 
(k =0,1,2,- °°). 
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Also, 


A"M;, = (a — B)/(a#—1) -A*(1/aq,) 20 (n = 1, 2, 3,- =0, 1, 2,--+), 


since {1/a,} is totally monotone for « >1. Hence {M,} is totally monotone | 
and, therefore, Theorem 4 follows. 
For 0<B<¢4<l, 


1/M, = (1—a)/(1— 8B) + (a—B)/(1—B) 


Now 1/b, > 0; but {1/b,} is not totally monotone for 0 < B < 1 by Theorem 
2(i) ; consequently, {1/,} is not totally monotone and the proof of Theorem 
5 is complete. 


Taking 8B and p(t) =1/(¢+1) we have from the 
first part of Theorem 4, with the notations of (2), M t.s. Sg, a result already 
well known ([11] Loc. Cit.; it will follow from the proof of Theorem 6 below 
that this result may be extended to the case of 0<a< 1). 


Proof of Theorem 6. For the proof of the first part of Theorem 6 it is 
sufficient to show that for given a, 8 with O=B <a< 1, there is a totally 
monotone restricted moment function »(¢) for which Ugt.s. Ua. We take 
w(t) =1/(¢ +1) (20), the moment function for M, which is obviously 
restricted. Then M(t) being the moment function for Us/Usa, 


M(t) = (1— B)/(1— 2) — B)/(1— «@) 1) > 0; 
and 
(—1)"MM(t) =n!(a—B)an*/(at >0 (n—1,2,3,---). 


Thus M(t) is totally monotone and accordingly, 


Lists. Le. 


For the proof of the second part of the Theorem we require one more lemma. [ 


Lemma 5. If M(t) ts totally monotone for t= 0, then, for t > 0, 
P(t) = — 0. 
({17], p. 167) 
To prove (ii) we take H=4(H+4E£,), a totally regular matrix, F, 
being the Euler matrix of order e, so that p(t) =4(1+ e+) (20). Then 


+ iy) = + — {1 + y —isin y)}, 
so that 
Ru(t) —4F{1 + cosy} 20 for 0. 
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p(t) is, therefore, a restricted moment function. We have now 
M(t) = (1—8)/(1— a) —2(a—8B)/(1—a@) e#/{1—a + (1+ a)et}. 
As such 


M’(t) =— 2(a— B)e'/[(1— a) + (14 
=—2(a—B) -{(1—a)e*— (1+ a)e**}/[(1—a) + (1+ a)et]®. 
(t) =— 2(«a#— B) 

{(1—a)®et 4 (1 + + (1 + 
Hence 
M’'(0) = — — B), = a(a— B)/2, M’"(0) = (a — B) (1 — 3a”) /4. 


Therefore F'(0) = (a— B)*?(a?—1)/8 <0, as O< a<1. Since is 
continuous at ¢t = 0, it follows that F(t) < 0 in a neighborhood (0, «) on the 
right of the origin. Consequently, by Lemma 5, M(t) cannot be totally 
monotone for t= 0; and, therefore, Ug n.t.s. Ua. 


PRESIDENCY COLLEGE, CALCUTTA. 
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AN EMBEDDING THEOREM FOR SEMIGROUPS WITH 
CANCELLATION.* * 


By Trevor Evans. 


Introduction. The main part of this paper is concerned with the 
following question. Is there a finite integer n such that any countable 
cancellation semigroup can be embedded in a cancellation semigroup generated 
by n elements? ? From other theorems of this type (see [1], [2], [4], [6]) 
one is tempted to conjecture that n = 2. However, this is not the case and 
we give an example of a countable cancellation semigroup which cannot be 
embedded in a two-generator cancellation semigroup. We show, furthermore, 
that there cannot be, for any n, any embedding theorem for arbitrary cancella- 
tion semigroups of the same type as those obtained for groups, semigroups, 
loops, etc. in the papers quoted above. The difficulties arise from the existence 
of subgroups in the cancellation semigroup to be embedded and we are able to 
prove the following theorem. Any countable cancellation semigroup not 
containing sub-semigroups which are groups (other than the trivial group, 
possibly, if the cancellation semigroup has a unit) can be embedded in a two- 
generator cancellation semigroup. 

As an application of the embedding theorem described above we show the 
existence of a finitely related two-generator cancellation semigroup which has 
an unsolvable word problem. 


1, Preliminary ideas. A semigroup consists of a set of elements closed 
with respect to an associative multiplication. A cancellation semigroup is a 
semigroup in which ry = «xz or yx = zx implies We need later the 
following definitions and simple lemmas, the first of which we state without 
proof. 


Lemma 1(i). Let u, v be two elements in a cancellation semigroup such 
that ww =v (or vu=v). Then u is a two-sided unit for all elements in the 
semigroup. 


* Received May 8, 1953. 

* This work was done under contract with the Office of Naval Research. 

* This problem was brought to my attention by Mr. J. Addison who remarked on the 
possible use of such an embedding in the study of certain decision problems for can- 
cellation semigroups. 
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(ii) Let S be a cancellation semigroup with unit e and let G be the set 
of all elements g in S such that gh =e for some hin 8S. Then G is the unique 
maximal subgroup contained in S and the set of elements S — G 1s an ideal 


of 8. 


(iii) Let S and G be as in (ii). If the product u,u2- - - ux of a finite 
set Uy, Us,* * +, Ux of elements of S lies in G, then each u; 1s in G. 


We now turn to semigroups and cancellation semigroups given by genera- 
tors and relations. Consider all finite strings, or words, in the set of generators 
91> 92, A set of equations pj = ai, 1, 2,3,- - between certain pairs 
of words is given and called the fundamental or defining relations. By a 
relation «= between a pair of words 2, 8 in the generators, we mean the 
following. 


(1.1) «=a isa relation, where « is any word in the generators, 
(1.2) if a= 8 is a defining relation, then a= is a relation, 
(1.3) if «= 8 is a relation, then 8 —« is a relation, 

(1.4) if a—£f and B —y are relations, then a = y is a relation, 
(1.5) if a= isa relation, then giz — 9,8 and ag; = Bg; are relations, 
(1.6) if ga—giP or «gi = Bg; is a relation, then « = B is a relation. 


Two words a, 8 are defined to be equivalent if, a= is a relation. The 
resulting equivalence classes {«} of words form a semigroup under the multi- 
plication {a@}{8} — {a} and this semigroup satisfies the cancellation laws. 
It is called the cancellation semigroup generated by 91, 92, 9z,°** with defining 
relations pj = oi, 1 = 1, 2,3,°--. 

In the above, if (1. 6) is omitted, we obtain the definition of the semigroup 
generated by 92,93, With defining relations pj — oi, 1, 2, 3,° °°. 

We now wish to introduce the notion of an elementary transformation of 
a word. Let S be a semigroup generated by 9:1, 92,93,° with defining 
relations pj oi, i= 1,2,3,---. Let be two words in such that 
&% =Apim, B =o * where dA, » are void or words in 8. Then we say that 8 
can be obtained from @ by an elementary transformation and we shall some- 
times write this as « > 8 by pi oi. Similarly, we say that a can be obtained 


®t is immaterial whether or not we include the empty word 1 although, of course, 
if it is included, then the semigroup necessarily has a unit. 
‘The relation (=) is used to denote identity between words. 


fl 
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from 6 by an elementary transformation and write Ba by opi. By 
(1.5), isa relation in S. We have, however, a stronger result than this. 


LemMMA 2. Two words «a, in a semigroup given by generators and 
relations, are equivalent if, and only if, there ts a finite sequence of words a, a1, 
M%,° * *5%;B in S such that each word in the sequence 1s obtained from the 
word immediately preceding it by an elementary transformation. 


Proof. For the proof that, if a, 8 are equivalent, then such a sequence 
exists, we refer to [5], Lemma 5. Assume then that such a sequence exists. 
By the preceding paragraph, = —2,° +, %, are relations in 
and hence, by (1. 4), «= 8 is a relation in S. That is, a and B are equivalent 
in 8. 


2. Two counter-examples. We give here two examples which illustrate 
the difficulties in obtaining an embedding theorem for arbitrary cancellation 
semigroups. These examples show that the embedding theorem obtained in 
the next section is, in a sense, the best possible. 

Let K be the cancellation semigroup generated by 9:1, 92,° * *;910 With 
defining relations. 


(2.1) 919s = 9395 = 9197 = 929s; 90910 = 1. 


We note two properties of K. It cannot be embedded in a group since the 
subsemigroup H generated by 9;,92,° is the cancellation semigroup 
constructed by Malcev [3] which cannot be embedded in a group. Further- 
more, gy generates an infinite cyclic group. 

Let us assume that a cancellation semigroup C exists, generated by two 
elements a,b, such that C contains a subsemigroup K’ isomorphic to K. 
Since K’ has a unit, by Lemma 1 this is a unit for all elements of C. There 
will be no ambiguity if we denote this unit by 1 also. Let u,v be the 
elements of K’ which correspond to go, gio respectively in the isomorphism 
between K’ and K. Since go, gio = 1 in K, uv =1 in C and so u and v are 
contained in a subgroup G of C. Now u,v as elements of C are generated by 
aand 6. There are three cases to consider. 


(i) One, say wu, can be generated by a alone, and the other can be 
generated by b alone. Then u—a”", vb". Since u,v are in G, Lemma 
l(iii) implies that a and b are in G. Hence C is a group. 


(ii) One, say u, can be written as a product involving both a and 8. 
Again, by Lemma 1 (iii) this implies that a and } are in G and hence that C 


is a group. 
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(iii) Both wu and v can be generated by one of the generators of F, say a, 
Then u—a”", v=a". Since uv =—1, we have in C. Hence ymn 


g(mtn)m 1, 


In cases (i) and (ii) we obtain a contradiction since K cannot be 
embedded in a group. In case (ii) we obtain a contradiction since gy”*" 41 
in K. Hence K cannot be embedded in a cancellation semigroup generated 
by two elements. 

We may now ask whether there is, for some n greater than 2, an embedding 
theorem for arbitrary cancellation semigroups similar to the theorems obtained 
for other systems in [1], [2], [4], [6]. Assume that such a theorem holds. 
It will be as follows. 

Let S be a cancellation semigroup generated by 91, g2, gs, °° > With defining 
relations pi(91, 92; 9s,° *) 92, 1,2,3,-- +. Then there 
is an infinite set of words 02, 03,° iM @,, * *,@, such that in the 
cancellation semigroup generated by a, With defining relations 
pi(91, +) (61, 03° +), 1, 2,3,- - the subsemigroup gen- 
erated by 6, 02, is isomorphic to S. 

We note in the above that since 6;, 62,63, : : is an infinite set of words 
in a finite set of symbols a, a2, d3,° * *,@», there is an infinite subset 641), 
6i(2), = Of the 6; and a finite subset aj;1), * Of the a; such 
that each 6 in 6;;1), is a Word containing all symbols in the set 
* * 5 and no other aj. 

We now attempt to apply the above embedding theorem to a cancellation 
semigroup consisting of the generators and relations of K in the previous 
example and an infinite number of free generators. If in this embedding the 


generators 91, *,910 of K correspond to 6; in the set 6i(1), 
then @j,1), @jc2),* * * » @jcx) Will all have inverses in F and so K will be embedded 
in a group. 


From this contradiction, it follows that the embedding theorem stated 
above does not hold and that if we are to formulate an embedding theorem 


for arbitrary cancellation semigroup S, we must be able to distinguish between 
those generators of S which have inverses and those which do not. However, > 
in an arbitrary cancellation semigroup given by generators and relations, there 


is probably no general method of deciding whether an element in the semi- 
group has an inverse. Hence there will be no general method of embedding 


an arbitrary cancellation semigroup in an n-generator cancellation semigroup. § 
This does not mean that we cannot show that any countable cancellation 
semigroup can be embedded in a cancellation semigroup generated by "f 


elements for some n. In fact, we conjecture that n 4. The reasons fo! 
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this are as follows. A cancellation semigroup, by Lemma 1, can be split into 
two disjoint parts, one a group @ and the other, a cancellation semigroup H 
not containing elements with inverses. We can embed G in a group generated 
by two elements of finite order® that is, in a cancellation semigroup A 
generated by two elements, and using the theorem proved in the next section, 
we can embed H in a cancellation semigroup B generated by two elements. 
Further relations must now be added to the elements of A and B corresponding 
to the relations between the elements of G and H in S. If these relations do 
not affect the embedding of G in A and H in B, then S is embedded in a 
cancellation semigroup generated by four elements. This method certainly 
avoids the difficulties of the above examples and it would be interesting to 
know whether it does, in fact, give an embedding of S, especially since the 
explicit construction of the embedding for a given cancellation semigroup 
requires a method for determining which generators of S have inverses and, 
as we have remarked above, such a method probably does not exist. 


3. The embedding theorem. Let S be a cancellation semigroup gen- 
erated by a finite or countably infinite set of generators 9, 92, g3,° °° and 
defined by a finite or countably infinite set of relations 


We shall assume that S does not contain any subgroups. By Lemma 1, this 
is equivalent to assuming that there is no relation in S of the form gia = gj 
or “4g; == g; Where « is a non-empty word in the generators.® 

We construct now a semigroup F generated by two elements a and b. 
Let 6; denote the word biabia*bi, i= 1,2,3,---; and X& the set of words 
generated by 6,, 02, 63,: There is a one-one correspondence between the 
words in § and the words in & obtained by making g; correspond to 6;. If 
is any word in S, a will denote the corresponding word in K. We also use 
the following notational device. If the word « in F is of the form b‘y, then 
bia will denote the word for <i and will denote the word y. 
A similar notation will be used for words ending with a power of b. 

The defining relations of F can now be described. Consider all relations 
a= in § such that « and 8 do not begin with the same generator nor end 


*This result (unpublished), an extension of one of the results in [2], has been 
obtained by B. H. Neumann. 

* Here, and throughout this section, we do not include the empty word as a word 
in 8, 
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with the same generator. We call such relations reduced. These relations 
have one of the following forms: 


(3.2) giyg3 = GAM; GiGi LIAR; Gi, 


Here y and 8 are words in 8 or possibly void. 

Corresponding to every reduced relation «= in S of the form gig; 
= we take b-?a’b-? = as a defining relation of F where 
p=min(t,k), q=min(j, m). 

Corresponding to every reduced relation « = 8 in S of the form giygj = 9; 
we take b-?a’b-4 as a defining relation of F where p= min(1, hk), 
q==min(j,/). Here, of course, is 

Corresponding to every reduced relation = in S of the form = 
we take b-?a’b-? = b-?8’b-” as a defining relation of F where p = min(i, J). 
Here, a’ and #’ are 6; and 6; respectively. 

As an example, suppose 9:929;—=g9sg2 is a relation in S. Then 
= would be a defining relation of F. 

In what follows we will constantly be using special properties of words 
in &. These properties are specified in the next lemma. 


Lemma 3. Let o be a word in & of the form Arp (A, p may be void). 
If the subword x begins with b’-Paba? (for some r,0 Sp Sr), and ends with 
absa*b*-4 (for some s, OS qs), then is a word in K and rb, 


are void or words in K. 
Proof. A word in & has the form 
The lemma follows from this. We leave the details of proof to the reader. 


Lemma 4(i). If a= 8 is a relation in 8, then o =f’ is a relation 
in F, (ii) if =» is a relation in F and if £ is a word in K, then 7 1s also 
a word in K, (iii) if a’, B’ are two words in K such that a = B’ is a relation 
in F, then a= 8 is a relation in 8. 


Proof. (i) Let «= 8 be a relation in 8. This relation can be written 
as Adu = Ap. where ¢, w do not begin with the same generator nor end with 
the same generator and A, » are void or words in 8S. Then ¢ =y is a reduced 
relation in S and, for some p, q, b-°¢’b-4 = b-*y’b~4 is a defining relation of F. 
Since a” == B’ = it follows that is a 
relation in F’. 
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(ii) One sees from Lemma 3 that if a word ¢ in K admits an elemen- 
tary transformation, then £ == where X’, are words in K 
or void and b-°¢’b-4 = b-*y’b~4 is a defining relation of F. 


If £7 is a relation in F', by Lemma 3 there is a finite sequence of 
elementary transformations carrying £ into y. Assume, first of all, that this 
sequence consists of just one transformation. By the preceding paragraph 
and so Hence »=A'y’p’ is in K. 
A simple induction on the number of elementary transformations connecting 
{and » completes the proof. 


(iii) If a’, B’ are connected by one elementary transformation, then, by 
part (ii) of this lemma, =)’¢’p’, B’ where 2’, p’ are in K and 
¢=yisarelation in S. Then «=Adu, B =A and so a = B is a relation 
in S. In general, we have a finite sequence of words @’, @n, 
all in K, such that each word is obtained from the word preceding it by an 
elementary transformation. Then = @, a, +, are relations 


, 


in S and so «= is a relation in 8. 


LemMA 5. The subsemigroup of F generated by 6;, 02, 63,° is 
morphic to 8. 


Proof. By Lemma + two equivalent words in F are either both in & or 
both not in KH. The equivalence classes consisting of word in K& form a 
subsemigroup K of F. There is a one-one correspondence «<> a’ between 
words « in § and words «’ in K. Now, by Lemma 4, two words a, B in S 
are equivalent if, and only if, a and f’ are equivalent in F. Hence, the 
semigroups S and K are isomorphic. 


We have now embedded S in a semigroup generated by two elements. The 
next step is to show that F can be regarded as a cancellation semigroup and 
to do this we need to obtain some properties of the defining relations of F. 

Let » be a word in F and let o>, by & > m, 01 > w2 by £2 > n2 where 
are elementary transformations of We say that these two 
successive elementary transformations of » overlap if part of w, lies both in 
the », of the first transformation and in the ¢ of the second transformation. 
There are four ways in wl.ich this can happen. 


(a) & has the form Ayn. Then 
(3. 3) oS > = > w2 = 


(b) & has the form Az, y; has the form zp. Then 
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(3. 4) wo > = = Kyopt 
(c) has the form Then 

(3. 5) wo Khit = > = 
(d) 1 has the form Az, £ has the form zp. Then 


In all these cases A, w, x, r are void or words in F. The word = is assumed 
non-empty since, otherwise, the two transformations would not overlap. 

The overlap, that is, the subword of »; which is in both and &, is 
m1, 7, C2, in cases (a), (b), (c), (d) respectively. If this overlap y is such 
that, for some p, g, b®yb4 is a word in &, then we say that the transformations 
X-overlap. Since, for some m,n, p, g, 6b", b°€.b9 are in K, the overlaps 
in (a) and (c) are always K-overlaps. In cases (b) and (d) the situation 
is a little more complicated. We will restrict our discussion to (d) since (b) 
can be treated in the same way. 

Since Az is one side of a defining relation it is of the form ¢ab"a? or 
pab*a*bs", where ¢ is such that b”Arb”" is in K. Similarly, zz can be written 
as ab’a*y or 6" %ab'a*y. In the following cases we shall say that the trans- 
formations have a short overlap of type (d). 


(3.7) A\= ¢gab"a, p= dPay, 
(3.8) m=ab?, A= ¢gabia, p=ay, (p=—s—ni in this case), 
(3.9) (iS min(s — n, r— p)). 


An examination of the beginning of zz and the end of Aw shows that it is 
not possible for to be aba, a*b*", bia? or a*. It follows 
that, except for the short overlappings listed above, + necessarily must begin 
with ab?a* or b’-?ab"a® and end with ab"a? or ab*ab*-". In this case, since 
b™ Arb", K, by Lemma 3, b’rb"e K. Hence, except for the short 
overlaps, all overlaps of type (d) are K-overlaps. 

Similarly, except for three short overlaps, all overlaps of type (b) are 
K -overlaps. 


LemMMA 6. Let by 6-7, respectively, the two 
transformation having a K-overlap. Then there is an elementary transforma- 
tion £3 > nz of F such that w—> we by £3 > 93. 


Proof. The method of proof will be sufficiently illustrated if we consider 
overlaps of type (d) only. 
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For some m, n, p, g, we have 
(3. 10) b"C,b" = @’,, = Bs, = a's, = 


where = £1, are relations in S. Thus both the words b”Azb", 
b’rpb’ are in K. As we have noted above b?rb” is in K since the overlap is 
a K-overlap. Considering b?xb" as a subword of b”Arb", by Lemma 3, the 
word 6”Ab-? is either a word in & or possibly void. Similarly b-"pb¢ is 
either a word in & or void. 

Hence, in the relations @b-"ub? = b™AD-Pa’, = both 
sides are words in K. These two relations can be written as 


We obtain the new relation b”{,yb4 = b™An2b% and since both sides of this 
relation are in K, by Lemma 4, there is a relation a; = 8; in S such that 
a’, = B’s = Now @, a, begin with the same generator, and 
Bi, Bs begin with the same generator. Hence a3, 8; do not begin with the 
same generator. Similarly, a3, 8; do not end with the same generator. It 
follows that = or is a defining relation of F, 

The remaining cases are proved in the same way, i.e. by considering 
those relations in S which correspond to the overlapping transformations in F’. 


Lemma 7%. Let aa and aB be two equivalent words in F. Then « and B 
are equivalent in F. 


Proof. The proof is by induction on the number of elementary trans- 
formations connecting ax and a8. Consider first of all the case where az —> aB 
by £-—>y. As a defining relation of F one side of £7 begins with a and 
the other side with a power of b. Hence £— y cannot affect the beginning of 
az since both aw and @@ begin with a. The transformation must, therefore, 
occur inside a, carrying « into B. That is, « and B are equivalent. 


Assume now that the lemma is true whenever ae can be transformed into 
a8 by n or fewer elementary transformations and consider the case where aa 
and a8 are connected by n + 1 transformations. We will denote this by 


(3. 12) at > 0, P02 >on, > a8 


the suecesive transformations being m, 2 > We can 
dispose of three cases immediately. 


(1) Suppose that, for some 1, o; begins with a, that is o; is of the form 
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Hence « and £ are equivalent. 


(ii) Suppose that, for some 1, the transformations { > yi, fis > ni, 
have a K-overlap. By Lemma 6 these two transformations can be replaced 
by one. Hence az and af can be connected by n elementary transformations 


and so a and £ are equivalent. 


(iii) Suppose that aw can be written as xr and that the first 7 trans- 
formations all occur in x; that is, 7 is not affected by any of the transforma- 
tions 2. Let the transformation {j,; > occur in +. 
Using an obvious notation, we can write the first 1+ 1 transformations as 


But then the sequence of transformations 

will also transform a into ;,,, and so the sequence of n + 1 transformations 


transforms aa into a8. Now in this last sequence, the second term xr, begins 
with a since az= «xr. Hence, by (i), and are equivalent. 

We shall assume from this point that in the sequence (3.12) the three 
situations described above do not occur. Then £,—> 7 must occur at the left 
end of aa and £,—>7, f2—> 72 have a short overlap of type (d). The next 
transformation £; >; must then either have a short overlap of type (b) or 
(d) with £,— , or £, must lie to the left of y2 in oe. 

Similarly, if £, > 2, £;—> 73 do have a short overlap of type (d), then 
either £;—> 3, £,—> 7; have a short overlap of type (b) or (d), or ¢, must 
lie to the left of 7; in w;. In general, if each pair of successive transformations 
nw > 1, 2, +, have a short overlap of type (d), then 
List > either have a short overlap of type (b) or (d), o 
must lie to the left of IN 


ay. Then, by the inductive hypothesis, both a@ and 8 are equivalent to y, 


Let r be the number such that each pair of successive transformations in 
has a short overlap of type (d) but 
£41 > nrei do not have a short overlap of type (d). We have seen above that 
= 2. Since o, begins with b, the elementary transformation taking into 
af, cannot have a short overlap of type (d) with the preceding transformation. 


Hence, r= n. 
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We can therefore write the first 7 transformations as 


(3. 16) C1 Aim, Avtre, > 


where the short overlaps 7, 72,° * +, mr, are as listed in (3. 7), (3.8), (3.9). 
Then 
(3. 17) ag = Ci pipe 


and the transformation occurs in * 

We consider now in what way ¢,,, can be a subword of A,Az° + * Apaitr 
with the restriction that it cannot have a K-overlap with 7. An examination 
of the form of A; in (3.7), (3.8), (3.9) coupled with the fact that, for some 
j,k, b/G,4,b* is a word in &, shows that only two cases are possible. 


(1) is a subword of some 1S tSr—1. Let go into A; 
under the transformation {,.: > 41. Then it is easily verified that the 
sequence of 7 elementary transformations (3.16) with A; replaced by ); trans- 
forms az into ,,,. Hence ae can be transformed into a8 by n elementary 


transformations and the lemma is proved. 


(2) Here we consider those cases in which £,,, does not lie entirely in 
one of the A;. If %,,,; contains subwords of more than one );, it must contain 
the right end of some A; and the left end of (or for 1 St 
We use now the fact that if one side of a defining relation of F begins with a, 
then the other side begins with a power of b. In (3.7), if A; ends with ab"a, 
then z; is a and hence Ay, (or 7m) begins with a power of b. In this case 
{1 cannot overlap both A; and Ay, (or n,). In (3.8), if Ay ends in abéa, 
then +; is ab? and hence A;,, (or nr) begins with a power of b. In this case 
{41 cannot overlap both A; and (or In (3.9), if Ay ends in 
then 7; is b¢ and hence A;,, (or y-) begins with a. In this case £,,, can overlap 
A: by a power of b, the rest of £,,, being contained in d;,, (we can disregard y, 
here since we have assumed that £,.: — ys: does not have a K-overlap with 
nr) 

in this final case, all of £,,, except a power of b at the left end lies in 
Hence the transformations and have a K- 
overlap. Now 


(3. 18) =AirA2° * AtAte * pr—1X 


Where the transformations £;,2—> mr all occur in the subword 
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Since Occurs in the subword it is easily 
seen that the following sequences of r+ 1 elementary transformations 4 


transforms aa into Replacing {141 rer by a single elemen- 
tary transformation, as we may do since these two transformations have a 
X-overlap, we can transform az into ,,, by r elementary transformations and th 
hence a« into a8 by n elementary transformations. Then, by our inductive 
hypothesis, ¢ and @ are equivalent. This completes the proof of the lemma. 

Lemma 7 requires only trivial changes in order to prove that if ba and 
bB are equivalent words in F, then a and £ are equivalent. Also, if in the 
above, we consider short overlaps of type (b), the same method of proof shows 


that if aa and Ba (ab and Bb) are equivalent words in F, then a and B are — ™ 
equivalent. 
Hence if ga —gB or ag = Bg (g is a generator, a or D) is a relation if 
in F, then a = B is a relation in F. It follows that two words are equivalent Tr 
in F if and only if they are equivalent in the cancellation semigroup 0 > ™ 
having the same generators and defining relations as F. Thus, the identity Th 
mapping is an isomorphism between F and C. We have now embedded the the 
cancellation semigroup S in a cancellation semigroup C generated by two 
elements. However, the defining relations of C are not in a very useful form. | ng, 
This is remedied by the next lemma. ger 


Lemma 8. The cancellation semigroup C (and hence the semigroup F) 
is isomorphic to the cancellation semigroup D generated by a, b with defining | pro 


relations gen 
(3. 20) pi( 62, 03, ° = 0i(4:, 02, 6s, i= 6,, 
in 


Proof. It is sufficient to show that C satisfies the defining relations of 
D and that D satisfies the defining relations of C. Now pi(9:, 92; 9s," °°) s 
= 0i(91, 92, 9s,’ **) is a relation in S and so, by Lemma 4(i), p;(1, 62, 43, 


= i (6;, 02, - -) is a relation in F, therefore a relation in C. Hence 0 
satisfies the defining relations of D. Furthermore, a defining relation of C 
has the form b-?a’b-4 = b-?8’b-4 where « = 8 is a relation in S. It is easy ii 
to show that if « = @ is a relation in S then a’ = f’ is a relation in D.’ On f ero 
cancelling powers of b it follows that b-?a’b-? = b-°8’b-4 is a relation in D. oy 
Hence D satisfies the defining relations of C. bal 
We are now in a position to state the main theorem. ‘ 
ey 


7 Using the correspondence g,<> 6, we can copy in D the steps leading to the f 
relation a = £ in 8. 
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TurorEM I. Let S be a cancellation semigroup given by generators 
gr; * * and relations 


pPi( Gis Jes Jas = 01(91, Jo, Ja; i= 1, 2, 3,° 


and such that S does not contain a group as a subsemigroup. Then 8 1s 
isomorphic to the subsemigroup generated by b‘ab‘a*b‘, 2,3,---, of 
the cancellation semigroup generated by a, b with defining relations 


pi(baba*b, b?ab?a*b?, b'ab*a*b®, - - -) =o; (baba*b, b?ab?a*b’, -), 


In the above work the empty word was excluded from consideration as a 
word in a semigroup or cancellation semigroup given by generators and 
relations. However, the proof of Theorem I goes through unchanged even 
if the empty words is included among the words of the semigroups considered. 
This will then be a unit element for S and the restriction, that for no non- 
empty « is gia —=g; or ag; — gi a relation, can be replaced by the following. 
There is no relation of the form « —1 in S, where « is a non-empty word in 
the generators of S. 


CoroLtiaRy. A countable cancellation semigroup with a unt but without 
non-trivial subgroups can be embedded in a cancellation semigroup with a unit 
generated by two elements. 


As in [1], the above embedding theorem can be interpreted in terms of a 
property of certain subsemigroups of the free cancellation semigroup on two 
generators. In our construction we have used the subsemigroup generated by 
§,, 02,03, - -. We may ask which other semigroups can play the role of K 
in the proof of the embedding theorem. It is easy to obtain sets of generators 
to replace the 6; which have a more complicated form than the 6; as words in 
a,b; one such is bia‘bia'b', i = 1, 2, 3,- - - but it seems likely that no simpler 
set of words can replace 62, 63,° 


4, The word problem for cancellation semigroups.* A. M. Turing [5] 
has given an example of a finitely generated and related cancellation semi- 


| group with an unsolvable word problem. Using the embedding of Theorem I, 
|} wecan prove the following theorem. 


*I am indebted to Dr. W. W. Boone for several illuminating discussions concerning 


| the word problem for cancellation semigroups. The proof given that there is no relation 
| *=1 (a a non-empty word) in Turing’s semigroup is essentially due to Dr. Boone. 
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TueEoreM II. There exists a finitely related cancellation semigroup 
generated by two elements for which the word problem is unsolvable. 


Proof. It is only necessary to show that the cancellation semigroup 
constructed by Turing in [5] satisfies the condition of Theorem I necessary 
for it to be embedded in a two-generator cancellation semigroup. We shall 
use the terminology of [5] but adapt the notation to conform with the rest 
of this paper. 


Let S be the semigroup given by the generators and defining relations 
on pp. 495-496 in [5] and let C be the cancellation semigroup with the same 
generators and defining relations. We first show that two words in the 
generators are equivalent in S if, and only if, they are equivalent in C. 
Turing has stated the equivalent of this in [5], p. 497, line 12 ff. but only 
proves that two normal words are equivalent in S if, and only if, they are 
equivalent in C ([5], Lemmas 17 and 19). 

Any word «@ in the generators can be written as Agigou where gig is a 
barrier and Ag, contains no barrier. Here A and » may be empty. Using 
Lemmas 5 to 11 in [5], in particular Lemma 11, if B is equivalent to « in § 
(i. e. B can be obtained by a sequence of elementary transformations from 2), 
then 8 can be written in the form «g39,7 where g3g4 is a barrier, Ag, is equi- 
valent to «gs in S, and gop is equivalent to gsr in S. Furthermore «xg; contains 
no barrier. Hence the transformation of « into B can be effected by first 
transforming Ag, into «gz, then transforming gou into gyr.. 

Assume now that 2 is of the form gy and £ is of the form gé where g 
is a generator. Then Ag,, xg; can be written as gp, go respectively. As words 
without barriers, gp and go are either both normal or both consist entirely of 
left (right) symbols. If gp, go are normal, by [5], Lemma 17, p and o are 
equivalent in S. If gp, go consist entirely of left (right) symbols, by [5], 
Lemma 18, the words gp and go are identical and so, again, p and o are 
equivalent in 

It follows that there is a sequence of elementary transformations taking 
y into 8; i.e. y and 8 are equivalent in §. In the same way we can show that 
if two words yg, 8g are equivalent in S, then y and 8 are equivalent in 8. 
Thus two words are equivalent in § if, and only if, they are equivalent in @. 


It is now easy to show that no relation of the form «—1 (or ga=g f 


if we are not allowing the empty word) holds in C, where @ is a non-empty 
word in the generators. For «—1 (or ga—g) in C if, and only if, there 
is a sequence of elementary transformations of S carrying « into the empty 
word (or ga into g). Now, in the defining relations of S, no side of a relation 


[1] 


[2] 


[3] 


[4] 


[6] 
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has length less than two in the generators. Hence, neither the empty word 
nor a word consisting of a single generator can be affected by an elementary 
transformation of 8 and so neither can be equivalent to any word other than 
itself. 

The cancellation semigroup C therefore satisfies the condition necessary 
for Theorem I to apply and we can embed C in a cancellation semigroup 
generated by two elements. This two-generator cancellation semigroup has 
the same (finite) number of defining relations as C and obviously its word 
problem is unsolvable. 


THE INSTITUTE FOR ADVANCED STUDY. 
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THE UNITARY EQUIVALENCE OF BINORMAL OPERATORS.* ! 


By ARLEN Brown. 


1. Introduction. The principal object of this paper is to give a set of 


unitary invariants for a certain class of operators (called binormal) so chosen { 


as to be a generalization in operator form of the 2 X 2 complex valued matrix 


in a way parallel to the way in which a normal operator is a generalization [| 


of a complex number. A binormal operator consists essentially of a 2 Xx 2 
matrix of mutually commuting normal operators acting in the usual fashion 
on a Hilbert space of the form # © #. But while this characterization of 
the object of investigation has the advantage of immediacy, the more con- 
venient definition for the purposes of this paper, and the one given below, 
is in terms of the *-ring it generates. The exact relation between these two 


aspects of a binormal operator is the first problem to which we address our | 
attention, and the desired formulation is stated in Theorems 1 and 2. The [| 
central result of the paper may be paraphrased as follows: We attach toa | 
binormal operator four commuting normal operators (fixed polynomial func- | 
tions of the operator and its adjoint) and show that except for comparatively | 
uninteresting normal direct summands which each posseses, two binormal 


operators are unitarily equivalent if and only if these four functions of each 


are simultaneously unitarily equivalent in pairs (Theorem 5). In the course [| 
of proving this it is necessary to obtain considerable information about the | 


structure of a binormal operator, and in particular a sort of standard form 
is given for any such operator ($9). Also we show that Dixmier’s solution 


of the “ two projection problem ” [2] is subsumed under the theory of binormal 


operators in a natural and satisfactory manner. 


Concepts or terms introduced below without any definition are to be taken F 
as defined in such standard works as [11] or [15]. All the terminology will f 
be found to be in accord with the more recent work [4]. To mention a few 


relevant points in advance however: Hilbert space always means a Hilbert 


* Received Nov. 1953. 
1 This paper is essentially a part of the thesis submitted in partial fulfillment of 


the requirements for the Ph. D. degree at The University of Chicago. The research for 


the paper was carried out while the author was an Atomic Energy Commission Pre 
Doctoral Fellow. 
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space admitting complex numbers as scalars. It is not assumed to be infinite 
dimensional. The term operator is used exclusively to signify a bounded linear 
transformation of a Hilbert space into itself. The notions of a ring (of 
operators), of the equivalence of projections, of factors and types etc., are 
those introduced by Murray and von Neumann in [10]. Rings are not assumed 
to contain the identity operator (but every ring does automatically have a 
unit, namely its maximal central projection). 

The familiar notation Diag(a, y,z,---) (a, y,2,° any symbols) is 
employed to signify the diagonal matrix with entries z, y,z,- - - from left to 
right down the main diagonal. Moreover, for the sake of brevity in repeated 
references to triangular 2 X 2 matrices we use the notation Tri(z, y,z) to 


stand for 
0 


The author wishes to express his sincere gratitude to Professor Irving 
Kaplansky for his helpful suggestions during the preparation of this paper. 
Thanks are particularly due for many conversations about the papers on 
AW*-algebras [6], [8] which are the basis of this study. 


2. Binormal Rings. We begin by defining the class of operators to be 
investigated in terms of the rings containing them. A ring R will be called 
binormal if it satisfies the identity 


(*) D> + = 0 


where XY;, 1 1,- - -,4 are arbitrary elements of R, the summation is taken 
over all permutations of (1, 2, 3,4) and the sign is the parity of the permu- 
tation. A single operator is binormal if the ring it generates (the smallest 
ring containing it) is binormal. This identity is so chosen that the full ring 
of n X n matrices over a commutative ring with unit satisfies it if and only 
ifn < 2 [1], and it would not be very difficult to show directly that the only 
binormal factors are 7, and J, thus making it possible to use reduction theory 
to study the relation between binormal rings and matrix rings, and indeed 
the next two sections will make precise the exact sense in which a binormal 
ring is a 2 X 2 matrix ring. It has seemed preferable, however, to develop 
the necessary theory “in the large” independently of the use of reduction 
theory, and the following analysis is based upon some results of Kaplansky [8]. 
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3. The imposition of (*) on a ring of operators restricts the equivalence 
of projections in the ring. Thus we have at once: 


Lemma 3.1. If the ring of operators R is binormal then it does not 
contain three mutually equivalent, orthogonal, nonzero projections. 


Proof. If it did one could construct at once in standard fashion an 
abstract system of 3 X 3 matrix units, which obviously could not satisfy (*). 


Let us follow the terminology of [8] in calling a projection EH ~0 ina 
ring R abelian if the subring ZRF is abelian. The following lemma is an 
immediate consequence of Lemma 3.1 and [8], Lemma 4. 12. 


Lemma 3.2. If R is a nontrivial binormal ring then R contains an 
abelian projection. In fact, if EH 0 is any projection in the center of R 
then R contains an abelian projection F= E. (In terminology of [8] then, 
R is of type I.) 


We call a projection G in a ring R homogeneous (of order two) if 
G=—=G,+ G, where G, and G, are orthogonal and equivalent abelian pro- 
jections in R. (There is no need here to define homogeneity of higher orders 
as we shall be concerned exclusively with binormal rings.) A binormal ring 
will be called homogeneous if the identity operator 1 is homogeneous in it. 
The following lemma is a direct consequence of the definition. 


SS 


LeMMA 3.3. The supremum of an orthogonal family ‘of central abelian 
[homogeneous] projections is itself central abelian [homogeneous]. 


THEOREM 1. Let & be a Hilbert space and R a binormal ring on H. 
Then there are subspaces I and N, orthogonal complements of one another, ir 
and rings R, and R, on In and N respectively, such that R, is abelian, n 
R, is binormal and homogeneous, and such that R=R, @ R, (1. e., such that 
R consists of all operators on H = IM @N of the form A @®B with AcR, 


and BeR,). 
et 

Proof. By Zorn’s Lemma let {£;} be an orthogonal system of nonzero 
central projections in R which is maximal with respect to the property that — 7 
each F; is either abelian or homogeneous. It results in standard fashion from E, 
an application of Lemmas 3.1 and 3.2 along with [8], Lemma 4.8 and the J ry 
usual argument from maximality that 7 = sup; is the unit of R. We now §— im 
simply split {#;} into two parts, denoting by P the supremum of the sub- § fh, 
collection of the H,’s which are abelian and by Q the supremum of the homo- § Dj, 


geneous Fs. Then FE =P-+ Q, P is central abelian and Q is central and 


= 


an 
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homogeneous. We set 71 = Q(#) and M=—HO MN and define R, and R, 
by restricting the operators of R to In and TN respectively. The desired 
relations follow at once. 


In view of this theorem we may limit our considerations to binormal 
rings which are homogeneous. Accordingly, let R be a binormal ring on a 
Hilbert space # and let where the projections (i =1, 2) 
are orthogonal, equivalent and abelian in R. Set £i(#) =; Since the 
Bs are equivalent there is a partial isometry Ve R with V*V =F, and 
VV* = E., and hence V IN, is an isomorphism between the two Hilbert 
spaces NM, and N.. If we let K denote the Hilbert space M, © N, and define 
i(t,y) =x+ Vy for (x,y) ¢ K, it is easy to see that 7 is an isomorphism 
of K onto 9, and hence that the set R of all operators 7-1A7 for Ac R is a 
unitarily isomorphic copy of R on &. Thus we represent R as a ring of 
2X2 matrices in the usual fashion. (Quite precisely, we identify the 
operator C on & with the matrix (Wi;) according to the formula C(z, y) 
= + Wirey, Weix + Weoy) where, of course, the Wj;’s are operators 
on N,.) 

It is easy to settle the question of exactly what matrices correspond to R 
in this scheme. In the first place, the two projections H, and F,, which we 
now prefer to call H,, and F.., along with V and V*, which we denote by 
F,, and EF, respectively, give us a set of matrix units in R with Fy, + Fo.=—1 
and, furthermore, these four operators are carried by j onto the unit matrices 


(00) 0) G0) 5) 


in natural order. Let us state the following theorem, making use of the 
notation already introduced. 


THEOREM 2. Let R be a homogeneous binormal ring on & with the 
matrix units 1,7 = 1,2 where and Ex. are orthogonal abelian pro- 
jections with E,, + Hs.—=1. Then R is represented via j on K as the 
ring R of exactly those matrices (Wi;) whose entries lie in the abelian ring 
Z’ on Ni, where Z’ consists of all restrictions to N, of the operators in 
E,RE,,. A ring is homogeneous and binormal if and only if it can be so 
represented for some abelian ring containing 1. Under this isomorphism the 
matric units Ei; correspond in natural order to the unit matrices (**) and 
the center is carried onto the 2-fold copy of Z’ consisting of all matrices 
Diag(W, W) with We Z’. 


Proof. In the first place, it is a well known fact of matrix theory (see 
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e.g. [5], pp. 57-58) that R consists of all elements of the form %,4;;F,; 
where A;;¢ D and D denotes the collection of elements of R commuting with 
all £,;. This D is a ring between R and its center Z and an operator in it is 
carried by j onto some matrix (Wi;) which commutes with all four unit 
matrices. Computation shows that W,.— W.,—0 and Wi, = W2.2 so that 
the elements of D correspond under j to matrices Diag(W,W) in R. But 
then also it is clear from the definition that WeJZ’ and hence that D is 
abelian, so that D — Z is the center of R, and, as is readily seen, the theorem 


follows. 


Corottary A. Ina homogeneous binormal ring, the only central abelian 


projection is 0. 


Proof. A central projection must have the form Diag(H,#) where F 
is a projection. If this element is also abelian the matrix Tri(0, #,0) must 
commute with its own transpose, whence / = 0. 


This last result enables us to clear up a point which was left unsettled 
in Theorem 1. Reverting for the time being to the notation of that theorem 
we recall that the unit EF of the arbitrary binormal ring R is written 
E=P+Q with N =Q(&) so that the result, R., of restricting the 
elements of R to N is homogeneous and binormal. Now if F is any central 
abelian projection in R and if F,—F| then F,., being again central and 
abelian in R., must be trivial, or FQ —0 which implies F= P. Since P is 
itself central abelian by construction we see that P is identified as the maximal 
central abelian projection in R. 


Corottary B. The decomposition of Theorem 1 is unique. The pro- 
jection on the subspace In is the sum of the annthilating projection 1—E 
and the maximal central abelian projection P. 


4. If SM, and &, are Hilbert space and if ¢:%,—> MH, is an iso- 
morphism then we write ¢°(A) —¢*A¢ where A is an operator on #;, 
introducing the notation ¢° for the unitary isomorphism implemented by ¢. 
We proceed to investigate the circumstances under which two binormal rings 
are unitarily equivalent, and in view of the last result we shall assume that 
they are both homogeneous. 


TuEoreM 3. Let R, and R, be homogeneous binormal rings on Hilbert 
spaces 8, and H, respectively. Let Z, and Z, be their respective centers. 


Then R, and R, are unitarily equivalent if and only if Z, and Z, are, and§ 
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further, if $° is any unitary isomorphism of Z, onto Z,, there is another 
unitary isomorphism y° of R, onto R, which agrees with $° on the ring Zp. 


Proof. We limit our attention to the last assertion. By Theorem 2 we 
may assume that (i—1,2) #;— I: @ M; and that the operators in the 
ring R; are those matrices whose entries lie in the abelian ring Z’; on the 
space %;. The centers, Z;, consist then of all operators A © A with A eZ’. 
It is sufficient to obtain a new isomorphism y: &#,—M, such that 
y°|Z. = $°|Z, and with the further property that y can be factored, y =o © a, 
where w: 9,— is an isomorphism with w°(Z’,) =2Z’;. Moreover, if 
each Z’; (and hence each Z;) is*resolved into its pieces of uniform multiplicity 
and account is taken of the given unitary isomorphism, it follows that it 
suffices to prove the theorem under the assumption that the rings Z’; have 
uniform multiplicity n (where n is a not necessarily finite cardinal number) 


and we accordingly assume this to be the case.” 


Now (t=1,2) Z’; is the n-fold copy of a maximal abelian ring M; on 
some Hilbert space ; (for the sake of simplicity we ignore the distinction 
between internal and external direct sum) and the given isomorphism ¢ 
furnishes us with an algebraic *-isomorphism y°: M,— M, in a natural way. 
But these are maximal abelian rings and a *-isomorphism between two such 
rings is in fact spatially implemented.* Hence there exists an isomorphism 
x: H:— K. which justifies our use of the notation y°. The desired iso- 
morphisms w and y are obtained as n-fold and 2n-fold copies respectively of x. 


5. The most general homogeneous binormal ring is the ring R of 
operators on a space # — In ® M consisting of all matrices with entries in 
Z’, where Z’ is some abelian ring on 9 containing 1. Concerning the ring Z’ 
it is well known that there is a compact Hausdorff space X such that Z’ is 
isomorphic to the algebra C(X) of all complex valued continuous functions 
on X under a map which preserves not only the ring operations in Z’ con- 
sidered as a *-ring but the norms of elements as well. This is no more than 


the usual representation theorem for abelian C*-algebras with unit, and 


*It may be noted that from this point the proof goes through even if the *-iso- 
morphism ¢° is not spatially implemented. Thus the assumption that the isomorphism 
is unitary serves essentially to align the various parts of the rings Z, having uniform 
multiplicity. It is not difficult to see that the condition of uniform multiplicity here 
“fay is equivalent to the homogeneity of the commutants in an appropriate sense. 

ee [6]. 

*A proof of this important fact has been given by Segal in [13], Part II, Th. 1 

and [14], Th. 4.1. 
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would hold if Z’ were only closed in the uniform topology for operators. The 
fact that Z’ is actually a ring of operators leads to the additional requirement 
that X be totally disconnected (i.e., possess a topological base of compact- 
open sets) and further to the condition that the closure of every open set in 
X be open ([16]). 

We conclude our considerations of binormal rings by extending this 
correspondence to a functional representation for R itself. Consider the 
collection M,(X) of all continuous functions from X to the ring of all 2 x 2 
matrices. (The latter ring is, for this purpose, considered as the ring of all 
operators on some 2-dimensional Hilbert space. The topology that it so acquires 
is obviously the same as that commonly given it by means of coordinates.) 
We can give M.(X) the structure of a normed algebra in the usual way 
by defining algebraic operations pointwise and writing || A || —sup, || A()| 
for Ae M,(X). It is a simple matter to verify that this yields a new C*- 
algebra which we also denote by M.(X). 

Now an element W of R is a matrix with entries, say, Wi; for i, 7 = 1, 2. 
The correspondence between Z’ and C(X) gives us continuous functions w 
on X corresponding respectively to Wi;. We define ®(W) (x) = (wi;(x) ) for 
Then e M.(X) and we have a mapping 6: M.(X). One 
verifies directly that @ is onto M,(X) and preserves all the ring operations 
in R (including *) and, from the uniqueness of the norm of a C*-algebra 
({7], Th. 6.4), we conclude also that ® preserves the norms of elements. 
For convenience we summarize this as: 


THEOREM 4. Let R be the homogeneous binormal ring of operators on 
H=MNOM consisting of matrices with entries in the abelian ring Z 
on IN. Let C(X) be isomorphic to Z’ under a map ¢. Then the map © 
defined by (x) = (¢(Wij) (x) is an isometric *-1somorphism (or in 
other words a C*-somorphism) between R and the ring M2(X). 


6. Binormal operators. Call a subspace & normal with respect to A 
if it reduces A and if A|¥ is a normal operator. The supremum X of all 
subspaces normal with respect to A is also normal with respect to A. This 
maximum normal subspace, together with the normal operator A|X on it, 
we shall call the normal kernel* of A. If K 0 we shall say that A is 
reduced. In terms of these concepts we may characterize the binormal 
operators which generate homogeneous rings. 


“We shall sometimes take the liberty of referring to either the subspace & or the 
operator A |* alone as the normal kernel. The exact meaning will always be clear in 
context. 
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LemMa 6.1. Let A be binormal on & and let K be its normal kernel. 
Let A generate the binormal ring R and let In and N be the subspaces of 
the unique decomposition of Theorem 1. Then K =. 


Proof. In the first place, 9 is normal with respect to A by its definition 
so that 92 C K. To prove the other inclusion, let 7 be any operator com- 
muting with A and A*. Computation shows that X is invariant under T. 
Replacing T by T* we see that K reduces 7. Write F for the projection 
of #% onto K& and E£ as before for the maximal projection in R. Then ([{12], 
Satz 5) FEeR. But FE is also of course in the commutant of R and hence 
is central. Let £—H(&) so that FF projects onto KN ¥&. It is easily 
seen that the restriction of any two operators in R to K N & must commute, 
and hence that FF is abelian. By Theorem 2, Corollary B, this implies that 
KC M and the desired relation K C IM follows. 


In other words then, the binormal operators which generate homogeneous 
rings are those which are reduced. Since the normal kernel is surely pre- 
served under unitary isomorphism, we may accomplish our goal of giving a 
set of unitary invariants for a binormal operator by solving the problem in 
the reduced case. 


7. Before turning to the general investigation it will be instructive to 
look closely at two special kinds of binormal operators which are particularly 
amenable to our approach. 


Lemma 7.1. Let T, and T, be operators on & satisfying 
= 1, 2 


for complex numbers a, and a. Then the algebra of operators generated by 
T, and T, (in the purely algebraic sense) satisfies (*). 


Proof. This is a special statement of the result ([9], Lemma 5). 


In view of this it is easily seen that an operator A is certainly binormal 
if = «A for some complex number a, for then also A** Indeed, 
it is not hard to see that A is binormal if it satisfies a quadratic equation 
aA? + BA +y=0 for a, B and y arbitrary complex numbers, but all the 
possibilities thus presented reduce readily to one or the other of the following 
distinct special cases, which we proceed to treat in order. 


a). A?—0: The assumption that A is reduced leads in this case to 
NAN* —0 where N and N* are the null spaces of A and A* respectively. 


4 
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Let R be the ring generated by such an A. Then (employing the notation 
introduced in Theorem 2) R consists of all 2 X 2 matrices (Z;;) with entries 
in the ring Z’ on 9m. The operator A itself we shall write as the matrix 
(Ai). Using Theorem 4, we obtain an isomorphism between R and a ring 
M.(X) under which A corresponds to an element A(-) whose value at each 
point eX is the matrix A(x) = (aj(r)) with A(z)*=0. (Here and 
throughout, if Z eZ’ we shall denote by z the continuous function in C(%) 
corresponding to it, and if B = (Zj;) is in R, the corresponding element of 
M.(X) will be denoted by B(-) so that B(x) = (ai,;(x)) for xeX.) For 
every x the trace and determinant of A(z) are both 0. Now set 


A,,:* Az* A;:* Aso*\ - 
Since the operators C,C,* = C,*C, and C.C.* = C.*C, are central in R, the 
projections, says F, and F2, on their respective null spaces are also central, 
and correspond to compact-open sets LZ, and LZ. in X. (More exactly, F; 
corresponds to the function in M,(X) whose value is the matrix 1 at every 
point of LZ; and 0 at every point of X —JL;.) The subspaces onto which F, 
and F, project (the null spaces of C, and C, respectively) we shall call &, 
and ¥. Write C; in polar form, C;= |C;|U; for i=1,2. (The absolute 
value of a normal operator is the positive square root of the product of it 
with its own adjoint.) The operator U; is 0 on the subspace &, (which 
reduces it) and operates unitarily on # © ¥&; and so corresponds to a func- 
tion U;(-) ¢ Mz(X) which is 0 on ZL; and unitary valued on X — Jj. 
Now define 
Ui(z) Ci(z) 


These correspond to the operators U = U, + F,U, and C=C,+4+ F,C, in R 
and C =|C|U is the polar resolution of C. The function U(-) is unitary 
valued except on L, Lz. But on this set, Ci(x) =C2z(x) =0 so that 
aij(x) =0 for i,j =1,2 and A(x) =0. This means that F,F,A = 0 and 
that ,M &- is normal with-respect to A. Since A is reduced we conclude 
that L, N L. = 0, or, in other words, that U is unitary. In particular, C is 
non-singular.° 

Now denote by B the operator UAU*. We have CC*B = CAC* and it 
follows that CC*B has the form Tri(0,*,0) since C(x)A(x)C(x)* has this 
form whether ze L, or not by the way C, and C, were chosen. But then also 


5 A non-singular operator is one with a trivial null space. 
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B=Tri(0,G,0) for some GedZ’, since CC* is a non-singular matrix 
Diag(P, P), which implies that P can be cancelled in the matric equation 
for CC*B. 

Since B must be reduced also we learn that G is non-singular. Hence, 
in the polar resolution G=|G@|W, W is unitary and the operator 
U’ =Diag(1,W) is unitary in R. But U’BU’* =Tri(0,|G@|,0). Thus 
by a slight modification in the unitary operator U we can arrange for B to 
have the property that G > 0, and we accordingly assume this to be the case. 

Now polynomials in B and B* are matrices whose entries are polynomials 
in G. A continuity argument shows that every element of the ring generated 
by B, which is of course R itself, has its entries in the ring on9n generated 
by G. This in turn implies that G generates Z’, and hence that 


G? 0 
B*B + BB* 0 

generates the center Z of R. This central element may also, of course, be 
written A*A + AA*. Thus A*A + AA®* generates the center Z of the ring 
R generated by A. 

Now suppose given reduced operators A, and A, on spaces #, and #, 
respectively, such that both A, and A, have square zero. Suppose moreover 
that there exists an isomorphism ¢: #, > &, with 


Let A; generate the ring of operators R; with center Z, i—1,2. Then 
¢°(Z,) = Z,, and, emplyong Theorem 3, we may replace ¢ by an isomor- 
phism such that not only is y°(A.*A, + A,A.*) = A,*A, +4,A,* and 
¥°(Z.) = Z,, but also y°(R.) = R,. We now make R, correspond to a ring 
M.(X) as above. As we have seen, there are unitary elements U; (i = 1, 2) 
in R, such that U,A,U,* and U°(A2)U.* correspond to elements G;(-) 
in M,(X) with = Tri(0, gi(v),0), 20 for Since also 


Diag = Diag(g2? (x), g2”(2)), 


we have g,(2) = g2(x) for every so that G,(-) = G.(-). Thus U,A,U,* 
= U.y°(A.)U.*. The appropriate combination of the isomorphisms U,, U, 
and y° yields a unitary equivalence between A, and A». 

A simple argument frees us from the assumption that A is reduced and 
we summarize as follows: For an operator A with A? —0 a complete set of 
unitary invariants is furnished by the unitary equivalence class of the self- 
adjoint operator A*A + AA*. It may be remarked that in the reduced case 
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the two operators A*A and AA* are themselves unitarily equivalent so that 
we could go a step further and use either of them in determining an invariant 
for A*A + AA* and hence for A. It is easy to see, however, that this 
assertion fails in the presence of a non-trivial normal kernel 


b) A*?—A: We assume once more that A is without normal kernel. 
The latter in this case is clearly a projection. The ring R generated by A 
may be represented on M,(X) under a correspondence which carries A onto 
a function A(-) with A(x)*= A(x) everywhere on X. The matrix A(z) 
can have only 0’s and 1’s for characteristic roots so that its trace ¢(2x) is 
either 0, 1 or 2. Since ¢ is continuous the subset on which ¢ takes each of 
these values is compact-open. But let d(z) denote the determinant of A(z). 
Then A(x)? — t(x) A(x) + d(x) = (1 — t(x)) A(x) + d(x) = 0, so that the pos- 
sibilities t(z) — 0 and t(x) = 2 yield respectively the relations A(z) = = d(z), 
and the sets on which these two relations hold correspond to subspaces which 
are normal with respect to A. It follows that ¢(2) = 1 identically and hence 
that d vanishes identically. Let us again set B = UAU* where U is the same 
unitary element defined in a). In the present case it is readily seen that the 
element of 1/,(X) corresponding to CC*B = CAC* has the form 


90) 


where, as before, CC* = Diag(P,P) and it follows that B = Tri(1, H, 0) 
for some operator He Z’. As before we may assume that H > 0 since it is 
non-singular. 

By a repetition of the argument used above we conclude that the ring 
Z’ on IM is generated by the single operator H (1 being in this latter ring 
automatically). Since in this case 

—H* 0 
(B— Br)? —( 0 

(B — B*)’ = (A — A*)?* generates the center Z of R, and, proceeding exactly 
as before, we find: A complete set of unitary invariants for the idempotent 
operator A is furnished by the unitary equivalence class of the self-adjoint 
operator (A—A*)* (or of A—A*) together with that of the projection 
which is the normal kernel of A. 


8. In these two examples the procedure has been to put the information 
we have about binormal rings and what we know of 2 X 2 matrices together. 
In approaching the general case in this spirit we shall need a solution of the 
special problem of the unitary equivalence of 2 X 2 matrices. 
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We begin by assuming that the matrix C is not normal. Let b be a 
characteristic root of C. It is a simple matter to find a unitary matrix U 
such that UCU* has the form Tri(a,c,b) with (0,6) in the lower row. 
The entry a is then the other characteristic root and is fixed, but with these 
three entries unchanged the matrix U may be varied so as to replace c by ce‘? 
with arbitrary ¢. In particular, c can be made real and positive. With ‘this 
restriction, along with the condition that the lower row be (0,0), the matrix 
equivalent to C is uniquely determined. Similarly C is equivalent to another 
matrix Tri(b, d,a) with d > 0, but computation shows that |c|—|d|, or 
c=d. This proves: 


Lemma 8.1. If C is any 2 X 2 complex matrix then there is a unique 
non-negative real number r such that C its unitarily equivalent to a matria 
Tri(a,r,b). The diagonal entries are then the characteristic roots of C im 
some order (so that while this is not quite a canonical form, there can be at 
most two choices for it). A complete set of unitary invariants for C is 
furnished by the number r and the trace and determinant of C. 


Let us now define (for A an element of an arbitrary C*-algebra) the 
functions : 


fi(A) [A, f2(A) [A?, A*)?, fs(A) ([A, A*] + [A’, A*])?, 
fs(A) (A [A, A*))?, 
where [A, B] denotes the commutator BA — AB. 


Lemma 8.2. Jf A is a 2X2 complex matrix then f;(A) is a scalar 
matric for 1=1,---+,4. Two non-normal 2 X 2 matrices A and B are 
unitarily equivalent if and only if fi(A) =fi(B) for all i. 


Proof. Since the square of a 2 X 2 matrix with zero trace, and hence 
in particular of a commutator, is scalar, the first part of the assertion is 
readily established. It follows that the functions f; are constant on unitary 
equivalence classes, which proves the necessity of the condition of the second 
part. Also, to show sufficiency, we may replace A and B by any matrices 
unitarily equivalent to them, assuming for instance that A = Tri(a,, 11, b,) 
and B = Tri(ds, 72, Computation shows that 


f(A) =t°fi(A), =(+1)*7,(A), f(A) = (A) 


where ¢, and d, are the trace and determinant of A and where f,(A) 
=r?(r,? + | a,—b, |*). The analogous relations hold for B. 
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Now since we assume explicitly that A and B are not normal f,(A) = f,(B) 
= 0, and we may divide the other qualities by this one, obtaining the relations 
t, —t, and d, —d.. This means that A and B have the same characteristic 
roots, and, in particular, that |a,—b, |*—|a,—b.|?. From this and 
from f,(A) = f,(B) it follows that r; = and the lemma is proved. 


9. Before stating the final theorem we have need of one more pre- 


liminary result. 


Lemma 9.1. Jf Band C are any elements of an abelian ring of operators, 
then there is an operator X in the ring satisfying X? + BX +C=0. 


Proof. In the special case B = 0 this amounts to finding a square root 
of a normal operator. In the general case we have but to complete the square 
as in the familiar quadratic formula. 


THEOREM 5. Jf A is a binormal operator then the operators f;(A) for 
t=—1,---,4 are normal and mutually commutative. If A, and Az are 
reduced binormal operators then they are unitarily equivalent if and only if 
there exists an isomorphism q such that for all « $°(fi(Az)) = fi(A1). 


Proof. It clearly suffices to prove the first assertion also under the 
assumption that A has no normal kernel, while, in the second part, we again 
need only concern ourselves with the sufficiency of the condition. Let the 
reduced binormal operator A on & generate the ring R with center Z. Then 
employing Theorems 2 and 4 as before we may assume that ®% = In © MN, 
that R consists of all 2 X 2 matrices with entries in the abelian ring Z’ on 
n, and that R is isomorphic to M.(X) where X is the representation 
space of Z’. The given operator A appears as, say, the matrix (Aj;). Also, 
the operators f;(A) correspond to elements H;(-) in M.(X) for which 
H(z) =fi(A(z)) for all z This means that H;(-) is scalar valued every- 
where and hence that f;(A) e Z fori —1,--+,4. Thus the first part of the 
theorem is proved. 


We next take up the problem of reducing the matrix (A;;) to a simpler f 


form, exactly as was done in §7, but taking here in the general case 
some extra care which was not needed there. Write T= Aj, + Ao. and 
D = Ay;Ae2 — Ay2An1.° By Lemma 9.1 there is an operator Z,¢Z’ such 


that Z.2—TZ,.+ D=0. To facilitate matters later on we shall insist F 


* These might conveniently be called the trace and determinant of A. It is seen — 


that T ®T e Z is the center valued trace of A in the sense of Dixmier [3]. 
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further that Z. lie in the ring generated by JT and D on Yn. This time we 
write 
Aoi Zo Ay Ze — Ags 


It is easy to see that C; is normal in R and that C,C;*eZ for 1—1, 2. 
Hence, again, if F; projects & onto the null space of Ci, F; is central and 
corresponds to a compact-open set L; C X. On L, we have of course a2; = 0 
and == 22 while on the relations a;, and dz. = 22 hold identically. 
Thus F,F.A is normal and Write Cj = |C;| U; in polar form 
for i=1,2. The element U;(-) ¢ M@.(X) is unitary valued on X — J; and 
vanishes identically on L;. Hence if we define U(x) to be U,(x) on ¥—LD, 
and U.(xz) on Ly, the operator U = U,+ F,U.,eR corresponding to U(-) 
is unitary. Clearly also C= |C|U is the polar decomposition of C where 


_C=C,+ FC, corresponds to the function C(-) with C(x) =C,(z) for 


L, and for ve Ly. 

Now let B=UAU* and choose PeZ’ so that the central element 
CC* = C,C,* + F,C.C.* = Diag(P, P). The exact form of p may readily 
be found and direct computation discloses that the function corresponding 
to CC*B = CAC* has the form Tri(*, *, p(x)z.(v)) at every point of X so 
that CC*B = Tri(PZ,, PR, PZ.) for certain operators Z, and R, whence, 
cancelling P once again, B= Tri(Z,,R,Z.). The operator Z, is clearly 
T—Z,. Moreover, the operator R can be assumed to be positive definite as 
before. Thus the general reduced binormal operator A is unitarily equivalent 
in the ring which it generates to an operator B = Tri(Z,, R, Z,) with R > 0. 

We shall now show that the center Z is generated by the four operators 
fi(A). Since these are central we have f;(A) = Diag(M;, Mi) i=1,---,4 
for certain M;,eZ’ and it suffices of course to show that the M; generate Z’. 
Also B= Tri(Z,,R,Z.) generates R too which implies that Z,, Z, and R 
generate Z’. Finally, since fi(A) =fi(B), the M; are, in natural order, 


Q, DQ 


where Q = R?(R? + |Z,—Z,|*). Now Q is non-singular (if it were not 
we would once more be able to produce a subspace normal with respect to B), 
so if Y reduces both Q and DQ it must reduce D. In other words D is in 
the ring, which we may temporarily call Y’, generated by the M;. Similarly 
T? and (TJ + 1)? are in Y’ whence 7'e Y’. But then also Z,,Z,¢Y’. Finally, 
along with (|Z, — Z.|? + 2R*)? = |Z, —Z.|* + 4Q, Y’ contains the positive 
square root |Z, — Z.|? + 2R? and we are able to conclude that Re Y’ and 
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Now let us suppose, as in the hypothesis of the theorem, that A, and A, 
are reduced binormal operators and that there exists an isomorphism ¢ with 
$°(fi(Az)) =fi(A1) for =1,---,4. By making a familiar application 
of Theorem 3 it is easy to see that we can reduce our problem to the special 
case which arises when “A, and A, generate the same ring R and f;(A1) = fi(A.), 
We then map R onto M.(X) by Theorem 4. Denote by M the set of points 
at which A,(z) and A2(x) are unitarily equivalent. Denote also by H;(-) 
the element of /.(X) corresponding to f;(A:) = fi(Az) and let 


(2) = 


We know that A,(z) and A,(a) are normal if and only if f,(7) =0. Since 
= fi(A2(x)) for 1=1,---,4 and for all 2, it follows from 
Lemma 8. 2 that if f,;(7) #0 then ze M. But the operators A; are reduced 
and therefore the functions A;(-) cannot be normal on an open subset of %, 
whence it follows that M is dense. 


Now if t:(z) and d;(z) are the trace and determinant respectively of 

A;(x) then ¢,(7) and d,(z) =d.(x) for re M. Since M is dense 
, =t, and d, = d, and therefore the pair of operators T and D above is the 

same whether constructed from A, or As. Hence for a suitably chosen Z, 
satisfying TZ. + D=0 and for Z, + Z,=T we see that A, and A, 
are unitarily equivalent in R to operators B, and B, respectively with 
B, = Tri(Z,, R,, Z.) and B. = Tri(Z,, R., Z2) where R, and R, are positive. 
We will complete the proof by showing that R, = R,. 

Let f:(z) #0. Then ze M and the matrices B,(z) and B,(x) are uni- 
tarily equivalent. Also, r2(x) 20. By Lemma 8.1 
Since this holds on a dense subset we have 7; =r. and the theorem is proved. 

In the course of this proof it has appeared that the reduced binormal 
operator (A,;) which generates the ring R is unitarily equivalent in R to a 
matrix Tri(Z,, 2, Z.) with positive definite R. This is, in general, far from 
being a canonical form since, as we have seen, Z, can be taken to be any 
solution of a fixed quadratic operator equation. It is clear, however, that 
with the choice of Z. once made the other entries in the matrix are uniquely 
determined. 

It should also be noted explicitly that by adding one more term to the 
list of conditions of this theorem we can easily extend our result to include 
binormal operators which are not assumed to be reduced. In fact, if A, and 
A, are binormal operators with normal kernels N, and N, respectively, it 
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follows at once that A, and A, are unitarily equivalent if and only if there 
exist isomorphisms ¢ and yw such that 


1) ¢°(fi(A2)) =fi(Ar) (t= = 


(In general, of course, ¢ and y are defined on different Hilbert spaces.) 
In the special cases treated in § 7, where it was found that a single operator 
suffices to generate the center of R, we may expect to find relations among 
these functions f;, and indeed it is easy to verify that if A? 0 we have 
f(A) =f2(A) =0 and f;(A) =f,(A) = (A*A + AA*)?, while, on the 
other hand, if A* A the relations f,(A) = 0 and 4/;(A) =f.(A) =f (A) 
= (A — A*)*— (A — A*)? obtain. Thus the results of the special cases 
are implied in the general result but in a relatively complicated form, as 
could be expected from the fact that we have had to raise the degree of the 
polynomials f; to take care of the most general 2 X 2 matrix. 


10. Conclusion. Dixmier [2] has given a solution of the problem of 
the simultaneous unitary equivalence of two projections, or, what is the same 
thing, of the unitary equivalence problem for operators of the form EF + iF 
where # and F are projections. Now the operator # + iF generates the same 
ring as do F and F together, and hence is binormal by Lemma 7.1. We shall 
now use this fact to solve the problem anew, arriving at essentially the same 
result as that announced in [2]. 

Let E and F be projections on #. The first step is to reduce the problem 
to the case where the ring R generated by EF and F is homogeneous, and since 


_ the non-normal part of R cannot here be identified in terms of a normal 


kernel, we must describe the subspace 9% directly (cf. Theorem 1). The 
projection G of & onto 9n is, as we know, the sum of P, the maximal central 
abelian projection in R, and the annihilating projection for R, which is 
readily seen to be (1—H)A(1—F). The first problem before us then 
is to identify P. 

To begin with, a projection H in the center Z of R is abelian in R 
if HE and HF commute. In particular then FEAF, EA(1—F) and 
(1— F) \F are abelian projections. Let us denote by Q the central abelian 
projection 


(EAP) ((1— £)AF) (EA (1— F)) 
EAF +(1—B) AF + EA(1—P). 
Then of course @Q <P. But on the other hand 


1= EF + (1—E£)F+E(1—F) + (1—E) (1—F) 
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and, making use of the properties ascribed to P, we see that 
P = P(EAF) + P(A— B)AF) + PEAA—F)) + P(A — — 
— PQ 


(note that Pe R implies P((1— F) \(1— F)) =0) whence P=Q. Thus 
we have proved that P = (FEAF) + (1— AF + EA (1—F), and hence 
also that 


G=(FAF)+(1—B)AF+ F). 
We assume accordingly that H and F are projections on & for which 
EAF=(1— B)AF = F) =0 


and we proceed to investigate the circumstances under which F and F are 
simultaneously unitarily equivalent to another pair of projections, say H, and 
F,, for which this requirement is also fulfilled. (The reader will perceive 
that this initial simplification of the problem is exactly the one made by 
Dixmier. In the terminology of [2] we are assuming that the subspaces onto 
which £ and F project are “in position p” relative to one another.) The 
ring R is then a matrix ring which we represent on M.(X) in the usual way. 
The functions H(-) and F(-) corresponding to the generating projections 
E and F are projection valued at every point. Let ¢(2) denote the trace of 
E(x). Then ¢ can take only the values 0, 1, and 2, and the relations ¢(7) =0 
or ¢(x) = 2 correspond to the situations H(z) and H(z) =1. Since 
the set of points for which H(x)F (x) A F(x) E(x) must be dense we con- 
clude that ¢ = 1, and hence that H(z) is singular everywhere. It is now a 
simple matter to construct a unitary function U(-) such that 


U (2) E(2)U — (2) = (j + 


for every xe X. (We may in fact use the same function defined in the first 
example of §7.) Then 
PF a(x) c(x) 

The function a is real and ce ~a—a?*. Let y be a continuous function on 
# such that |c(x)|-y(x) =c(z). (That such a y exists can be shown a 
usual by returning to the ring Z’ which is isomorphic to C(X) and expresssing 
the operator C corresponding to c in polar form.) If c(z) ~0 we must have 
| y(z)| 1, and since c cannot vanish on any open set we conclude that } 
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has absolute value 1 everywhere. Thus the function U’(-) defined by 
U'(z) = Diag(1, y(x)) is unitary in M,(X), and writing 


F’ (a) =U’ (a) F’ (x) U’ (2)* 


a(z) | e(2)| 
= (100) 


while = U’(x)E’(x)U’(x)* = E(x). Since | c(x)| = (a(x) — a(a)?)A 


one verifies that for every x 


(B"2) — 1° = + 
from which it follows as usual that (H + /—1)? generates the center Z 
of R. By pursuing the same line of reasoning as we have used in each 


we see 


previous instance we arrive at the result: Jf HL; and F; are projections on 
Hilbert spaces for i 2 and if 


E,AF, = = Fi) =0 


then E, and F, are simultaneously unitarily equivalent to E, and F, (order 
is of no importance) if and only if the self-adjoint operators (ZH, + F,—1)? 
and + F,—1)? are unitarily equivalent. 


il. In connection with the treatment of binormal rings and operators 
given above certain questions naturally come to mind. Among these, most 
notably, are the following: 


1) How, and to what extent, can this work be generalized to include 
rings which are n X nm matrix rings for n > 2, and the operators 
which generate such rings? Can one go on at once to consider 
“trinormal * and, more generally, “n-normal” operators? _ 


2) To what extent can the set of unitary invariants given in Theorem 5 
be simplified ? 


With regard to 1), the fact is that there is nothing in § 2-§ 5 in which the 
number 2 plays any essential role. If a ring is subjected to any polynomial 
identity whatever, the above line of reasoning can be pursued with only 
unessential changes. The relevant facts are as follows: If R is a ring on # 
satisfying a polynomial identity, then for some (finite) cardinal number n 
(depending on the polynomial identity) R has a direct factorization 
@®---@R, corresponding to a decomposition = H,. 
The ring R, on #, is the normal part separated out in Theorem 1 and, for 
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> in the sense that, in R,, 


each j > 1, R; on &; is “ homogeneous of order 7’ 
1 is the sum of 7 orthogonal, equivalent, abelian projections. For each homo- 
geneous part R; the analogs of Theorems 2, 3 and 4 hold, and the proofs need 
scarcely be modified. (In particular, Theorem 3 provides a set of unitary 
invariants for rings satisfying a polynomial (or finite rings of type J in the 
terminology of [8])). Also, n can be chosen at will by making an appro- 
priate choice of polynomial identity, just as (*) was selected to make n = 2, 
and hence it is possible to give definitions of n-normal operators for any 
finite n. It would appear that for each n the solution of the unitary equi- 
valence problem awaits only an adequate and applicable solution of the 
problem for the special case of n X n matrices. It should be observed at this 
point, however, that a straightforward generalization of the method we have 
used in the case n= 2 would involve setting forth a suitably separating 
collection of polynomials on the matrix ring which take their values in the 
center. As of this writing no such center valued polynomials are known to 
the author in any case n > 2. 

If, on the other hand, we drop the assumption that R satisfy a poly- 
nomial identity, and turn rather to the direct assumption that there be an 
adequate supply of abelian projections (assuming, in other words, that R 
is type I but not necessarily finite) the case for the results of this paper 
disintegrates at once. The above mentioned analog of Theorem 1 holds still, 
but of course n need no longer be finite. (A ring is “ homogeneous of order 
8” if 1 is the sum (supremum) of § equivalent, orthogonal, abelian projec- 
tions in it.) The very meanings of Theorems 2 and 4, depending as they do 
on matricial concepts, are in serious jeopardy, while Theorem 3, which is 
meaningful, is, unfortunately, false. Indeed, it is not hard to construct a 
non-trivial factor on N-dimensional Hilbert space which is homogeneous of 
order 8 (provided, of course, that & is infinite).? These rings have been 
successfully investigated in [13], Part II, where a set of unitary invariants 
is given which does not depend on finiteness, and more recently in [6] where 
it is shown, among other things, that the algebraic analog of Theorem 3 
Yemains true. 

With regard to question 2), we shall content ourselves with showing by 
an example that in the presence of any non-trivial center it is not possible to 
do away with the simultaneity of the unitary equivalence of f,(A) and f;(B). 
Let ¢; and d; be any complex numbers for i—1,2 and let a; and b; be s0 
chosen that a;+ b;—=#; and ajb;—=d;. Choose also numbers p; and q; for 


7 On the direct sum of & copies of a 2-dimensional space XK let R be the commutant 
of the &-fold copy of the ring of all operators on X. 
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i=1,2 such that p,tq,—t, and pig.—d, while p.+q.—t, and 
pog2=4,. Let m be any positive number exceeding the four quantities 
| a, |? and | p,— qi |*. Then there exist positive real numbers 7; and 5 
such that, for = 1, 2, + | — |?) = + | pi— qi |?) =m. Let 
7,= Tri(a, ri, b;) and Y; Tri(pi, qi) and define the two binormal 4 4 
. matrices A and B by 

Z, 0 0 
If we make the further assumption that t, At, and d,~d., we may assert 
that A and B are not unitarily equivalent, and similarly that Z, is not 
witarily equivalent to Z., nor Y, to Y., from which it readily follows that 
A and B generate the same ring (all Diag(C,D) where C and D are any 
2X 2 matrices) whose center is represented on a 2-point space. Clearly 
neither A nor B has a normal kernel. But though they are not unitarily equi- 
) valent, it is an immediate consequence of the construction that f;(A) = fi(B) 
for i = 1, 2,3 while the other functions, f,(A) = Diag(d,m, d,m, d,m, d,m) 
and f,4(B) = Diag(d.m, d.m,d,m,d,m) are obviously unitarily equivalent. 
: Finally, a slight variation of this example can be used to show that, in 
proving that two reduced binormal operators which generate the same ring are 
unitarily equivalent in the ring, one cannot avoid making some restrictions 
! (such as requiring that certain functions of them agree) which serve to exclude 
the intervention of an isomorphism corresponding to a homeomorphism of 
the representation space. For, in the above construction, with t,t, and 
) d; A dz, it is readily seen that A and B are unitarily equivalent, but that they 
} generate in common the same ring as before which obviously cannot contain 
U carrying A onto B. 


Rice INSTITUTE. 
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ON THE ALGEBRAS FORMED BY THE CAYLEY-DICKSON 
PROCESS.* 


By R. D. ScHAFER. 


The iterative process by which L. E. Dickson obtained generalizations 
of Cayley’s 8-dimensional non-associative algebra from the 4-dimensional 
algebras of generalized quaternions can be used, as A. A. Albert has shown in 
[1], to obtain central simple non-associative algebras {; of dimension 2* 
(t = 2,3, 4,-- -) over an arbitrary field F. 

Let % be an algebra of dimension n over F with identity element 1, and 
let an involution «— @ be defined in % satisfying 


(1) rt = iz = n(z)1, t(z),n(v) in F. 


Then, for a fixed y ~0 in F, an algebra of dimension 2n over F 
may be constructed as the set of all pairs z= (2,y) for z,y in Y%, with 
the obvious addition and multiplication by scalars, and with multiplication 
defined by 


(2) = (Ly, Y1) (Xo, Yo) = + + 


The pairs (z,0) may be identified with the elements of %, and the element 
1=(1, 0) is an identity element for 8. The mapping z—=(z, y) >2=(&, —y) 
is an involution of satisfying z+ 2Z—=t(z)1, where 
t(z) =t(x), n(z) =n(x)—yn(y). Beginning with a two-dimensional 
commutative associative algebra 9%, which is either a separable quadratic 
extension of F' or the semisimple algebra F @ F, and in which z—>7Z is 
the unique non-trivial automorphism, one obtains in this way a class of 
2'-dimensional algebras - *,ye} over F. We write 
B= Mer = = Wily: yeu}, and it is clear that in our nota- 
tion = yi} and = ++, ye} imply We Wi ---, y}- 
The best-known cases of the algebras 2%; are the algebras O = %, of generalized 
quaternions, and the Cayley-Dickson algebras © = %;. 

In this paper we derive first some elementary properties of the algebras 
Y,. Our chief purpose is to compute their derivation algebras D(M;) for 


* Received June 2, 1953. 
*This research was supported in part by a grant from the National Science 
Foundation. 


435 


7 
| 


436 R. D. SCHAFER. 


t>3. The derivation algebras D(%M,) for 42 and 3 are well-known. 
®(Q) is isomorphic to the 3-dimensional Lie algebra [0,0], while any 
& = D(C) of characteristic ~ 2 is 14-dimensional and, for F of characteristic 
0, is a central simple Lie algebra of type G@ in the sense that &o is the 
exceptional simple Lie algebra G, over ©, the algebraic closure of F [8]. 
We show, for ¢=3 and characteristic 42,3, that D(W:{y}) = D(M,). 
Hence, for all ¢ > 3, D(W+) = D(Ms) is 14-dimensional, a central simple 
Lie algebra of type G in case F is of characteristic 0. 


Added in the proofs (January 20, 1954). I. Kaplansky announced at 
the American Mathematical Society’s 1953 Summer Institute on Lie algebras 
and Lie groups that D(%3;) is central simple for F' of characteristic ~3 
(and not simple for characteristic 3). If we say that these 14-dimensional 
central simple Lie algebras of characteristic 43 are of type G, we can 
make the obvious changes in our statements above and in the statement of 
Theorem 4. 


1. Elementary properties of the algebras M;,. It is well-known that 
the algebras Q —., are associative, and that the algebras © =, are 
alternative? but not associative. Albert has shown [1, p. 172] that 
Wir = W:{y} is alternative if and only if %; is associative. Hence the 
algebras 9%; are not alternative for ¢ > 3. However, we shall show that all 
of the algebras 9%; are flexible; that is, 


(ry)x = x(yz) for all y in 


The right and left multiplications R, and L, of an algebra % are the 
mappings y—> yx = yR, and zy = yL, respectively. If S: 
is an involution of 2, then S?—J and 


(3) RS LS for every in 
The condition for flexibility of an algebra 2% is 

(4) L,R,=R,Lz for every in 
Linearized, this identity becomes 

(5) LR, + L,R, = RL, + for all z,y in 


2 The identities = a(ay), yx? = (yx)@ are satisfied. 


| 


™m 


| 
| 
( 
( 
fi 
Pp 


ON CAYLEY-DICKSON ALGEBRAS. 437 


By (2) matrices for the right and left multiplications of M41 = WU:{y} may 
be written in terms of those for %; as: 


A L SL 
( yLys Ras ySR, Rk, 
Lemma 1. If an algebra Y; is flexible, then 
(7) RygRy = Lyglhy = L,Lyg = RyRyg for every y in 


For, applying (5) to ¥, we obtain = y(yr) + n(y)a, 
or But Lysl,=t(y)L,—L,? =L,Lys, and similarly 
= R,Ryg. 


LemMa 2. If an algebra Y; is flexible, then 
(8) t(ry) =t(yz), t((va)y) = t(x(ay)) for all x, a, y in 


The first of these equalities does not require flexibility. To prove the 
second, we use (5) to compute 


t((wa)y) — t(x(ay)) = (2a)y + —a(ay) — 
= a([Lz, Ry] + Ros, Lys]) = a(I + 8) [Le, Ry] 
= t(a)(ry— zy) = 0. 
THEOREM 1. The algebras A; are flexible for every t. 
This is known for ¢ = 1, 2,3. Proceeding in a proof by induction on f¢, 


we show that [R., L,| 0 for every z in %.1 = U:{y} from (6) as follows. 
We verify that 


(9) [Rz, Le] + y(LySRy — = 0, 
(11) y(Lyele + RosSRy — SRyR,— Relys) = 0, 
(12) — + Be] 0 


for all a, y in %;. Now (9) follows from (4), (3), and (7). Also 
L,R,— + S(Legly L,Rzs) (I + S) (L,Rz — L,Ly) 0 


Since, applied to any a in %;, it gives t(a)(yx—yr) =O, proving (10). 
Moreover, 


— + — LysLes)S = (Lysle — (I + 8S) = 90 


since, applied to any a in %;, it gives ¢(a(ya) —Yy(ar)) =0 by (8). This 
proves (11). Finally (12) follows from (7) and (38). 
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The center of a non-associative algebra 2 is the set of all elements c in 
satisfying 
(13) re = cx for every x in Y, 


(14) (cx)y=c(xy), (rc)y=a(cy), (ry)c =a2(yc) for all z,y in Y. 


The elements a1 are elements of the center of any algebra %;. Furthermore 
F1 is the center of M; for ¢=2. This is well-known for the quaternion 
algebras QO —, and the Cayley-Dickson algebras © = Y;, and for t>3 
is a consequence of the following lemma. 


Lemma 3. Let c be an element of an algebra for t= 3. Any one 
of the following conditions is sufficient for c to be in F1: 


(13’) R, = 
and, for every z in XU, 
(14a) LR, = (14b) RR: = Re, (14c) = Ree. 


For the Cayley-Dickson algebras 2, the equivalence of (14a), (14b), 
(14c) follows from the alternative law. Also it is well-known in this case 
that either (13) or (14) suffices for c to be in F1. Assume that any one of 
the four conditions on an element of %; for ¢ = 3 guarantees that it is in F1. 
Take c= (a,b) in M41 = U:{y}. Then (13’) and (6) imply in particular 
that Ra = Iq, Ing = SRy, Ly = SLy. Then we have a= al by the assump- 
tion of the induction. Also = implies = £1, so that BJ —BS. Then 
B=0, c= (a1,0) in Fl. Next (14a) and (6) imply in particular that 
= Slylys = L,SR, for all z,y in Then a—al by the 
assumption of the induction. By (3) we have 


(15) Ly Lys = Rygky for every y in 


Hence Ly = Ry so that b= B1. Then (15) becomes BLys = BR,g for every 
y in U,. Since there exists y in YM; such that Lys ~ Rys, we have B = 0, and 
c= (1,0) in Fl. Similarly (14b) or (14c) implies that ¢ is in F1. 

It follows from (1) that 2?—t(x)¢-+ n(x)1—0 for every z in %;. 
We say that %; has degree two. The algebras 2; are power-associative, since 
it is easy to see that every non-associative algebra of degree two is power- 
associative. 


We assume throughout the remainder of the paper that the characteristic 
of F ts not two. Then %, itself is formed by the Cayley-Dickson process 
from %,—F where €>é—é€ is the identity transformation on YM. For 
we have = Wo{yi}, yi 40 in F. 
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A normalized basis for 2[; over F may be chosen as in the known cases 
t=1,2,3. Let m=2'. Beginning with a basis 1, f, for 2%, in which 
f= %1, % —y,~0, it follows readily by induction that there is a basis 
1, fos’ for satisfying 


(16) f? = al, a,540 in F, i—1,+ --,m—1, 
and, for 1 j, 

(17) fifi = — fifi = Bute, By~O0 in F, 
the elements a; and depending on y, - ye, and f, being an element 


(~ 1, fi, f;) of the basis uniquely determined by f, and f;. If «—&1-+ > &fi, 
then £.1— DS éifi, t(x) = 2%, n(x) = &?— Dd In our notation, 
if — and r— 2! — 2m, then for i—=m,m-+1, 

-+,r—1, we have f; = (0, fim) if we make the convention that f, = 1, 
and we also have 


It follows from (16) and (17) that the algebras 2; are Jordan-admissible, 
for the algebra in which multiplication is defined by y= + yz) 
is a special central simple Jordan algebra of degree two. Thus the algebras 
Y, are flexible central simple Jordan-admissible algebras of degree two.® 

The following lemmas will be used in the next section to prove Theorem 3. 
Although 2; is not alternative for ¢ > 3, the alternative law is satisfied by all 
pairs of elements in the normalized basis: ff; = fi(fif;), fif? = (fifi) fi, as 
may be seen from the following lemma. 


Lemma 4. In any algebra with basis 1, f:, fo: +5 fm+ satisfying 
(16) and (17), we have 


(19) fi(fix) = ay = (afi) fi for every x in %. 
It is sufficient to prove 
(20) fi(fifi) = afi = (fifi) fi for 1A}. 


Now (17) implies (fifj)fi+ (fif)fi—=O and fi(fafs) + filfifs) =0. Sub- 
tracting, and using Theorem 1, we have fi(fif;) = (fifi)fi. We prove 


*In [2, p. 588] Albert studies flexible central simple Jordan-admissible algebras 
of degree two. We note that this class includes the non-commutative algebras Q% such 
that Mx = M,(X)x for some extension K of F, where %;(A) is the algebra which is 
the same vector space as 9, for t=2 and in which multiplication is defined by 
Gey = + (1—A)ye@ for 1/2. 
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fi(fif;) = af; by induction on ¢, assuming (20) for Then in = 
there are three cases left to consider: (i) 1Si=m—1, mSjSr—1,; 
(ii) mSisSr—1, 1SjSm—t}1; (iii) mStSr—1, 
In case (i), we have fi(fifj) = (0, (fi-mfi) fi) = (0, = aif; In case 
(ii), fi(fif;) = (y (film) fem 0) = af; by (18), and case (iii) holds similariy, 


Lemma 5. Let fw—af; for some fixed i, 1StSm—1. Then 


z= ol + éifi. 
For «= £1 + > éf; implies 
j 
[fi 7] = fi] =2 = = 0 
by (17). Then = a1 ~0 implies éf; —0, or = &1 + Efi. 


For any algebra %f;, 4¢(z¥) is a nondegenerate symmetric bilinear form 


g(x,y) and we may define, for any linear transformation T on Y%;, its 
transpose T’ (relative to g) by g(aT, y) = g(x, yT’) for all x, y in M%;. For the 
unique algebra ©, = %){—1,---,—1} in which y, = yz =: 
we have g(a, y) = > éim so that the matrix of 7’ relative to the normalized 
basis of €; is the transpose of the matrix of 7. Then T is symmetric (skew- 
symmetric) if and only if its matrix is symmetric (skew-symmetric). 


Lemma 6. For any Y; we have 


(21) R/ — R:s, — Lg. 
For, 


g(x, yR.’) = y) = = = y2) = 
implies yR,’ = yRzs, or = Similarly = 


Finally we remark that in YW; (¢ 22) an element is a sum of commu- 


tators if and only if its trace is zero, so that any element x of %; (¢ 22) 


may be written in the form 


(22) Eol + >> [ ai, bi, Ai, b; in Ws, t 2. 


It is sufficient to show that any basal element f; is a commutator. For ij 
we have fif; = Bix, which implies af; = Bijfifk = — Bisfifi, by (19), since 
k 1. Hence Qaif; [ Biifi, fil. 


2. Derivations of the algebras 9%, A derivation D of a non-associative 
algebra % is a linear transformation on % satisfying 


(zy) D = (aD)y + x(yD) 


(23) for all in 
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Equation (23) is equivalent to either one of the following: 
(24) [Re D] = Rep, [Le, = Lep for every x in 


The set of all derivations of % is a Lie algebra D(M), called the derivation 
algebra of %. 


nv 


Lemma 7. Let D be a derivation of an algebra U;. Then 
(25) =0 for every x in (26) 
where 8 is the involution x of Us, and 

(27) D’ =— D. 


Since %, is separable and commutative, it is known that in this case 
D=0, so we may assume that {= 2. From (22), since it is known that 
1D=0, we have ([aD, di] + so that 

i 
t(cD) =0. Also 
+ =0 =t(z)1D = («+ 28D 
for every 2 in Y;, implying (26). Finally (25) and (26) imply that 


t((zy)D) = + t(z(yDS)) =0, or g(xD,y) =— g(a, yD). Then 
g(z, yD’) = 9 (2D, y) =— g(x, yD), implying (27). 


THEOREM 2. Any derivation A of an algebra YU; can be extended to a 
derivation D of Mt. by defining 


(28) zD = (2, y)D = (2A, yA). 
In fact, [R., D] is 
A 0 [R. A] [Ly A]\ Rea Lys 
= Rzp, by (24) and (26). 
The following theorem is not true for ¢ = 1, 2. 


TueorEM 3. Let F be of characteristic A2,3. Then, if t= 3, every 
derivation D of Mis. = Urfy} has the form (28) for a derivation A of YU. 


We begin by proving the result for €,—%{—1,- - -,—1} and 
€i4.= €,{— 1}. In this case (27) implies that the matrix of D is skew- 
symmetric: 


_( A B 
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where A’ =— A, C’=—C. Since [R,, D] = Bz» for every 2 = (2, y) in 
€;,, we have in particular that, for every x and y in ©, 
(29) [R., A] = Res, (30) —L,B’ + = — 
(31) —Ly,sB + BL, =— Ryp's, (32) [Res, C] = Reas. 

Equation (29) states that A is a derivation of ©; Then (32) and (26) 


imply 
(33) [R., C] = Rea for every x in ©. 


Then — or 
(34) C=A+L,, c= 10. 


Putting (34) in (33), we have [R,, A + L,] = Roa = Rea + [Re Le] by 
(24), or 


[R., =0 for every x in 
Since ¢ = 3, Lemma 3(14a) implies that c is in Fl. But C’ = —C implies 
g(1, 10’) =g9(10,1) =g(c,1) =— g(1,1C) =—c, or c=0. Hence 
(35) C= A. 

Since 1D = 0 = (1A, 1B), we have 

(36) 1B = 0. 
Put y=1 in (30). Then 
(37) B’=B+R,, c= 1B". 


Transposition in (30) yields — BL,g + L,B’ =— Rypy = Ryp by (21). 
Hence 
(38) B’S = — B’. 


Putting (37) into (30) and (31), we then have [R,, Ly] + BLy.ys = LysysB, 
or [R,, Ly] = 0 for every y in That is, for every x in &, 
and by Lemma 3(14c), c—1B’ is in F1. Then (38) implies —c, or 
c= 0, and B is symmetric by (37). Hence (30) becomes 


(39) L,B = Blyg + Ryp for every y in ©. 
Let u=fif;. Then (39) implies that uB = —fi(f;B) + f;(f:B) and 


(fiw) B = — f,(uB) + u(f.B) 
= — + (ffs) (FB) 
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by (19), or 

(40) (fifi) (FB) = fifi (FB) }, 
Write for the (i+ 1)-st row vector of B 

(41) (ai, 01) for ai, bj in G1. 


Since B is symmetric, it follows from (36) that ¢(a;) 0. Let h=2** 
=m/2. For 1,7 =1,:--,4—1, we have (fifj) (fiB) = ( (fifi) a, 
and fi{f;(f:B)} = (fi(fiai), (bif;) fi). Hence (40) implies in particular that 


(42) bi(fifs) = (ifs) fi, 


Let j—=h; i=1,---,h—1 in (40). Then (— bifi, = (— Gifs), 
so that 
aif; = fii, fibs = bifi, 


By Lemma 5, + mf; for i=—1,---,h—1. But 
t(a;) = 0 implies 
(43) a; = 1=1,- -,h—1. 


For any jAt, (42) implies vifify + mifi(fifi) = + fi, 
2 (vifi — m1) fj =(. Hence vifi ml = 0, 0, 


(44) b, = 0, 
Since B is symmetric, (36) and (44) imply 
(45) a, = 0, t=h,-:-,m—1. 


For i=h,: -+,m—1; j=1,:-++,h—1, equations (40), (41), (45) 
imply (— bi(finfj), 0) = (—(fibi) fin, 0) so that 


(46) fibn = j= ‘,h—1, 
and 
(47) (fifs-n) bi = fin i=h+1,---,m—1; 


Formula (46) implies that xb, = b,x for every z in ©, ({—1=2). By 
the definition of the center of the quaternion algebra — ©, if 2, 
and by Lemma 3(13’) if t—1= 83, we have 


(48) Dy = 


443 
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For j=h; i=h+1,---,m—1, equations (40), (41), (45) imply 
(0, — bifin) = (0, — fi-ndi), or Difin = finds +1,-+-,m—1. Then 
it follows from Lemma 5 that = Ail + wifi» for 1—h+1,---,m—1, 
Put this in (47): Aifjfin — wifi = — Taking 7 ~i—h, we have 
= 0 so that 


Combining (41), (43), (44), (45), (48), (49), we have 
(50) pfi, 1=1,- 
or B is the diagonal matrix B = diag{0,y1,-- , um-1}. 


Using (50), we see that (39) implies (fif;)B = — pyfifj + wif or 
= — — Hence 
(51) = 0 
for any triple (i,j,%) satisfying fif; = Bijf,. We shall prove, by induction 
on ¢ (¢=3), that the equations (51) imply »,—0 for 1—1,- --,m—1. 
For the Cayley algebra © = &; there are, without duplications, exactly seven 
triples (i,7,4). These are given by the seven quaternion subalgebras of € 


with normalized bases which are contained in the basis 1, f,,- - -,f; of € 
[3, p. 775]. It is easily verified that the seven triples (t,j,%) are* 

(52) (1, 2, 3), (1, 4, 5), a 6, 7), (2, 4, 6), (2, 5, 7), (3, 4,7), (3, 5, 6). 

If F is of characteristic ~ 2,3, the seven homogeneous equations (51) for 
(7,7,%) in (52) have only the trivial solution p, 
Assume that in ©;, (¢ > 3) the only solution of the equations (51) is the 
trivial one 

(53) = Po =" = 0. 

Since the basis 1, - -,fm-1 for ©; includes the basis 1, f1,- -, for 
€,_,, we have (53) in ©;. Then fsfass = — fifa = fra —1) 
give h equations (51) for the triples 

(54) (3,h+1,h+2), (t,4,h +1), 


By (53) the equations (51) for the triples (54) become pass + pase = 9, 
Pat = 0 which clearly have only the trivial 
solution = pas =* for F of characteristic 42. Hence 


‘Our notation differs slightly from [3], but the desired triples (i,j,k) are the 
same. 
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B=0, and by (35) any derivation D of ©:,, = €,{—1} has the form (28) 
where A is a derivation of ©. 

We extend this result to arbitrary 2; and %;,, = %:{y} as follows. For 
A in D(G:), A<>D in (28) is an isomorphism between the derivation 
algebras D(€;) and D(G;,,). There is an extension K of F such that 
Vin = rx, = x. Hence 


D (Wt) x — D (Wx) = D(Cix) = x = D(C) x = D x- 


For A in (M+), A<>D in (28) is an isomorphism of D(W;) onto a sub- 
algebra D of But = D x implies dim D dim D(z) 
= dim D(%41), so that D = D(%41), and every derivation D of has 
the form (28). This completes the proof of the theorem. 


THEoreM 4. Let F be of characteristic 42,3. Let © = Uof{y1, v2) ys} 
be a Cayley-Dickson algebra over F and 


be any algebra formed from © by iteration of the Cayley-Dickson process. 
Then D(X) = D(C), a 14-dimensional Lie algebra over F. Also © 1s 
invariant under D(M,), and every derivation of € can be extended in a unique 
way to a derivation of UX; = C{ys,: - >, ye}. If F is of characteristic 0, then 
D(X) = D(C) ts a central simple Lie algebra of type G. 


It follows from Theorem that D(Wf{yii}) =D(W) fort =3. Then 
D(X) = D(Azs) — D(C). Also it follows from Theorem 3 that %% is 
invariant under D(Wif{yii}) for += 3. Hence is invariant under 
2(U:). That D(C) is 14-dimensional and, in case F’ has characteristic 0, 
is a central simple Lie algebra of type @ is established in [3]. 


TuHEorEM 5. Let F be of characteristic 0. Then the derivation algebras 
ye}) and for t => 3, s => 3, are isomorphic 
if and only if the Cayley-Dickson subalgebras = Mof{y:, ys} and 
= yo’, ys’} are isomorphic. 


For D(Mo{y1,° = D(C) and D(Mo{yi’,- ye }) = D(C’) by 
Theorem 4. But D(€) = D(C’) if and only if € = © by [8, p. 782]. 

It is well-known that in the construction of a Cayley-Dickson algebra € 
any generalized quaternion subalgebra OQ of © will suffice for the algebra 2, 
from which © = Y%; is constructed by the Cayley-Dickson process. That is, 
€=Of{y} for some y in F [3, p. 776]. That this property is not enjoyed 
in general by the algebras %; for t > 3 may be seen from Theorem 5 (for F 
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where @ is not isomorphic to © (as would be the case if F were the field of 
all real numbers and yi: =y2=ys=—1, ys=1). Then the algebra 
= ys, ys} C{ys} contains a Cayley-Dickson subalgebra ©” ‘iso- 
morphic to ©’. However, there is no y in F such that M, = C”’{y}. For, : 
if there were, then D(C{ys}) = D(C’ {y}) would imply © = by 
Theorem 5. Hence %, cannot be constructed by the Cayley-Dickson process 
from its subalgebra ©”. A similar argument holds for ¢>4. It is likely 
that one can prove by a different method that the same situation exists for | 
characteristic p > 0. 


of characteristic 0). For, let © = y2, ys} and = ys} | 
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A KRONECKER FACTORIZATION THEOREM FOR CAYLEY 
ALGEBRAS AND THE EXCEPTIONAL SIMPLE 
JORDAN ALGEBRA.* 


By N. Jacosson.* 


A well known theorem, due to Wedderburn, states that if 8 is an 
arbitrary associative algebra with an identity 1 and Y& is a finite dimensional 
central simple subalgebra containing 1, then % is the Kronecker product 
% — 9% ®@ U where WU is the subalgebra of elements which commute with every 
aeM. Recently Kaplansky indicated that the same result holds for 8 any 
alternative algebra and I an algebra of Cayley numbers.? It is the purpose 
of the present note to give a new proof of this result and to show that an 
analogous result holds for 8 any Jordan algebra and 2 an exceptional simple 
Jordan algebra of finite dimensions. 

In considering a subalgebra 2 of an algebra % it is natural to regard B 
as a bimodule relative to 9. One is therefore led to believe that the under- 
lying reason for the validity of the results of the type noted (beginning with 
Wedderburn’s) is to be found in the theory of bimodules. This is indeed the 
case in the associative and alternative cases where the key result in the proof 
of the factorization theorem is the theorem that every associative or alterna- 
tive bimodule for 2 is completely reducible and 2 has only one irreducible 
bimodule (cf. [2], $18). We have recently proved that the analogous 
result holds also for any exceptional simple Jordan algebra of finite dimen- 
sions (cf. [2], §18). However, there appear to be certain technical 
difficulties which have prevented our giving a similar type of proof for the 
Jordan theorem. Instead, we shall base our proof for these algebras on the 
corresponding result for Cayley algebras and on some structure theorems for 
Jordan algebras which were used in deriving the results on bimodules for 
Jordan algebras. 


* Received November 9, 1953. 

*The results of this paper were obtained while the author held a Guggenheim 
Memorial Fellowship. 

* This is stated in [5]. Kaplansky observes that the proof can be obtained by using 
the arguments given by Albert in [1]. The situation for the other central simple alter- 
native algebras is mentioned in the Concluding Remarks at the end of this paper. 


447 


| 

| 

| 


N. JACOBSON. 


1. The alternative theorem. Throughout this paper we assume that 
the base field is of characteristic + 2. 


Lemma 1. Let 8 be an arbitrary alternative algebra and YU a finite 
dimensional separable* subalgebra. Set U={u|ueB,au— ua, ae Y}. 
Then UU is a subalgebra of B and (au) (bv) = (ab)(uv) holds for all 
u,ve ll. 

Proof. It suffices to prove this result under the added assumption 
that the base field is algebraically closed. In this case the structure theory 
shows that % has a basis consisting of idempotent elements. Now let e 
be an idempotent element in 8 and consider the Peirce decomposition 


B = B1, + Bro + Bor + Boo where Bij {bi; | edi; dijbi; = die}. 


If uj, Bi; then ew—vue is equivalent to the condition that 
U = Uy, + Ugo. It is known that the subset 11(e) of elements of this form 
is a subalgebra. Since U = NU(e), ee AM, U is a subalgebra of B. Now let 
Xj, Bij, be any element of B. If w= + woo, then 


(xve)u + + Uoo) (211 + Lor) 
x(eu) = (211 + + + Loo) — + Lo1) 


Hence the associator A(z, e,u) = (xe)u—a(ew) for all in and 
allwel(e). Then A(z,a,u) =0 for ue. Now let a, 
u,veU. Then 


(aw) (bv) = ((aw)b)v = ((ua)b)v = (u(ab) )v = ((ab)u)v = (ab) (uv). 
This completes the proof. 
Lemma 2. Let U and U be alternative algebras with identities. Then 


B= AOU is alternative if and only if either A and U are associative or 
one of these algebras is associative and commutative. 


Proof. If a,b,ce and u,v,we Ul, then 


A(a@u,b@v,c@w) 
= (ab)c® (uv)w —a(be) @ u(vw) 
= (ab)c® (uv) w — (ab)c @ u(vw) 
+ (ab)c®u(vw) —a(bc) @ u(vw) 
= (ab)c@A(u,v,w) + A(a, b,c) @u(vw). 


* A finite dimensional alternative algebra % is called separable if Up is semi-simple 
for © the algebraic closure of the base field. 
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If 8 is alternative, then 
0 = [ab]c @ A(u, v, w) + A(a, b,c) @ (u(vw) — v(uw) ) 
where [ab] = ab — ba. We distinguish five cases: 


(i) There exist u,v,w such that A(u,v,w) and u(vw) —v(uw) are 
linearly independent. Then A(a, b,c) =0 and [ab]c=0. This means that 
Y is associative and commutative. 


(ii) There exist wu, v,w such that A(u, v,w) u(vw) — v(uw) A0 
but these are linearly independent. Then u(vw) —v(uw) =AA(u, Vv, w), 
Hence [ab]c —=—AA(a,b,c) for all a,b,c. Setting c—1 gives 
[ab] =0. Since A+ 0, A(a, b,c) =0 and YF is associative and commutative. 


(iii) There exist u,v, w such that A(u, v, w) 40 but u(vw) = v(uw). 
Then [ab]c =0 and [ab] =0. The original relation now gives A(a, b,c) 
=0 so that again % is associative and communicative. 


(iv) A(u,v,w) =0 but there is a u,v,w such that u(vw) ~Av(uw). 
Then A (a,b,c) and both U and are associative. 


(v) A(u,v,w) =0 and u(vw) =v(uw). Here UW is associative and 
commutative. 


We can now prove 


TuHEeorEM 1. Let B be an alternative algebra with an identity 1 and 
% a subalgebra containing 1 and having the structure of a Cayley algebra. 
Then 8 = where U is the center* of B. 


Proof. We consider 8 as an Y%-bimodule. It is known that every Y%- 
bimodule in which 1 is the identity operator (left and right) is a direct sum 
of bimodules isomorphic to %. This implies that B contains a set of elements 
{u,} (the images of 1 in the isomorphism with %) such that lug = ug = ual, 
= Ugt, a(bu,) = (ab)ug for all a,be and every element of B can be 
written in one and only one way in the form > dgta, dae. Let U be the 
suoset of elements of 8 which commute with every ae A. By Lemma 1, UW is 
asubalgebra. Evidently the uae UU. On the other hand, let > U, 
Then Lemma 1 shows that Ue = Ug. Hence aga = ad, 
for all a. Since % is central this means that ag = pal, pa in the base field. 


_ 


‘The center of a non-associative algebra is the set of elements which commute with 
every element and associate with every pair of elements. The center is a commutative 
and associative subalgebra. 


12 


449 
t 
a | 


450 N. JACOBSON. 


Hence 11 is the space spanned by the we. It is now clear that 8B = 1 ® U. 
Lemma 2 then shows that U is associative and commutative. It follows 
that 11 is contained in the center. On the other hand, it is clear from the 
definition that 11 contains the center. 

The assumption that the identity of 9 is an identity in 8 can be dropped. 
One then obtains the following extension of Theorem 1: 


TuEoREM 2. Let B be an alternative algebra and XM a Cayley sub- 
algebra. Then 8 =%, ® By where the B; are ideals and B, 2 MW and is a 
Kronecker product 8, = @U, U the center of 


Proof. Let 1 be the identity of % and let B = Boo + Bor + Bio + Bi 
be the Peirce decomposition relative to 1. Thus 


Bi; = {bij | 1bi; = = 8 
It is known that C Bo, C Bo. and if aa’ eM, Bo, then 
(aa’) bo: = a(a’bo:) (cf. [2], Since is not associative, this implies 


that —0. Similarly 8,,—0. Hence = B, © By where = is an 
ideal. Since 1e%,, CB, and B,— A@U, UW the center of B,. 


2. The Jordan theorem. For the proof of the factorization theorem 
for Jordan algebras, the following three lemmas will be required. 


Lemma 1. Let & be a Jordan algebra containing an identity 1 and 
elements ei, = Ujiy 1A J, 1,7 = 1,2,° such that 


(1) = Uij, wi; = 2 (ii €;;) Ui jU = Uiky 1, k 


and all other products are 0. Assume, moreover, that $3 ei =1 and nZB. 
Then & is isomorphic to the subalgebra § of hermitian matrices of a suttable 
matrix algebra Dny where D is an involutorial alternative algebra which ts 
associative if n > 3 and, in any case, has the property that its self-adjoint 
elements are in the nucleus. 


We recall that the nucleus of an algebra is the subset of elements n 
such that 


A(a, b,n) =A(a,n,b) =A(n, a,b) =0 for all a, b. 


Lemma 1 is proved in a forthcoming paper ([2], Th. 9.1) by the author. 
We remark also that in the identification of ® with § we may identify ¢ 
with (i,i)-matrix unit and with e;-+ ei, ey; the (i, 7)-matrix unit. We 
shall also require two other results on Jordan algebras ([2], Th. 7.1 and Th. 
7.3) as follow. 
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LemMA 2. Let % be a subalgebra of the algebra R(—) of the pre- 
ceding lemma containing the elements ey, uy. Then 3=RNE, where € 
is a subalgebra of D invariant under the involution in D. 


LemMA 3. Let &, and R, be two algebras of the type given in Lemma 1 
with the same n for the two. Let S be a homomorphism of &, onto &, 
mapping ei, onto the corresponding uj°. Then can be 
extended to a homomorphism of Din onto Don. 


If § is an isomorphism, it follows that its extension is an isomorphism. 
In any case the extension maps the system of matrix units of D,, onto that 


‘of Don. It follows that if § is an isomorphism then D, and D, are isomorphic. 


We can now prove 


THEOREM 3. Let & be an exceptional simple Jordan algebra of finite 
dimensions and let & be an arbitrary Jordan algebra containing X and having 
the same identity as R. Then & is a Kronecker product R = 3 ®@ € where © 
is the center of &. 


Proof. 'The argument used in the proof of Theorem 1 reduces the 
consideration to the case of an algebraically closed base field. In this case 
we may assume that the exceptional simple Jordan algebra is the algebra of 
hermitian Cayley matrices. Hence 9 contains a set of elements ej, wij, 
1,7 = 1, 2,3 satisfying the conditions of Lemma 1. Since and has 
the same identity as 3, ® can be identified with the subalgebra of hermitian 
matrices in an algebra D; where D is an involutorial alternative algebra 
whose self-adjoint elements are in the nucleus. We may suppose also that 
the elements e, are the matrix units ey and wij = ej + eji,.t547. Then by 
Iemma 2, §¥=RME,; where € is a subalgebra of D closed under the 
involution. By Lemma 3, € is a Cayley algebra. Then, by Theorem 1, 
D—(€@C where © is the center of D. Let u be a skew element of € 
(i —— uw) and let c be a skew element 40 in €. (Elements c exist since 
the nucleus of © is the set of multiples of 1.) If uw 0, cu is a self-adjoint 
element not contained in the nucleus of D. Hence u = 0 and € contains only 
self-adjoint elements. Thus © C&. It is now immediate that © is the 
center of and that R=ZOC. 

As in the alternative case we have the following extension of Theorem 3. 


THEOREM 4. Let & be an exceptional simple Jordan algebra and let R 
be an arbitrary Jordan algebra containing § as a subalgebra. Then R is a 
direct sum of two ideals R =Ry) where R, DF and R, = SOE, C the 
center of R,. 
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Proof. Let 1 be the identity of and let GD Ki be the 
Peirce decomposition of & relative to the idempotent element 1. Then Q, 
is a subalgebra containing ¥ and having the same identity. Hence 8, = 36, 
€ the center of &;. Y maps RK, into itself and the correspondence j > §; 
where j and S; denotes the multiplication xj, x is a special repre- 
sentation of § ([2], §6). Since § is simple and exceptional it follows that 
S;=0 for all 7. For =1 this gives Rj 0. HenceR Ko. Since 
RR, =0 and R,? CR, this proves the theorem. 


Concluding remarks. We have recently proved the converse of Wedder- 
burn’s theorem: If % is an associative algebra with an identity which is a 
Kronecker factor of every containing associative algebra having the same 
identity, then % is finite dimensional central simple. It can be seen from 
the representation theory of [2] that no special central simple Jordan algebra 
has the property of the theorem proved here. One might therefore conjecture 
that this property characterizes the exceptional simple Jordan algebras. In 
the alternative case one can prove, using Zorn’s methods, that if B is alterna- 
tive with an identity and 9% is an associative central simple subalgebra con- 
taining 1 and of more than 4 dimensions, then % is necessarily associative. 
In this case Wedderburn’s theorem applies. The example of the Cayley 
algebra shows that this is not the case if 2 is a quaternion algebra. Never- 
theless, it is conceivable that the structure of alternative algebras 8 with 1 
containing a quaternion subalgebra containing 1 can be fully determined. 


YALE UNIVERSITY. 
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THE JACOBIAN VARIETY OF AN ALGEBRAIC CURVE.* + 


By Wer-Liane CHow. 


1. Preliminary remarks. We begin with a few remarks concerning 
terminology and notations. We shall follow in general the terminology and 
notations of A. Weil, as developed in his books [8], [9], with the modification 
that we shall operate throughout with points and (complete) varieties in 
projective spaces only. The projective space of dimension m will be denoted 
by Gm. If (x) = (2%, 21," °°; 2m) is a point in Gm, the 2; being its homo- 
geneous coordinates, and if K is a field, we shall denote by K((z)) the field 
generated over K by the adjunction of the mutual ratios of the 2; similarly, 
if (x), (y),- *, are points in projective spaces, we denote by K((z), (y),° 
the field generated over K by the adjunction of the mutual ratios of the homo- 
geneous coordinates of every one of the points. The dimension of a point (2) 
over K is then the degree of transcendency of K((x)) over K, and (z) is 
said to be algebraic or rational over K if K((x)) is algebraic over K or 
coincides with K respectively; we shall also say that a homogeneous poly- 
nomial or form is rational over K, if the mutual ratios of its coefficients are 
allin K. We shall use the capital letters U, V, W, X, Y, Z (and these only) 
to denote indeterminates; for convenience, a system consisting of m+ 1 
indeterminates (X) = (Xo, Xi, - +,Xm) will be called a system of indeter- 
minates in Gym. 

We shall assume that the reader is familiar with the theory of associated 
forms of positive cycles in a projective space, as developed in Chow-van der 
Waerden [4]. For a positive cycle 8 of dimension r and degree d in a 
projective space Sp, the associated form is a form f(V°,- - -, V") of degree d 
in each system of indeterminates (V‘) = - V'm), =0,1,---, 73 
this form is characterized by the following property: For any set of r hyper- 


m 
planes in given by the equations } vi;X¥;=—0, i—1,---,7r, the form 
j=0 


* Received June 29, 1953. 

+ This paper was written in 1948; a brief summary of the results contained in it 
appeared under the same title in the Abstract of the addresses given at the Conference 
on Algebraic Geometry and Algebraic Number Theory, University of Chicago, 1949, pp. 
25-27. It is published here with some minor revisions, mainly in the first two sections. 
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f(V°,v',- > +,v7) does not vanish identically if and only if 8 has a finite 
number of common intersections with these r hyperplanes, and when such is 
the case, the form f(V°,v',---,v") dissolves into a product of d linear 
forms, giving all the common intersections with the proper multiplicities, 
If we regard the set of coefficients (f) of the form f(V°,- - -,V"), arranged 
in an arbitrary but fixed order, as a point in a projective space ©;, and if 
(f’) is a specialization of (f) over any field K, then it follows from the main 
theorem of associated form ([4], Satz 2) that the point (f’) corresponds 
also to the associated form f’(V°,- - -, V") of a positive cycle 3’ of dimension 
r and degree d in ©,,; this positive cycle 8’ is said to be a specialization of 
8 over K. A property or relation concerning a point (x) in ©,» and a 
point (y) in S, is said to be algebraic over a field K, if there is a set of 
forms F(X, Y) in K[ X,Y], homogeneous in each of the systems of indeter- 
minates (X) = (Xo, - -,Xm) and (Y) = (Yo, ¥:1,° Yn), such that 
the property holds for any two points (x) and (y) in ©, and ©, respectively 
if and only if the equations F(z, y) = 0 hold; we then say that the property 
in question can be expressed as an algebraic condition (over A’) in terms of 
the points (x) and (y). In case the set F;(X, Y) contains forms in (X) or 
(Y) alone, the point (x) or (y) can be restricted to the points of a certain 
bunch of varieties in S,, or S, respectively; in such a case the property in 
question is often defined only for the points of certain bunches of varieties 
in GS» and G,, so that the equations defining these bunches of varieties are 
already implicitly contained in the definition of the property. The following 
fact is important for us later: The property that a given algebraic relation 
over K holds for a point (y) in ©, and every point (x) in a positive cycle 3 
in GS» can itself be expressed as an algebraic condition over K in terms of 
(y) and the point (f) in ©;. To show this, we refer to the proof of Satz 2 
in [4] and consider the equations (2), (3), (5) listed there; in our present 
notations these can be writen as a set of equations 


where (X'),- - -, (X%) are d distinct systems of indeterminates in ©,, and 
(Z) is a system of indeterminates in ©; Let Rj(V',---,V", Y,Z) =0 
be the set of equations which we obtain by eliminating the indeterminates 
(X"),- - -,(X% from the set of equations F(X’, Y) =0,---, F(X4, Y) =0, 
G(V1, --,V",X*,---,X4Z)—=—0, and let A;j(Y,Z) be the set of all 
coefficients of the forms R,V',---,V",¥,Z), considered as forms in 
(V?),---,(V*) only; then it is easily seen from the cited proof in [4] 
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that the property in question holds for (y) and 8 if and only if A;(y, f) = 0. 
It is clear that this result remains true if more than one point (y) or more 
than one cycle 8 are involved, and it is also clear that it remains true even 
if the original algebraic property does not actually involve the point (y) at 
all, so that it is simply an algebraic property for the points of the positive 
eycle or cycles. For later reference we shall state our result as a lemma: 


LemMA 1. Let 8',- - -, 3% be positive cycles in projective spaces and 
let (f'),: be the systems of coefficients of the associated forms of 
+, B® respectively; let (y'),---,(y") be points in projective spaces. 
Then the property that a given algebraic relation over a field K holds for the 
system of points (y'),- (y") and every system of points -, (2°), 
where each (x‘) runs through all the points in 3*, can be expressed as an 
algebraic condition over K in terms of (y'),- -, (y") and (f*),- -, (f*). 


It is easily seen from the definition of the associated form that if the posi- 
tive cycle 8 is rational over a field K, then its associated form f(V°,: --, V*) 
is also rational over K, i.e. K((f)) = K. We shall now show that the converse 
of this statement is also true, provided that 83 contains no component with a 
multiplicity which is divisible by the characteristic of the universal domain ; 
more precisely, we shall now prove the following lemma: 


Lemma 2. Let 8 be a positive cycle in a projective space and let (f) 
be the set of coefficients of the associated form of 3; if 8 contains no com- 
ponent with a multiplicity which is divisible by the characteristic of the field, 
then 3 is rational over the fields ko((f)), where k ts the prime field. 


Proof. Without any loss of generality we can assume that the form 
f(V°,- - -,V*) is irreducible over the field — k.((f)), for otherwise we: 
can apply the same argument to each irreducible factor of f(V°,- - -, V"): 
over K,; then our assumption about the multiplicities of the components in. 
8 implies that the form f(V°,- --,V*) has no multiple factors over Ko. 
It follows that f(V°,- --,V*) is the product of a number s of absolutely 
irreducible forms, all distinct and conjugate to each other over K,, and each 
one of these forms is rational over a finite extension of degree s over Ko. 
In order to prove our assertion, it is therefore sufficient to restrict ourselves 
to the case where the form f(V°,- - -,V") is absolutely irreducible and the 
positive cycle 8 is a variety defined over a field K containing Ky, and to show 
in this case the variety 8 is also defined over Ky. Consider the equation 
{(V°,- - -, Vr) =0; since the form f(V°,- -, V") is absolutely irreducible, 


: 
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this equation defines a variety T of dimension (r + 1)m—1 in the (r + 1)- 
fold product space Sm X-°+-*,X Sm, defined over the field Ky. Let (2) 
be a generic point of 8 over K, and let (v°),- - -, (v") be r+ 1 independent 


m 
generic solutions of the linear equation > 2;V;—=0 over the field K((zx)); 
j=0 


if r’ (=r) is the dimension of (x) over Ko, then the point (v°,- - -,v") 
in Z has the dimension 


(r+1)(m—1) +7 =(r+1)m—14 2(r+1)m—1 


over Ko, from which it follows that r= 7’ and that (v°,- - -,v") is a generic 
point of T over Ky. Since the point (v°,- --,v") has the same dimension 
(r + 1)m —1 over both K and Ko, the fields K,((v°),- - -,(v")) and K must 
be independent with respect to each other over Ko, and since K((v°),- --, (v’)) 
is a regular extension of Ko, this means that K,((v°),- --,(v")) and K are 
linearly disjoint with respect to each other over Ko. Without any loss of 
generality, we can assume that 2 ~ 0 and that z, —0 in case any one of 
the coordinates 2; vanishes ; then we can take all v‘; (7 40) to be independent 


variables over K((x)) and we have v‘,, ~ 0 for alli —0,1,---,7. We have 
then the relations (vt (2j/2o) = — 1 = 0, 1,° 17; the first 
j=1 


r equations show ([8], Chapter V, Theorem 1) that (x) is separably algebraic 
over K,((v°),- - -, (v"-*)), and then the last equation shows that (2) is rational 
over K,((v°),- - -,(v")). Since K,((x)) is subfield in K,((v°),- - -,(v7)), it 
is also linearly disjoint with respect to K over Ky; this shows that the 
variety § is also the locus of (x) over Ky and hence is defined over Ko. 


2. The problem. Let © be a curve in the projective space G» of 
dimension m, defined over a field k, and let h and g be the degree and genus 
of © respectively. A positive divisor p of degree n in © can be considered 
as a positive 0-cycle in S,, and as such it can be represented by its associated 
form p(V). Let 2.(V),9:(V),- - -,Q:(V) be the set of all power products 
of degree n in (V), arranged in an order which is arbitrary but fixed 


t 
once for all; if we set p(V) => p,Q;(V), then the set of coefficients 
j=0 


(p) = (Po, Pi," * *, pt), arranged in this fixed order, determines a point in 
the projective space ©; of dimension ¢, so that every positive divisor of degree 
nm in © is represented in a one-to-one manner by a point in ©;. We shall 
assume that the curve © is free of singularities, so that every 0-cycle in Gm 


| 
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which is contained in © can also be considered as a divisor in ©; then the 
set of all points in S; which represent positive divisors of degree n in © is a 
variety ©" of dimension n, defined over k. In fact, let y be the positive divisor 
of degree n consisting of n independent generic points (2'),- - -, (a") of © 
over k, and let (y) be the point in SG; representing y; then, for any positive 
divisor p of degree n in ©, the point (p) is a specialization of the point (y) 
over k, and conversely, by the main theorem of associated form, every such 
specialization (p) determines a positive 0-cycle of degree n in Gm which is 
contained in © and hence is a positive divisor of degree n in ©. Since k((y)) 


is contained in the regular extension k((a'),- --,(a")) of k, it is also a 
regular extension of /, and since k((x'),- - -, (@")) has the degree of trans- 


cendency n over k and is an algebraic extension of k((y)), the field k((y)) 
must have also the degree of transcendency n over k. It follows that the 
point (y) has a locus over k which is a variety of dimension n and which 
evidently coincides with ©". As we shall see later, this variety ©” is also free 
of singularities, but we shall not need this fact for the present. It is clear 
that if a positive divisor p is rational over a field K, then the point (p) 
representing it is also rational over K; the converse of this statement is also 
true, as we have shown in [3], so that for any point (p) in ©" the positive 
divisor p determined by it is rational over k((p)). 


Two divisors p, q in © are said to be linearly equivalent and we shall 
write p~q, if p —q is the divisor of a rational function on ©, or in other 
words, if p— q is a principal divisor in ©. It is well known that the set of 
all principal divisors in € forms a subgroup 9;(€) in the group 9(C) of all 
divisors in ©, and that the quotient group of &(€) over 9,(C) is the group 
® of divisor classes in ©, each class consisting of all divisors which are 
linearly equivalent to each other. In this group ® the classes of degree zero, 
sometimes called the zero classes, again form a subgroup Do, while the classes 
of any one fixed degree n form a coset D, of Dp in D; if we denote by Ha(C) 
the group of all divisors of degree zero in ©, then Dy is the quotient group 
of G,(C) over 8,(C). If we let any one fixed divisor class © of Dy 
correspond to the principal class of Do, then the relation D,—0D + Dy 
establishes a one-to-one correspondence between the classes of D, and those 
of Do. Through this correspondence the group addition of Dy will induce 
in D, the following operation: Given any two classes 8,0 in Dp, the class 
¥ + OQ —D, also belonging to Dy, is called the sum of % and Q. We shall 
call this operation in D, also addition (with respect to the class ). 


A divisor class $$ is said to be rational over a field K, if it contains a 


= 


458 WEI-LIANG CHOW. 


divisor » which is rational over K. It is well known that the set of positive 
divisors in the class % constitutes a linear system ; that is, there exists a linear 
system of hypersurfaces in ©, which cut out on ©, besides a fixed component, 
exactly the positive divisors of the class %. If the class $ is rational over K, 
then ([8], Chapter VIII, Theorem 10) this linear system of hypersurfaces 
has a basis consisting of hypersurfaces which are defined over K. We recall 
that according to the Theorem of Riemann-Roch, the linear system of positive 
divisors of a class of degree n > 2g — 2 has exactly the dimension n — g. 


A variety 3, defined over a field K containing &, is called a Jacobian 
variety of the curve ©, if it has the following properties: (1) ¥ is an Abelian 
variety defined over K; (2) there is a homomorphic mapping ® of 9,(C€) 
onto %, called the canonical homomorphism, whose kernel is precisely $,(C€) ; 
(3) the homomorphism © is rational and is defined over K in the following 
sense: if p, is a rational divisor over K(w), where (w) is any set of elements 
(in the universal domain), then the point @(p,) is also rational over K (1), 
and if p, is any specialization of p, over a specialization (w) — (pz) over K, 
then ®(p,) is also a specialization of ®(p,) over the specialization (w) > (p) 
over K; (4) the rational homomorphism @ has the “ universal maping” 
property: if & is any rational homomorphism of 9 (€) into any Abelian 
variety M, then W is the product of © and a rational homomorphism of 3 
into %. It can be easily seen from the “ universal mapping” property (4) 
that the Jacobian variety 3, it it exists, is uniquely determined up to a 
birational isomorphism, so that we can sometimes speak of the Jacobian 
variety. However, the question whether such a Jacobian variety exists at all 
is not a simple one, and this existence problem of the Jacobian variety is one 
of principal importance not only for the theory of algebraic curves, but also 
for the theory of Abelian varieties. In this paper we shall offer a general 
solution of this problem. 


In the classical case, where the ground field is the field of complex 
numbers, a proof of the existence of the Jacobian variety can be obtained 
from the known results in the literature in the following way. (The 
terminology and notations in this one paragraph do not follow strictly the 
conventions set forth in section 1.) Let J,,- - -,Z, be a system of g inde- 
pendent Abelian differentials of the first kind on ©, and let y,,- - -,yog be 


a base of the first Betti group of © and we set r= I;; let €, be the 


complex linear space of dimension g, considered as an analytic (= complex- 
analytic) group variety, and let [P] be the discrete subgroup generated by 
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the 2g rows of the matrix P = (pj), each row being considered as a point 
in Ey, and let €,/[P] be the quotient group variety of ©, over [P]. Let 
(%o) be any fixed point in © and let ¢ be the analytic mapping of © into 


€,/[P] defined by the formula = (mod. [P]) ; 


then, if p= ni(x;) is any divisor in the correspondence p> > 
defines a homomorphism ® of §,(€) onto &,/[P], whose kernel is, by the Abel’s 
Theorem, precisely the group 9(€). If © is any homomorphism of §,(@) 
into an analytic commutative group variety then = ¥( (x) — (a) ) 
is a mapping of © into % and we have ¥(p) = S niw(a) ; we shall say that 
the homomorphism W is analytic if the mapping y is analytic, and we observe 
that, in case Yf is algebraic, the homomorphism © is rational if and only if 
y is rational (over the complex ground field). It is clear that the homo- 
morphism ©@ defined above is analytic; we shall show that if © is analytic 
and if %f is analytically isomorphic to a complex torus, then © is the product 
of @ and an analytic homomorphism of €,/[P] into Y. We set X—E&,/[Q], 
where = (qi) is a (27, 7)-matrix such that the matrix (Q,@) is non- 
singular, and let z,,---,2z, be the coordinates of a variable point in &,, 
so that dz,,- - -,dz, can be considered as Abelian differentials of the first 
kind on &,/[Q]; it is well known that a base 8,,- - -,82, for the first Betti 


group of €,/[Q] can be so chosen that we have qi; = f, dz; Let dyi,---, dy, 
% 


be the Abelian differentials of the first kind on © induced by dz,- - -, dz, 
respectively through the mapping y, and let be the image 
cycles of -, y29 respectively under y; then there exist a (g,1)-matrix 
M = (mj) and an integral (2g, 2r)-matrix N = (nj) such that we have the 


g 2r 
relations dy = > mylj, W(yi) = nisd;, and 
j=l j=1 


g g ad or 
PM = = > mus = dy; = dz; = > nix dz; 
k=1 k=1 k=1 ox 


2r 
= ps NikQkj = NQ. 
k=1 


This shows that the linear mapping of &, into &, determined by the matrix M 
induces an analytic homomorphism © of €,/[P] into &,/[Q], and we have 


the relation 


¥ 
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or y(z) = @¢(zx), from which it follows that Y¥— ©. Now, according to a 
result of Lefschetz ([5], p. 368), there exists an analytic isomorphism o of 
€,/[P] onto an Abelian variety 3; if we assume that there is an analytic 
isomorphism + of €,/[Q] onto an Abelian variety %, and if we now write 
V, in place of o¢, o&, Ty, r¥, respectively, then we have again 
the relations @©¢ and Since ¢, and © are now analytic 
mappings of an algebraic variety into another algebraic variety, it follows 
from a theorem of ours proved elsewhere ([2], Theorem VII) that they are 
all rational transformations, and hence the homomorphisms ® and © are also 
rational. This shows that Y is the Jacobian variety of © with ® as the 
canonical homomorphism. 

In the general case of an arbitrary ground field, it is clear that the 
above highly transcendental proof cannot be carried over and that new 
methods of a purely algebraic nature have to be introduced. A. Weil, who first 
proposed and studied this problem, obtained a solution by a generalization 
of the concept of an algebraic variety ([9], § V); in fact, the attempt to 
solve this problem is one of the main reasons which led Weil to introduce 
his notation of an “abstract” variety, an algebraic variety in abstracto in 
contrast to the usual ones which are embedded in a projective space. As 
Weil has been able to extend most of the fundamental results in algebraic 
geometry to the “ abstract ” varieties, his “ abstract ” Jacobian variety proves 
to be almost just as useful as the projective Jacobian variety in the classical 
case and thus constitutes a satisfactory solution of the problem. Nevertheless, 
it remains an important and interesting question whether a Jacobian variety 
exists as a projective algebraic variety in the usual sense; furthermore, what 
is probably even more important from the algebraic viewpoint, there remains 
still unsolved the question of the field of definition for the Jacobian variety 
and the canonical homomorphism, as has been pointed out by Weil himself 
({9], p. 68). Our solution of the problem will provide satisfactory answers 
to both these questions; we shall show that the Jacobian variety exists as a 
variety in a projective space, and that both the Jacobian variety and the 
canonical homomorphism are defined over the defining field of the curve G. 
This latter fact, which seems to be new even in the classical case, has important 
applications in the theory of Picard varieties over arbitrary ground fields, as 
will be shown in a forthcoming paper of ours. 

We shall describe briefly the main idea underlying our construction of 
the Jacobian variety. Since the positive divisors in © can be represented 
by the points of an algebraic variety, it is natural to try to represent the 
divisor classes by associating with them certain positive divisors. According 
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to the Riemann-Roch Theorem, each class of D, contains in general only one 
positive divisor; thus each point of @% will represent a class in D,, and the 
representation is one-to-one except for the points of a proper subvariety in ©’. 
It is therefore natural that this variety © has been usually taken as the 
starting point in the construction of the Jacobian variety; thus Weil’s 
“abstract ” Jacobian variety is essentially a collection of birational trans- 
forms of © patched together in a suitable manner. Also van der Waerden, 
in a paper [7%] which deals in reality with certain aspects of the Jacobian 
variety, has taken this variety © as the starting point and has constructed 
by means of the associated forms a variety in a projective space, the points 
of which are in one-to-one correspondence with the classes in D,; however, 
his results do not show that the so constructed variety is the Jacobian variety, 
as, for one thing, there is no proof that the variety is non-singular. We 
observe that in both cases the main difficulty lies in the existence of special 
divisor classes of degree g which are represented not by points but by sub- 
varieties in ©; one has then to cope with the fact that the divisor classes are 
represented by different types of geometrical entities. If now, instead of ©9, 
we consider the variety ©” for sufficiently high n (say n > 2g —2), then 
there will be no special divisor classes and every divisor class of degree n 
will be represented by a subvariety of dimension n—g in ©"; the method 
of associated forms then will enable us to represent the divisor classes by 
ponts of a certain variety, and from the homogeneous nature of the con- 
struction one would expect the so constructed variety to be non-singular. This 
is the underlying idea of the method by which we shall construct the Jacobian 
variety in the next section ; however, the proof that the so constructed variety 
is non-singular is not so simple and will be given later in section 5 as an 
application of a general theorem proved by us elsewhere. In the final 
section we then introduce the canonical homomorphism and complete the 
proof of the existence of the Jacobian variety. 


3. Construction of the variety 8. We start with the variety ©" of n 
dimensions, defined over the field k, for a fixed n > 2g—2. For any point 
(p) of ©", such that the corresponding positive divisor p is rational over a 
field K containing &, the Theorem of Riemann-Roch, applied to the curve € 
over the field K, shows that the complete linear system of positive divisors 
on © determined by the divisor p has the dimension n —g and is rational 
over the field K. In other words, the set of all positive divisors in the class 
8 determined by p consists exactly of those divisors cut out on the cuve 6, 
apart from a fixed component, by a linear system of n—g- 1 hyper- 
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surfaces Su jj(X) =0, where the y;(X) are forms of the same degree in 
j=0 


K[X] and the (U) = (Uo,: + +,Un-) is a system of indeterminates; this 
implies also that no hypersurface of this system contains the curve G. 
Let (wu) = (to, * *,Un-g) be a set of independent variables over K, and 
let p, be the divisor cut out on ©, apart from the fixed component, by the 


n-9 
generic hypersurface > ujj;(X) —0 of the system; then the divisor p, is 
j=0 


rational over K(u) and hence is represented by a rational point (py) over 
K(u) in ©. It is well known that over any specialization (wu) — (m) over K, 
the divisor ), specializes into the divisor p, cut out on ©, apart from the 


n-9 
fixed component, by the hypersurface } pjyj(X) =—0. It follows that over 
j=0 


any specialization (wv) — (») over K, there is a uniquely determined speciali- 
zation (py) of (py), which represents a divisor in $$, and that in this way 
all positive divisors in % can be obtained. Since K((py)), being a subfield 
of K(u), is a regular extension of K, this shows that if we denote by Gg the 
set of all points in ©" which represent positive divisors in $8, then Gg is a 
variety of dimension n — g, defined over K, and the point (p,) is a generic 
point of Gg over K. Furthermore, the correspondence u— (p,) defines a 
birational transformation of the projective space Gp, onto Gg, defined over 
K; in fact, the divisor p, consists of n distinct points (é'),- - -, (€"), which 
form a complete set of separable conjugates over K((p,)), andthe point (w) is 


-9 
determined rationally by the linear conditions upp; = 0, 1,---, 0. 
j=0 
We shall denote by Ga(W) = - -, W"-2) the associated form of the 
variety Gg, where for each 1—0,1,---,n—g, the (Wt) = (Wh, W4, 


- ++, Wi.) is a system of indeterminates in ©;; since the variety Gg is 
defined over K, the form Gy(W) is rational over K. 

Let (y) be a generic point of ©" over k&, and let y be the corresponding 
positive divisor on ©. Since y is rational over &((y)) and consequently the 
class 9) determined by y is also rational over k((y)), it follows that the 
associated form Gy(W) of the variety Gg is rational over k((y)). Let d be 
the degree of the variety Gy (as a subvariety in G;), and let A,(W), 
Ai(W),- --,A4:(W) be the set of all power products of degree d in (W), 
arranged in an order which is arbitrary but fixed once for all; if we set 


Gy(W) => GjA;(W), then the set of coefficients (Gy) = (Go, +, Gi), 
j=0 


arranged in this fixed order, determines a point in the projective space SG, 
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of dimension /, rational over K((y)). Since the variety ©” is defined over k, 
it follows that the point (Gy) defines a variety in ©, over k, which we shall 
denote by %; and the correspondence (y) > (Gg) determines a rational 
transformation ¢ of ©" onto %, defined over &. We maintain that the rational 
transformation has a unique value ¢(p) at every point (p) in ©, and that 
the positive cycle Gg of dimension n—g and degree d in ©; determined 
by the point ¢(p) consists of exactly the one variety Gg, possibly with a 
certain multiplicity. To show this, we observe ([7], § 14) that the property 
that any two points (p) and (q) in © represent linearly equivalent divisors 
is an algebraic condition over k; it follows then from this and Lemma 1 
(section 1) that the property that a positive cycle © of dimension n—g 
and degree d in G; is contained in the variety Gs can be expressed as an 
algebraic conidtion over & in terms of the point (p) in ©; and the point (G@) 
in S, representing &. In other words, there exists a set of forms A;(Y, Z) 
in k[Y,Z], where the (Y) and (Z) are systems of indeterminates in ©; 
and ©, respectively, such that the cycle G is contained in Gg if and only 
if the equations A;(p,@)—0O hold. It is evident that the equations 
A;(y, Gy) = A;j(y, 6(y)) =9 hold; if is any specialization of $(y) 
over the specialization (y) —>(p) over k, then we have the relations 
A;(p,¢(p)) =0. This shows that the positive cycle @’g determined by 
¢(p) can only have component varieties which are contained in ®g; and 
since every component in @’x has the same dimension n —g as that of Gm, 
it must coincide with Gs, so that the positive cycle @’g is a multiple of the 
variety ®g (we shall show in section 5 that Gs actually coincides with Gg). 
This evidently also shows that the specialization ¢(p) is uniquely determined. 

Thus we have shown that the points of the variety @ represent in a 
one-to-one manner the divisor classes of ©, and that for any two divisors p 
and q in ©, we have ¢(p) = ¢(q) if and only if p~q. The next step is 
to define a composition function in 8 which corresponds to the group addition 
in Do, or rather the addition in D, with respect to some one fixed divisor 
class ©. For this assume the integer nm so chosen that there exists a divisor 
class D in D, which is rational over k; it is easily seen that such an integer n 
exists. The corresponding point (@°) on % is then also rational over k. 
Let (G*) and (G?) be any two points in %, and let $8 and 9? be the divisor 
classes determined by (G*) and (@?) respectively; if (@*) is the point in B 
which represents the divisor class $* — 81 + %*— , then we define (@*) 
as the sum of (G@*) and (@?), and write (G*) = (G*) + (G*). We shall 
show that if (G*) and (G@*) are two independent generic points of B over k, 
then the point (G*) is rational over the field K = k((G*), (G?)). We observe 
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first that since both (G) and (G*) are generic points of B over k, the positive 
cycles @! and G* determined by them are subvarieties in ©", which by Lemma 
2 (section 1), are defined over the fields k((G*)) and &((G*)) respectively, 
so that both are defined over K. Let (p') and (p*) be independent generic 
points of the varieties G' and G* respectively over K ; since the divisor class 
is rational over K((p")) and the divisor class is rational over K ((p*)), 
the class + is rational over the field K((p*), (p?)) and 
consequently we have k((G@*)) C K((p"), (p?)). Now, let (g') and (q’) 
be another pair of such independent generic points of the varieties G' and G’ 
respectively, both independent with respect to the field K((p*), (p’)) over K; 
then we must also have k((G*)) C K((q'), (q?)). Since both K((p'), 
and K((q‘), (q?)) are regular extensions of K and since they are independent 
with respect to each other over K, it follows that K((p*), (p?)) N K((q), (q?)) 
=K. Therefore we conclude that k((G*)) CK =k((G'), (G@*)). In 
order to prove that the addition on & is a rational function, we have only 
to show that (G*) + (G*) is the only specialization of (G*) + (G*) over any 
specialization (G*) (G*), (G*) over k. Here again our assertion 
will follow immediately from the fact that the relation (@°) = (G@*) + (G’) is 
an algebraic condition over & between the three points. To show this, let $7, 
be the divisor classes determined by the points (G*), (G*), (G@*°) respec- 
tively, and let p*, p®, be any positive divisors in the classes 
respectively ; let o be a rational positive divisor over k in the divisor class ©. 
We recall again ([7], § 14) that linear equivalence between positive divisors 
is an algebraic condition over &; since the positive divisor o is rational over 
k and hence is represented by a rational point over k in ©”, the relation 
p* + p? ~ p°+ o can be expressed as an algebraic condition over & in terms 
of the points (p*), (p”), (p°). It follows then by Lemma 1 (the case where 
the points are not involved at all) that the relation (G°) = (G*) + (@°) is 
an algebraic condition over k. 

Thus we have shown that the addition in D, (or the addition in Do) 
induces on the variety ¥ an addition of points, which is a rational function, 
defined over the ground field *, on the product variety BX B with values 
in %. Moreover, this function is not only defined for a generic point of 
% X %, but also has a uniquely determined value for any point on BX &. 
From this it follows that the points 8 form an abelian group under this 
addition operation, with the point (G@°) as the unit element. The inverse 
of any point (G) corresponding to the class % is the point (G)-1 corresponding 
to the class 20 — $8; and in case (G) is a generic point of V over k, the 
point (G)-* is rational over &((G@)). From this it follows, by a similar 
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argument as before, that the inverse of a point is a rational function on &, 
defined over k. We shall write — for (G*) + (@?)-*. Finally, 
it is also easily seen that, for any fixed point (G*) on %, the mapping 
(G) > (G*) + (G@) defines a birational transformation of ¥ onto itself, which 
is one-to-one for every point of 8. In order to prove that & is an Abelian 
variety, it remains to shows that the function (G*) + (G?) is defined at every 
point of the product variety BX B; since a rational function is defined at 
every simple point for which it has a uniquely determined value, it is suffi- 
cient to show that the variety ¥ is non-singular. This will be done in section 
5, where this result is obtained as an application of a general theorem proved 
by us elsewhere; in the final section we shall then introduce the canonical 
homomorphism and show that it has the desired properties, thus completing 
the proof that & is a Jacobian variety of the curve ©. Before we proceed 
with this, we shall insert a digression in the next section, where we shall 
show that a derived (absolutely) normal model ¥% of the variety 2 is an 
Abelian variety and that the birational correspondence between B sad BW 
is one-to-one without exception; this result, together with the results of the 
final section on the canonical homomorphism, gives us an alternate proof of 
the existence of a Jacobian variety, though not necessarily defined over the 
original ground field &, except in case & is perfect. Although this alternate 
proof is now superseded by the more complete result in section 5, it still has 
the advantage of simplicity ; furthermore, the method of proof used here may 
have possible applications to other similar problems.* 


4, The derived normal model W. Consider a derived (absolutely) 
normal variety ¥% of ¥, which is defined over a purely inseparable extension 
K of the field &. The birational correspondence between and 
¥, also defined over K, has the property that to each point (H) of % corre- 
sponds exactly one point '{H) of %, while conversely to each point (G) of B 
correspond at most a finite number of points I-?(@) of @. Let (H') and 
be two independent generic points on over K, and let (G') = 


* [Added in proof, March 10, 1954] T. Matsusaka, in a recent paper “On the 
algebraic construction of Picard variety (II),” Japanese Journal of Mathematics,. vol. 
22, pp. 51-62, has successfully applied this method to prove the existence of the Picard 
variety over an arbitrary field; the application of this method was made possible by a 
crucial result, proved by Matsusaka in another paper, concerning the existence of a 
“regular maximal algebraic family” of divisors, Matsusaka has also shown that it is 
sufficient to take a derived normal model with reference to k, which can then be shown 
to be also absolutely normal; this removes the restriction on the field of definition 
stated above. 
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and (G?) -=1(H*) be the corresponding points on 8 under the correspon- 
dence T'; then (G*) and (@*) are also generic and independent with respect 
to each other over K. The sum (G@*) = (G*) + (G*) of the two independent 
generic points (G*) and (G?) on &, being a rational function on BX % 
defined over K, is carried by the birational coorespondence I into the rational 
function (H*) + T(A*)) on B X B®, also defined 
over K, which we shall define as the sum (H‘) + (H7?) of the two independent 
generic points (H*) and (H?) on &. Now, what is the behavior of this 
function (H*) + (H?) for a pair of special points (H*) and (H*) on ®? 
By definition, any value of the function (H*) + (H?) is a specialization (H°) 
of the point (H*) over the specialization (H*) — (H*), (H’) — (H°) over K. 
It is well known that any such specialization (H', H*, H*) — (H*, H°, H°) 
can always be extended to a specialization of (G*, G*, G*), while on the other 
hand it is clear from the above that any such extension of specialization is 
already uniquely determined by the specialization (H', H*?) — (H?, H®) alone. 
In fact, since the transformation [T(H) has always uniquely determined 
specialization at every point of we have (G') > I(H*), ~T(H°); 
from this it follows that since the sum (G*) + (G*) has always a uniquely 
determined specialization for every pair of points on %, we must have 
(G*) + 1T(H"). Since, in particular, the relation '(H*) = (G*), 
which defines the correspondence T, must be preserved under the extended 
specialization, we have the relation =T(H*) It follows 
then that any specialization (H°) of (H*), over the’ specialization 
(H', H*) — (H*,H”) over K, must be one of the finite number of points 
of %& which are carried over by T into the point T(H*) +1(H*) on &. 
In similar manner, we can consider a specialization (H°) of the point 
(H?) = — over any given specialization (H', H*)—(H?, H°), 
where the (H') and (H*) are now a pair of independent generic points of B. 
Since the specialization can also be extended in a unique way to the specializa- 
tion G*, G?) — (1 (A*), —T(H*)), we conclude that the 
relation —1T(H*) —I(H*) must hold for any such specialization, 
and hence (H”) must be one of the finite number of points of %& which are 
carried over by into the point —T(H?) of &. 

Thus we have shown that for any specialization H*)—>(H*, H°) 
the relation —1T(H*) + always holds, and that in this speciali- 
zation any two of the three points can be chosen arbitrarily on %%, while the 
third one is then one of the finite number of possible points determined by 
this relation. Let now (H*?) be any given point of Q, and let (H) be a 
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generic point of Y over K((H*)). Since the point T(H*) +T(H) is a 
generic point of over K((H*)), the relation T(H’) =T(H*) +T(H) 
determines uniquely a specialization (H*) — (H’) over the specialization 
(H', H?) — (H+, H) over K, and this point (H’) is also a generic point of 
% over K((H*)). Conversely, given a generic point (H’) of B over 
K((H*)), the same relation determines uniquely a specialization (H?) — (H) 
over the specialization (H', H*) — (H*,H’) over K, and the point (H) is 
also generic over K((H*)). In other words, for any given point (H*) of 
MW, the relation (H*) + (H) = (H’) determines a birational correspon- 
dence @,(H) = (H’) of the variety % onto itself; in fact we have 
(H’) (0(H*) + for any generic (H) over K((H*)) and 
—T(H*)) for generic (H’) over K((H*)). This bi- 
rational correspondence ®, between the normal variety ¥% and itself has the 
property that to each specialization of (H) correspond at most a finite 
number of specializations of (H’), and vice versa. For, any specialization 
(H, H’) — (H°, H°) of this correspondence @, over K is contained in a 
specialization (H', H*, H*) — (H*, H®, H°) over K, and we have seen above 
that there is only a finite number of the latter when two of the three points 
(H*, H®, H°) are given. This means that the birational correspondence @, 
has no fundamental points on YW, neither for the direct transformation ®, 
nor for the inverse transformation @,"'. Since for a birational transformation, 
or more generally for a rational transformation of a normal variety, a point 
is either fundamental or regular ({10], Th. 9 and Th. 10, note the difference 
in terminology), we conclude that the birational correspondonce ®, must be 
biregular everywhere on YW. 

From the biregularity of the correspondence @, for every point (H*) 
we can draw some important conclusions. Let (H°) be a simple point on 
W with the property that it is the only point on %& which is carried by T 
into the point T(H°) on &; we can take, for example, a generic point 
of W over K. Given any point (H”), let (H*) be any point such that 
I'(H°) =1(H*) + The birational correspondence @,, determined 
by the so chosen point (H*), will then carry the point (H*) into a point 
with the image T'(H*) + on under the correspondence I, which 
can only be the point (H°). Since the correspondence ®, is biregular and 
the point (H°) is simple, it follows that the point (H®) is also simple. 
As (H°) is any point on %&, this shows that the variety %& is free of singu- 
larities. Furthermore, let (H®) be another point such that 
then the same birational correspondence ®, must also carry (H®), into (H°), 


‘ 


468 WEI-LIANG CHOW. 


from which it follows, on account of the biregularity of @,, that (H®) = (H°). 
This shows that the correspondence T between YW and & is one-to-one for all 
points without exception. 

Thus we have shown the existence of an Abelian variety % which is in 
one-to-one birational correspondence without exception with the variety %; 
as we have mentioned in our paper [1], this is our original proof of the 
existence of a Jacobian variety for the curve ©. The underlying idea of the 
proof is the simple observation that if a variety is homogeneous in the sense 
that there is a transitive group of rational transformations which are every- 
where determined, and if the variety is normal, then it is non-singular. That 
a variety which is homogeneous in this sense, but not normal, need not be 
non-singular, is shown by the simple example of a plane cubic curve with a 
cusp, which is in one-to-one birational correspondence without exception with 
the projective line. The question naturally arises as to whether the variety ¥ 
is itself also non-singular and is already a Jacobian variety. This question, 
besides being of some interest in itself, is significant not only on account 
of the fact that & is defined over k, but also on account of the fact, proved 
in the next section, that for any point (G) in &% the corresponding variety 
in is defined over the field &((G)). 


5. Proof that the variety 8 is non-singular. Consider the rational 
transformation ¢ of ©" onto &%, defined over &; if (z) is a géneric point of B 
over k, then the inverse image ¢~*(z) is a subvariety G(z) of dimension n —g 
in ©", defined over k((z)), and for any specialization (z) — (£) over k, the 
variety @(z), considered as a positive cycle of dimension n—g, has a 
uniquely determined specialization @(f), which is a multiple of the variety 
¢*(¢). In the terminology introduced in our paper [1], the rational trans- 
formation ¢ induces an involutional system of positive cycles | G(z)| on ©*, 
and the variety @ is the associated variety of this involutional system; 
according to the main theorem in that paper ([1], p. 258), the variety B 
is simple at every point (£) for which the positive cycle G(£) coincides with 
the variety ¢-"(¢) and contains a simple point in ©". In order to prove that 
the variety @ is non-singular, it is therefore sufficient to show that (1) the 
variety ¢*(£) has the same degree (as a subvariety in ©;) for every point 
(¢) in &, i.e. the degree of the variety Gg is the same for every divisor class 
% of ©, and (2) the variety © is non-singular. 

We shall show that the degree of the variety Gg, for any divisor class 
% of degree n, is equal to h"#. By definition, the degree of Gg is the 
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number of intersections of Gg with n— g independent generic hyperplanes 

in over K, where K is a field over 

ae the divisor class 8 is rational. Consider the n — g independent generic 

hyperplanes — 0 in G,» over K, and for each 1 


i=1,---,n—g, let (é*"),---, (€+") be the A intersections of the i-th 
hyperplane with the curve ©; let p, where A = (m,,° runs through 
all the h"9 possible ordered sets of n—g positive integers m,<h, be the 
positive divisor of the class $$ which contains the n—g points (é#™*), 
i=1,-:-,n—g. It is then easily seen that the points (p*) in Gg 
corresponding to the h"-9 positive divisors p* constitute all the intersections 


t 
of Gg with the n — g special hyperplanes > 0;(v‘)¥;—=0 (¢=1,---,n—g) 
j=0 


in ©; In fact, let (q) ba the point in ©” corresponding to a positive 
divisor q of degree n consisting of the n points (y'),: - +, (y”), then we 


t m 
have > 0;(v')g; =I] ( t=1,---,n—g. Hence, all the 
j=0 k= j=0 


points (p*) of Gg lie on the n —g special hyperplanes; and conversely, if a 
point of Gg lies on these n —g special hyperplanes, then the corresponding 
positive divisors must contain at least one point from each of the n—g sets 
(€7),- -, (€6"), and hence must be one of the h*9 
positive divisors p’. Therefore, in order to prove our assertion, we need 
only show that each point (p*) is a simple intersection of Gg with these 
m—g special hyperplanes. Since the point (p*) is a generic point of Gg 
over K (it has the dimension n—g over K), it is a simple point of Gg; 
hence, we need only show that the linear variety of dimension t—n-+ g in 
©; defined by the »—g special hyperplanes is transversal to the tangent 
linear variety of Gg at the point (p*). As we shall from now on only deal 
with the one divisor p* or point (p*), for an arbitrary but fixed A, we can 
drop the superscript A and simply write p or (p). 

Let (z*),---, (z") be the n points contained in the divisor p, where we can 
assume that the first —g points are the points i= 1,---,n—g. 
Since p is a generic positive divisor of the class $8 over K, all the n points 
(z'),- - -,(a™) are generic points of the curve © over K and distinct from 
each other ; moreover, any n —g of these points are independent with respect 
to each other over K and determine the remaining g points uniquely. Hence, 
there is an automorphism of the field K((z'),- - -,(2")) over the field 
K((p)) which carries any set of n—g of the n points (2),- - -, (z") into 
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any other such set. If > vjA;==0 is a hyperplane in ‘ containing one of 
the n points (2°*),- ), then the hyperplane ;=0 in G, 


evidently contains the point (p); we shall call wii a hyperplane in ©; 
a derived hyperplane of the hyperplane in G». The n—g special hyper- 


planes j=0 considered above are all derived 
since each hyperplane in Gm contains the point 
(z*), t= 1,---,n—g. each 1,: - -,m—g, the hyper- 
plane vi,X,—0 is a generic hyperplane over K((z"),-- -,(2")) of the 
ge all hyperplanes in ©, which contain the point (z‘); and the 
n—g hyperplanes considered as generic 


hyperplanes of the n — g corresponding systems over K((z"),- - -, (2")), are 
independent with respect to each other over K((2"*),: - -, (2")). 

Consider now the set of all derived hyperplanes in ©;; it can be easily 
seen that the point (p) is the only point in SG; common to all these hyper- 
planes. For, each point in ©; corresponds to a form of degree n in (V), and 


n m 
the form > p,0;(V) =I] (= 2*;V;) is obviously the only form of degree 
j=0 4=1 j=0 


nm in (V) which vanishe for all the solutions of any one of the n linear 


equations ¥ t+=1,---,n. Thus there exist ¢ linearly indepen- 
dent auvived hyperplanes in ©;, and consequently there is a set of n—g 
hyperplanes (*—1,---,n—g) in Gm, each containing a 
point (2) of the divisor p, such that the n—g derived hyperplanes 
(4) Y;=0 («=1,---,n—g) define a linear variety of dimension 


t—n-+g transversal to the tangent linear variety of Gg at (p). The 
nm—g points (r+), 1=1,---,n—g, are distinct; for, otherwise there 
would be infinitely many positive divisors in the class $$ which contain all 
the (less than n—g distinct) points (7), 1=1,---,n—g, and the 
corresponding points in Gg would then be contained in the n —g derived 


t 
hyperplanes 0;(v‘)Y¥;=0 and consequently these 
j=0 


derived hyperplanes could not define a linear variety transversal to the tan- 
gent linear variety of Gg at (p). Thus, for each i—1,- - -,n—g, the set 
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is a solution of the linear equation > Let 
j= 


n—g, be respectively generic solutions of the linear equations = r;V;= 0, 


i=1,---,n—g, over K((z),- - -,(a")), independent with to each 
t 

other over K((z),- - -,(x")); then the n —g derived hyperplanes } 0,(0')Y; 


= 0 (t—1,---,n—g) define a linear variety which is also transversal to the 
en linear variety of Gg at (p). For, the set of n—g hyperplanes 


20, (v4)¥;=0 is a specialization over 
(z)) of the set of —g hyperplanes (t=—1,---,n—g), 
and the former set defines a "arly ail transversal to the tangent 
linear variety of Gg at (p). Now, there is an automorphism of the 
field K((z),:--+,(z")) over K((p)), which carries the n—g points 
(z»),- -, (29) into the n—g points (z'),- --, respectively, and 
this automorphism can be extended into an isomorphism between the fields 


by the correspondence (*") In this isomorphism 


the derived hyperplanes 0;( Y;=0 - -,n—g) are carried over 
into the derived Q;(vt)Y¥;=0 since 


the former set defines a linear ‘ens of Gg at (p), the same must be true 
of the latter. This completes the proof of the assertion that the degree of 
Gy is 

Next, we shall show that the variety @" is non-singular. We observe 
first that in a similar manner as above, by taking (p) to be a generic point 
of ©" over & and considering n instead of nm —g derived hyperplanes in ©, 
we can show that the degree of the variety ©" is equal to h™. In fact, the 
proof in this case is somewhat simpler than the above, for there it is not 
necessary to consider any automorphisms of the field &((2"),- -, (2")). 
Now, let (q) be any point in © and let the corresponding positive divisor q 
be composed of the n (not necessarily distinct) points (7'),- - -, (y") in &. 


Consider the linear equations (t= 1,---,); let. (p*) 
j=0 


(t=1,---,n) be m generic solutions of these n equations, respectively, 
over k((m'),- ~~, (y")), independent with respect to each other over 


(y")) ; for each i=1,---, 1, let (nt?) = (nt), (nt?), 
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be the h intersections of the hyperplane > p*;X; = 0 with the curve ©. Since 
j=0 


€ is non-singular (and the hyperplanes have been chosen as above), the 
points (i= are all distinct from each other 
and different from the points (i =1,---,n). It follows then that all 
the h" positive divisors q* = ---, where the A = My) 
runs through all the h” possible ordered sets of n positive integers m Sh 
(‘—1,---,m), are distinct from each other. It is easily seen that the h* 
corresponding distinct points (q‘) are precisely the intersections of ©* with 
the linear variety of dimension t—n in ©; defined by the hyperplanes 


t 
> 0;(p*)¥; = 0 (t—1,---,n). Since the degree of is it follows 
j=0 


that each point (g*‘) and hence in particular the point (q) is a simple 
intersection of ©" with this linear variety. This shows that (q) is a simple 
point in ©", and as (q) is an arbitrary point in ©”, this means that the 
variety ©" is non-singular. 

Finally, we observe that in exactly the same manner as above, we can 
show that there exists a linear variety of dimension {—n-+-g in ©; which 
intersects simply any given @g at any given point (q); it follows then that 
the variety Gg is also non-singular for every {$. 


6. The canonical homomorphism. Consider again the -rational trans- 
formation ¢ of ©" onto ¥; we shall show that it generates in a natural way 
(by linear extension, in the terminology of Weil) a rational homomorphism 
® of 9,(€) onto the Abelian variety B with the subgroup 9,(€) as the 
kernel, and that the so defined homomorphism © is defined over k and has 
the “universal mapping” property. We observe first that if p’,- - -, p* and 
q’,: - -,q® are two sets of positive divisors of degree n in ©, then we have 

8 8 8 
the relation } ¢(p‘) = } ¢(q‘) if and only if > pd q‘; in fact, let o be 
i=1 4=1 4=1 4=1 
any divisor in the class D and let p° and q° be respectively positive divisors such 
8 8 
that the relations pi~ (s—1)o+p° and qi~ (s—1)0+ q° hold 
respectively, then we have, by definition, the = (p*) and 


), and it is clear that ¢$(p°)=¢(q°) if only if 


Tpi~ Sai. We shall call this relation the compatibility condition for 
i= 
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the transformation ¢. Now, let (é'),---,(é*") be n—1 independent 
generic points over & in ©, and let (xz) be a generic point of © over 
(&*))5 let (y) be the point in ©” representing the divisor 
consisting of the points (é'),- - -, (é""") and (a), and let ¢’ be the rational 
transformation of © into B determined by the correspondence (x) > ¢(y), 
which is defined over k((é'),- -, (€"")). For any divisor p= 


in $.(C), we define &(p) —Sng@’(c‘); it is clear that the so defined 
4=1 


mapping ® is a rational homomorphism of 9,(€) into &%, defined over 
k((é),: > +, (é™*)). It can be easily seen from the compatibility condition 
that the kernel of the homomorphism @ is precisely the subgroup 9;(€), and 
that this definition of ® is actually independent of the choice of the points 
- ("1"). If (m*),- +, (y™") are n —1 independent generic points 
of © over k, independent with respect to the points (é'),- - -, (€"") over k, 
then the rational homomorphism is also defined over k((y*),° 
it follows then that must be defined over 


In fact, if p is any divisor in G, rational over any extension K of k, 
then the point ®(p) is rational over both K((é'),---,(é*) and 
(y"*)) and hence must be rational over 


K((&),°- K((7*),° (y*") ) == K, 


Consider now a rational homomorphism ¥ of $,(€) into an Abelian 
variety Mf, defined over an extension K of k; let (z) be a generic point of B 
over K, and let (q) be a generic point of the variety ¢4(z) over K((z)) 
and q be the positive divisor represented by (q). Since the divisor class D 
is rational over K,. it contains a divisor 0 which is rational over K; the 
point ¥(q—o) is then rational over K((q),(z)). The correspondence 
(q) > ¥(q—o) then determines over K((z)) a rational transformation of 
¢*(z) into %; since ¢*(z) is birationally equivalent to a projective space, 
it follows from a result of Weil ([9], Th. 8, Cor.) that this rational trans- 
formation must be a constant, so that the point ¥(q—o) is rational over 
K((z)). The correspondence (z) >¥(q—o) then determines a rational 
transformation ¥, of ¥ into Y, defined over K, which carries the unit element 
in ¥ into the unit element in %; according to another result of Weil ([9], 
Th. 9), this rational transformation ¥, must be a homomorphism of & 


into Y. Let p= n,(z2*) be again any divisor in %,(€), and for each 
é=1 
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1=—1,---,n, let y* be the positive divisor of degree n consisting of the 
8 
points and since }'n;—0, we have evidently the 
8 
equation p= >} ni(y'—o). We have then 


Thus we have shown that ® is the canonical homomorphism, and this 
completes the proof that B is the Jacobian variety of the curve G. 

Once we have obtained the canonical homomorphism ®, we can derive 
the rational transformation ¢ from it by the formula ¢(q) = ®(q—o0), and 
this formula holds for every positive integer n > 2g —2, provided there exists 
a rational divisor 0 over k of degree n. In fact, in order that the rational 
transformation ¢ be defined over k, it is sufficient that o is contained in a class 
© such that the variety Gp is defined over k. In general, we shall say that 
a divisor class $$ in D, (nm > 2g — 2) is rationally defined over a field KA if 
the variety @g is defined over K; and an arbitrary divisor class is said to be 
rationally defined over K if it can be expressed as the difference of two divisor 
classes which are rationally defined over K. It is clear that a rational divisor 
class over K is also rationally defined over K, but the converse of this statement 
is not generally true, as can be shown by examples. It can be easily shown 
that, for any positive integer n, if there exists a rationally .defined divisor 
class over k of degree n, then the formula ¢(q) = @®(q—o) determines a 
rational transformation ¢ of ©" into %, defined over k. We maintain now 
that, conversely, if there is a rational transformation of ©" into &%, defined 
over k, which generates by linear extension the canonical homomorphism 4, 
then there exists a rationally defined divisor class over k of degree n in G. 
In fact, for every positive integer s, the rational transformation ¢ induces 
a rational transformation ¢, of €"* into &, determined by the formula 


8 
¢.(p) => ¢(p*), where (p) is any point in and (p'),- -, (p*) are s 
points in ©" such that we have p=} p‘ for the corresponding positive 
divisors, the particular choice of the points (p*),- - -, (p*) being immaterial 
on account of the compatibility condition. It is clear that the rational 
transformation ¢, is defined over k, and that, again by virtue of the com- 
patibility condition, we have the equation ¢,"(¢'(p)) = Gg, where $ is 
the divisor class determined by p. It is also clear that for sufficiently large s, 
say s= 4g, the rational transformation ¢, is onto; if 9%, is the divisor class 


8 8 8 
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of degree ns such that $,7(0) = Gg,, then $8, is evidently rationally defined 
over k. The divisor class %..:—%s, for any sufficiently large s, is then a 
rationally defined divisor class over & of degree n. In case n 1, we have 
the “canonical function ” of Weil, and our result offers an answer to the 
question raised by Weil ([9], p. 68) as to when a field of definition is 
“complete ” for a curve. 


Finally, we shall add a remark concerning the special case when the 
curve © contains a rational divisor over k of degree 1; such is always 
the case, for example, when the field & is finite, as has been shown by F. K. 
Schmidt ([6], p. 27). We shall show that in this case every rationally 
defined divisor class over any field K (containing /) is also rational over K, 
so that a divisor class §§ is rational over the field &((Gg)). To prove this, 
let 0 be a divisor of degree 1, rational over k, and let be the class deter- 
mined by o. Let $$ be a rationally defined class of degree n over K, and let 
(p) be a generic point of Gp, over K. Then the corresponding positive 
divisor p is rational over K((p)), and consequently the class % is rational 
over K((p)). There exists an integer r, such that the class O = $%—rO 
of degree » —r has exactly the dimension 0 and hence contains exactly one 
positive divisor q. Since the class Q is rational over K((p)), the divisor q 
is rational over K((p)). Now, let (p’) be another generic point of Gg over 
K, independent with respect to (p) over K; then, by a similar argument, 
we can conclude that q is rational over K((p’)). It follows then that the 
positive divisor q is rational over K((p)) N K((p’)) =X, and consequently 
the class © is rational over K. This implies that the class % = Q + rO is 
also rational over K. 


REFERENCES. 


[1] W. L. Chow, “Algebraic systems of positive cycles in an algebraic variety,” 
American Journal of Mathematics, vol. 72 (1950), pp. 247-283. 


[2] » “On compact complex analytic varieties,” American Journal of Mathe- 
matics, vol. 71 (1949), pp. 893-914. 
[3] » “On the defining field of a divisor in an algebraic variety,” Proceedings 


of the American Mathematical Society, vol. 1 (1950), pp. 797-799. 


and B. L. van der Waerden, “ Zur algebraischen Geometrie. IX. Ueber 


[4] 


WEI-LIANG CHOW. 


zugeordnete Formen und algebraische Systeme von algebraischen Mannig- 
faltigkeiten,” Mathematische Annalen, vol. 113 (1937), pp. 692-704. 


[5] S. Lefschetz, “On certain numerical invariants of algebraic varieties with appli- 
cation to Abelian varieties,” Transactions of the American Mathematical 
Society, vol. 22 (1921), pp. 327-482. 


[6] F. K. Schmidt, “Analytische Zahlentheorie in Koerpern der Charakteristik p,” 
Mathematische Zeitschrift, vol. 33 (1931), pp. 1-32. 


[7] B. L. van der Waerden, “ Divisorenklassen in algebraischen Funktionenkoerpern,” 
Commentarii Mathematici Helvetici, vol. 20 (1947), pp. 68-109. 


[8] A. Weil, Foundations of Algebraic Geometry, New York, 1946. 
[9] , Varietes abeliennes et courbes algebriques, Paris, 1948. 


[10] O. Zariski, “ Foundations of a general theory of birational correspondences,” 
Transactions of the American Mathematical Society, vol. 53 (1943), pp. 
pp. 490-542. 


LIE RARY 


LOEWY-GROUPOIDS RELATED TO LINEAR GRAPHS.* 


By GeERT SABIDUSSI. 


1. Introduction. It is not intended to cover much of the theory of 
L(oewy)-groupoids here. We are content with pointing out a case where 
L-groupoids appear in the theory of linear graphs. 

We begin with a definition: Let G be a finite abstract graph containing 
no 1-circuit, and let H be a subgraph of G. 


Definition. By M(G,H) is denoted the set of all univalent transforma- 
tions 7’ of H into G which associate i-simplexes of H with i-simplexes of G 
(t= 0,1), and preserve the incidence relations of H. 


Notations. Throughout this paper any transformation T as defined above 
will be represented by a transmutation 7’, of the vertices of H together with 
a transmutation 7’, of the edges of H. The totality of transmutations T, for 
which there is a 7, such that (70,71) is an element of M(G,H) will be 
denoted by M,(G, H), and the set of those transmutations 7, for which there 
exists a T, with (7, M(G,H) will be denoted by M,(G, H). 


By G(H) we denote the subgraph of G consisting of those vertices and 
edges of G which are contained in at least one graph TH, Te M(G,H). In 
symbolic form: G(H) = M(G,H)H. Evidently M(G(H),H) = M(G,#). 

By &, G(H), §, etc., we denote the groups of automorphisms of G, G(H), 
H, etc. We agree that an automorphism of a graph G be represented by a 
permutation P, of the vertices of G together with a permutation P, of the 
edges of G. Here we define @, as the group of those permutations P, for 
which there exists a P, with (Po, P:) eG, and ©, as the group {P,} with 
(Po, P:) G. 

To characterize L-groupoids we shall use the postulates given by A. 
Loewy 

Given a set Yt of elements which contains a distinguishable subset 3 
(the nucleus), and a combination MM’ of elements M, M’ of M, M is called 
a left L-groupoid, if it satisfies the postulates: 


* Received September 28, 1953. 
1A definition of a similar groupoid was given by R. Baer [1]. Any Baer-groupoid 
satisfies (1) (2) (3) (4), but not conversely. 
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(1) % is a group under the combination MM’. 


(2) MN exists for any NeN, Me Dt—M, and is a uniquely deter- 
mined element of Yt —%. 


(3) Let M, M’, M” be elements of Mt, then MM’ and (VM’)M” exist, 
if M’M” and M(M’M”) exist.? and conversely. M(M’M’) = (MM’)M’ 
= MM’M” holds. 


(4) There is no 2* between Mt and ¥ such that the elements of N* 
and M—M* satisfy (1) (2) (3). 


If we replace postulates (2) (4) respectively by the postulates 


(2’) NM exists for any Ne Me Yi —M, and is a uniquely deter- 
mined element of Yi—9; and 

(4’) There is no * between Mt and Y such that the elements of * 
and Yt{— MN* satisfy (1) (2’) (3); 
we obtain the set of postulates which define a right L-groupoid. Since an 
[-groupoid has just one nucleus (cf. [3], p. 241, 8), 9% may be characterized 
as the maximal group in M such that Yt admits of a decomposition into 
cosets of 


2. The Structure of M, M,, M,. The following theorem is fundamental: 


THEOREM |. M(G,H) is a left L-groupoid,® § being the nucleus of 
M(G,#H). 


As a consequence we have: 
THEOREM II. A necessary and sufficient condition that M(G,H) be a 
group is that G(H) = H. 


Proofs. Theorem I: We prove: A n.as.c. that a transformation 
TeM(G,H) satisfy M(G,H)T = M(G,M) is that T be an automorphism 
of H. 


This implies that is the maximal group in M(G, H) such that M(G, H) 
is decomposable into left cosets of §. Hence M(G,H) is a left L-groupoid, 
§ being its nucleus. 


2? MM’, (MM’)M”, ete. need not be elements of QM. 
* More precisely, M(G,H) is a Baer-groupoid. 
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1. The condition is sufficient. Let Se M(G, H), Te, then STH = SH. 
Hence STe M(G,H), M(G,H)T=M(G,4#). 


2. The condition is necessary. Let 7 —=(T ,T:) be an element of 
M(G,H) which is not contained in §. Then 7, cannot be a permutation. 
Hence no product ST, S M(G, H), can exist. M(G,H)T =M(G,H) can- 
not hold. 


Theorem II: That the condition is sufficient is evident. The condition 
is also necessary, since Theorem I implies that: (G,H) is a group if and 
only if M(G,H) =. But M(G,H) =§ is equivalent to G(H) =H. 
Theorem II may be given the form: A necessary and sufficient condition that 
M(G,H) be a group is that G contain no subgraph H’ 4H which is iso- 
morphic with H. 

Theorem I implies that M,.(G,H), M,(G,H) are also L-groupoids. We 
shall now discuss the relations between the three groupoids M, Mo, M,. Let 
G be any graph, then, with the notations as defined in the introduction, and 
(’ (a direct product of symmetric groups) being the subgroup of © con- 
sisting of all elements (10, P,) (1o identity of is a normal 
subgroup of G, and G) = G/W holds. If G contains no 2-circuit, &’ reduces 
to the identity, hence G) = ©. If, moreover, G contains no connected com- 
ponent isomorphic with a single edge incident with two vertices, ©) = ©, = G. 

One might conjecture that a similar relation holds for M(G, H), M,(G, H), 
M,(G, H), but this is not the case. 

Suppose that G(H) contains no 2-circuit, and H contains no connected 
component isomorphic with the graph H, consisting of two vertices joined by 
an edge. Then given any 7,¢M,(G,H) there is just one 7, ¢ M,(G, H) 
such that (7,71) is an element of M(G,H), and conversely. Thus the 
following one-one correspondences are set up: 


M,(G,H)< 


With respect to these correspondences the behavior of My, M, is different. 
We have 


THeoreM III. Jf G(H) contains no 2-circwt, and H contains no 
connected component isomorphic with Ho, M,(G,H) =M(G,H) holds; 


whereas the correspondence M,<—> M is not necessarily an isomorphism. 


Proof of Theorem III. Since M, M, are in one-one correspondence, a 
n.a.s.c. that the two groupoids be isomorphic is that their left nuclei be 


isomorphic. § = §,, hence , must be the left nucleus of M,(G,H). But 
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this is obvious, for otherwise the nucleus of M,(G,H) must contain a true 
transmutation, and hence cannot be a group. 


To discuss M,(G,H) it is useful to introduce subgraphs PH of G(H), 
PeM,(G,H), where PH means the graph TH, P<—>T ce M(G,H) under 
the correspondence M,<—> M. We may also write G(H) = M,(G, H)H, ete. 

Let N be a subgraph of G(#) such that: 


1. N has the same vertices as H; 

2. N(H)—N; 

3. Any subgraph H’ of G which ie isomorphic with H 1s contained in a 
subgraph N’ isomorphic with N; 

4. M,)(N,H) ts a group; 

5. N is maximal with respect to 1, 2, 3, 4. 


Since H itself satisfies 1, 2,3,4 such an NW exists. Theorem VI will 
show that it is unique. 

Let 9 be the group of N, then N, = M, as N is a subgraph of G(H#), 
and G(H) was assumed to contain no 2-circuit. 

The above definition of N makes it possible to formulate the following 
theorems. 


THeEorEM IV. If G(H) has the same vertices as H, M,(G, H) is a right 
L-groupoid, Go(H) being its nucleus. 

THEOREM V. If G(H) contains no 2-circuit, Mo(G,H) is a left 
L-groupoid, %, being its nucleus. 


THEOREM VI. WN is uniquely determined by G, H. 


THEOREM VII. If G(#) contains no 2-circuit, a necessary and sufficient 
condition that M,(G,H) =M(G,H) is that N=H. 


Proofs. Theorem IV. As G(H) and H have the same vertices, the 
elements of G.(H) and M,(G,H) are permutations of the same vertices, 
and hence may be combined in an arbitrary manner. Obviously )(H) is a 
subset of M,(G, H). 


We shall prove: A n.a.s.c. that 7>M,)—M, hold for a permutation of 
the vertices of G(H) is that T, e G)(H). 

The condition is necessary: Let S,eM,.(G,H), Toe G(H). Then 
there is an S,¢ M, such that S = (So, eM, similarly for Both SH 


ue 
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and 7'SH are subgraphs of G(H); S, T preserve the incidence relations, 
hence My(G, H). 

The condition is also sufficient: Let a’ be an edge of G(H). Then, 
according to the definition of G(H), there is at least one Se M(G, H) such 
that SH contains a’ - S = (So, 81) So = To Ro, Ro H), since To Mo = Mo. 
That means there is an edge a of H such that S) maps the end points (P, Q) 
of a onto the end points (P’, Q’) of a’: 8o(P, Q) =ToRo(P, Q) = (P’, 
Ry(P, Q@) = (P’, Q’). RoY are also vertices of G(H), and since 
R,e Mo(G,H), there is an R = (Ro, R,) e M(G,H), in other words RP, 
R,Q are joined by an edge of G(H). This means that T,P’, To*Q’ is an 
edge of G(H), and since a’ was arbitrary, T,-'e¢ @.(H). From this, as in 
the proof of Theorem I, the assertion follows. 


Remark. It can be obtained by the same reasoning that Gp is a sub- 
group of ©(H) for any H which is a subgraph of G and has the same 
vertices as G. 


Theorem V: As NV has the same vertices as H, the elements of §, and 
and M,(N,H) are permutations of the same vertices, and hence can be com- 
bined in an arbitrary manner. 

We infer from Theorem IV that a n.a.s.c. for Mo(N, H) to be a group is 
that H) = %.(H). N(H)=N implies 9,(H) = No, hence M,(N, H) 
=. Condition 2 of the definition of V implies that any Te M,(G, H) 
can be written in the form where Pe M,(N,H), Se M,(G, N), 
hence M,(G,H) C M.(G,N)M(N,H). M.(G,H) D> M.(G,N)M(N, H) is 
obvious, so that we obtain M,(G, H) = M,(G, N)M(N, M(N, H) = No 
is contained in the left nucleus of M,(G, NV), hence VY) = M,(G, H). 

That means Jt, is contained in the left nucleus of M,(G,H). We will 
now prove that Jt, itself is the left nucleus of 1/,(G, H). 

Denote, temporarily, by 9to* the left nucleus of M)(G, H), and suppose 
that 9, is a proper subgroup of Jp*. 

We denote by V* the subgraph of G(H) which consists of all subgraphs 
P;H, No* = {Pj}, in symbols N* —9,*H. N* has the same vertices as H, 
and it is evident that N*(H) = N*. 

M,.(G,H) may be decomposed into cosets of So, i.e. Mo(G,H) 
+ PpOo. The cosets are in one-one correspondence with 
those subgraphs of G which are isomorphic with H. 

Since 9t)* is the left nucleus of M,)(G,H), we have the decomposition 
M,(G,H) =M* + -+ and accordingly G(H) = N.*H 
+ RM *H +---+ RMN *H = N* + R.N* +---+ R,N*. This means 
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G(H) is decomposed into a set {Rj;N*} of r graphs isomorphic with N*, 
no two of which have a common subgraph isomorphic with H. Also condition 
2 of the definition of N is satisfied by N*. 

N.* = H + 8.H +: 8sHo. Hence N* contains the graphs H, 
S.H,:--,S8,H. Suppose that N* contains a graph H’ isomorphic with H, 
but different from the Si:H. In M,.(G,H) H’ corresponds to a coset PS, 
different from the Sio. P&» must be contained in one of the cosets B,9,*, 
hence the graph R;,N* also contains H’. This is a contradiction, as no two 
of the graphs R;N* have a common subgraph isomorphic with H. 

Hence H, S.H,: --,S,H are the only subgraphs of N* which are iso- 
morphic with H. But this implies M,(N*,H) =,*. Hence M,(N*, H) 
is a group, i.e. condition 4 of the definition of N is also satisfied by N*. 

No C Mo*, Mo A Mo* implies that NV is a proper subgraph of N*. This 
is a contradiction, since N has been defined as a maximal graph in G(H) 
satisfying 1234 (This is condition 5). Hence Yt, is the left nucleus of 
M,(G, HH). 


Theorem VI: 1,(G, H) contains just one left nucleus (cf. [3], p. 241, 8), 
hence 2, = M,(N, H) is uniquely determined by J/)(G, H). Let N’ be a graph 
which has the same vertices as H, and also satisfies 2345 of the definition of 
N. Then M,(G, N’) = M,(G, H) = M,(G, N). Hence M,(N’, H) = M,(N, H), 
and N’ = N’(H) = M,(N’, H)H = M,(N, H)H = N, which proves the unique- 
ness of N. : 

Theorem VII: Sufficiency is obvious. To prove necessity suppose H 4 N. 
Then must be a proper subgroup of For So = Mo = Mo (N, H) implies 
Hence § and M, follows 
(cf. [3], p. 245, Satz 1). | 


3. Conversion theorem. 


THEOREM VIII. Given a finite group &, and a finite set Di,- - -, Uy 
of finite left L-groupoids, there always exist cyclically connected graphs 
G, H,,---,H,, and connected graphs k=1,:--, 
where s; 1s any given positive integer, such that: 

1. any H# is a subgraph of H;; any H; ts a subgraph of G; 

2. H;i is a subgraph of H# if and only if j Sk; 

3. any M(H, Hi) = Hi) = Hi) = Hy), Sh, 

is @ group. 


( 
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4. & is tsomorphic with the group of G; 
5. M(G, H*) = M,(G, H*) = M(G, Hi) = Hi) = Mi, 


Proof. According to a theorem by R. Frucht [2] one can always find a 
cyclically connected cubical graph whose group is isomorphic with a given 
finite group. Denote by ; the nucleus of 9t;, and let H;’, G’ be connected 
cubical graphs whose groups are isomorphic with §;, & respectively; += 1, 

-,r. Let H;” be the graph which is obtained from /H/;’ by subdividing every 
edge of H;’ by i vertices. Thus neither G’ nor any H;” can contain a subgraph 
isomorphic with H;’, is4j. Obviously is connected and has a group 
isomorphic with 

We construct a graph H;' by adding to H;” five vertices Qi, R,°, Ri, Bi?, 8; 
joining Q; with every vertex P,j of Hy’, with Q;, with with 
and S; with Qi, R,°, Ri, R. The definition of H;*, k = 2, is, by induction: 
H# is the graph obtained from H,** by adding a vertex R,*** joining it with 
R# and 8;, k =1,---,s; Evidently Hjj is a subgraph of H# if and only 
if j=k. 

Now we join S; with every vertex P;i of Hj’, and subdivide the edge 
PS; by j vertices. Since H;’: is connected, the graph H; thus obtained is 
eyclically connected, and obviously contains all H*, k =1,- - -, s;. 

Consider W(H;*, Hi/), Sk. Q; is the only vertex of H;*, which is 
joined with every vertex of H;’, H?’ is the only subgraph of H;*, H;/ iso- 
morphic with H/’; hence any transformation of M(H#, must map 
onto itself, and Q; onto Q;. Since §; is the only vertex of degree & + 3 of the 
complement of H;” in H;*, 8; is mapped onto itself, hence Ry > RF, and 
consequently Ri 7 Sk. In other words Hii) = group of 
automorphisms of H;”. If we repeat the above reasoning replacing H# by 
H;, we obtain (H;; H;/) = group of H;’”. This completes the proof of 123. 
M(H#, = Hi’), = M,(Hi, Hi) is obvious, since the 
only subgraph of H; which has the same vertices as H;’, and contains H;/ is 
H# itself (cf. the definition of N in section 2). 

Take n; copies Hii, - -, Hin, of Hi, where nj; is the order of || 92;/%; ||, 
identify Si; with a vertex of Hij.”, say Pija', -+,m—1, finally 
identify Sin, with P;,1. The graph H;* thus obtained is cyclically connected. 
To obtain G identify the vertices 8;, and Siz, i—=1,-- -,7r, and join Siz, Siz 
with every vertex of G’. Since G’ and H;* are cyclically connected, so also 
is G. 

Since no H; is isomorphic with a subgraph of an H;, j 1%, and no H; 
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admits of a “ reflection ” as automorphism, the group of the complement of 
G’ in G is the identity, if r= 2, or a cyclic group 3n, if r—1, mapping 
consecutive subgraphs H,; onto consecutive subgraphs H,; of H,*. This makes 
it clear that for any r= 1 the vertices of the complement of G’ are fixed 
vertices of G. Hence the group of G= group of G’ = © by construction 
of @’. 

There remains to prove M(G, H;) =M;. (That M(G, H;) = Mo(G, Hi) 
ete., is obvious). From this M(G, H**) = M; follows directly, since M(G, H}#) 
= M(G, Hi)M (Hi, H#). 

According to a theorem of Loewy (cf. [3], p. 245, Satz 1) two L-groupoids 
M, MY’ are isomorphic if and only if their nuclei §, §’ are isomorphic, and 
order || M/H || = order || M’/H’ ||. By the construction of H; the nucleus 
of M(G, H;) is isomorphic with §;. As already pointed out in section 2, proof 
of Theorem V, the elements of the factor-groupoid of M(G, H;) by its nucleus 
are in one-one correspondence with those subgraphs of G which are isomorphic 
with H;. Hence the order of this factor-groupoid is precisely the number of 
different graphs isomorphic with H;. H;* contains just n; such graphs, while 
G’, H;*, 74%, contain none. Hence the order of the factor-groupoid of 
M(G, H;) by its nucleus is n; = order || Nti/H; ||, and M(G, H;) = M; follows. 


4. Noteworthy special case. In view of Theorem V it would be of 
interest to have an explicit method of constructing graphs N with M,(N, H) 
a group. Unfortunately no such method can be given, except for one trivial 
case: Let n be the number of vertices of H, then My(Cn, H) = Gn, where C, 
is the complete graph on the vertices of H, and ©, is the symmetric group 
on n symbols. 

Clearly, for any given special case the graphs NV can be computed, which 
shall be done here for H = W being a Whitney line of the given graph G. 

Let D, be the dihedral group on n symbols, then clearly the group of 
automorphisms of an n-circuit (a Whitney line) is isomorphic with D,, and 
will always be referred to as D,. The cyclic group 3, on n symbols is a 
normal subgroup of Dp, and Dn/3n = Bo. 


THEOREM IX. M,(G,W) is either a group, or else a left L-groupoid 
with nucleus Dy. 


Let C,,’, n even (if n is odd, no such graph C,’ can exist), be the maximal 
subgraph of C, which contains no k-circuit, k odd. If the labels 1,-- -,n 
are associated with the vertices of C,’, and W = (i,---,%,) denotes any 
Whitney line of C,’, C,’ is characterized by the condition that two vertices 4,, % 
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of C,’ are joined by an edge if and only if j —k is odd. The group G, ofC,’ 
consists of those permutations ) for which r;—1r, = 7 — k (mod 2). 
D Su X Su D Am Sm D An Un D Am D EC, m — 1/2, n= 10, is a 
composition series of 


THEOREM IX’. M,(G,W) is a group in the following cases only: 
My(G,W) =Dn, of G(W) = W; 
M,)(G,W) if G=C,; 
M,.(G, W) =@,, if n is even, and G(W) =C,’. 


Proof of Theorems IX, 1X’. . Let the notation be so chosen that 
W=(1,:--,n). The totality of permutations P for which M,(G, PW) 
= M,(G,W) is obviously a group; we denote it by Ito- Io C Mo(G, W), 
Dn C Mo. 


By F& we denote the set of those numbers p for which G(W) contains an 
edge joining the vertices k, k-+p for some k&. Let P=(*--" ) eo, 
pi = Mir — mM, (Modn), lSp<n, then ppeR. Otherwise there must be 
a k such that the edge joining k, k + p; is not contained in any Whitney line 
of G, are the elements which generate 
D,°Z" =1, T?=—1.) This is a contradiction, since Z, P are elements of No. 

This makes it evident that Jt) =D, and Rk = (1,n—1) are equivalent 
statements. Hence we may confine our interest to the case that RF contains at 
least one element p, 2 = p= n—2. 

Let P= yeMNo,~Dn, and pi, 1,- - -,n as defined above. 
Without loss of generality we may assume that 2S p;< [n/2] for some i 
(otherwise take PT’ c 9, instead of P). Consider 


Ox = 

= h =m 1,3,°--, 
where [n/2], r—1,---,n, if 
k—=1; 7=p;(mod2), k(p;,—1) +1, if k=2. The difference of 
the numbers sj, s;,, into which j,7 +1 (j=1,:--+,m) are carried by Q; is 
congruent to 1, m—1, or r. Hence Q,¢M)(G, W); moreover, as we shall 
see below, Mo(G, W) implies W). Suppose Q; M,(G, W). 
My(G, W) carries over 

i> m +7, 
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over all odd or even numbers between 1 and n— 1, according as p; is odd or 
even. If pp< (n+h—1)(k+ 1) the maximum value attained by 7* 
under the above conditions for o, 7 is (k + 1) (pi; —1) +1 which is precisely 
the upper bound for 7 in the conditions for Q:.:. Thus 4/,(G, W) is 
' ascertained, and the whole reasoning may be repeated with Q; replaced 
by 

After a certain number of steps we obtain that any += pp; (mod 2), 
2S7rSn—2, is an element of R. If n is odd, any 7p; (mod 2), 
2=7’=n—2, is also contained in R, since re R implies n—-reR. This 
means = (1,---,n—1), G(W) =G=C,. 

If n is even, any Pe, must have at least one odd p;, which implies 
1,3,5,- - -,n—3,n—1eR, i.e. C,’ is a subgraph of G(W). That means 
M,(G, W) is an L-groupoid whose nucleus is either D, or ©,. Assume the 
latter, and G(W) ~C,’. Then G(W) must contain a Whitney line con- 
taining two edges (r—a,r-+a), (s—b,s-+ 6) which are not contained 
in C,’. 

Let W’ = (i,’,-- +, %n’) be a Whitney line of C,’, n = 4, W = tm) 
a Whitney line of C,,, m —n/2, then we denote by C,” the graph which is 
obtained by identifying the vertices i; and 7 =1,: -,m, of C, and C,’. 
Clearly C,”” is independent of W, W’, and C,”(W) =C,’ holds (this can 
easily be proved by induction). C,’’(W) =C,’ implies H(W) = (C,’ for any 
subgraph H of C,” which contains C,’.. In other words, let (r—a,r-+ a), 
(s—b,s+) be the edges introduced above, then r+ ss4a+b (mod. 2). 
Hence G(W) contains a Whitney line in which (r—a—1,r+4a), (r+a, 
r—a), (r—a,r—a-+1) are consecutive edges; i.e. there is a P ¢ M,(G, W) 
carrying over 7—a—1—>r—a—1, r—a+2—>r—a-+l. 


Q = (r—a,r—a+2)(r—a+1,r—a+3) 
hence PQe M,(G,W). PQ carries over 
r—a—>r—a+l. 


Consequently the vertices r—a—1, r—a- 1 are joined in G(W). So are 
s—b—1, s—b-+1 which follows readily by interchanging the roles of 
r,a and s, b. 

Let the vertices r—1, r+ 1 and s—1, s+ 1, r—s odd, be joined in 
G(W). Then P=(r,r+1,---,s—3,s—2)(s—1,s,s+1)eM,(G, W). 
Q =(s,s + 2)(s+1,s+3)eG,, hence PQeM,(G,W). PQ carries over: 
s—1—-s,s—s-+2. Hence the vertices s and s+ 2 are joined in G(W). 
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So are r, r+ 2. The reasoning may be repeated with r—1, r+ 1 replaced 
by 7, r-+ 2, etc., and we obtain 2¢ R. 
Since 2 R, 


W). 
By P the vertices of G(W) are divided into two classes 


where m=n/2, complement of in such that any 
vertex re S;, can be reached from any vertex se S;,’, r—s even, by passing 
through an odd number of edges of PW. This means there is a suitable 
Qe, such that r and s are consecutive vertices of the Whitney line PQW. 
Hence r,s, —s even, are joined in G(W). If we take k =—1,2,---,n, 
we obtain that any even number is contained in R, which together with 
1,3,- - -,n—38,n—1eR implies G(W) =G=C,, a contradiction, since 
G was assumed to be a proper subgraph of C,. 
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THE INSOLUBILITY OF CLASSES OF DIOPHANTINE 
EQUATIONS.* 


By N. C. ANxeEny and P. Erpés. 


Introduction. Consider the non-trivial rational integer solutions in the 
variables X,, X2,- - -,X, of the equation 


where m, @2,* * are non-zero rational integers, and m > 0. By a non- 
trivial solution we mean one in which not all X¥;—0, 7 =1,2,---,n. 


Let U be a large positive real number tending to infinity, and let 
D(U, an) = D(U) be the number of m =U for which (1) has 
a non-trivial rational solution. Putting a mild but necessary restriction on 
the coefficients, something may be said about the order of magnitude of D(U). 


THEOREM I. If, for every selection of e; =—0 or +1, (j = 1,2,°--,n) 
except (@:,° *,€n) =(0,0,---,0), we have Ane, then 
D(U) =0(U) as U>+ 0. 


Theorem I could be interpreted as stating that equation (1) is “ almost ' 
always ” unsolvable; or the density of m, for which (1) has a non-trivial 
solution, is zero. 

One very important case that the hypothesis of Theorem I excludes is 
when a; = d,—4a;=—1. However, our methods still yield a result of some 
interest in this case. 


THEOREM II. The density of integers m, for which the equation 
+ + X," = 0 has a rational solution and for which (X,X2X3,m) 
= |, ts zero. 


The restriction (X,X.X;,m) —1 is sometimes referred to as the first 
case in Fermat’s equation. 

The result M(U) = 0(U) can be strengthened to M(U) = O(U(log U)-*) 
for some positive constant c. The proof of this stronger inequality requires 
a good deal more effort and will not be presented in this paper. 

The result of Theorem I can be generalized from the rational number 
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field to any algebraic number field #. The restriction on aj, which are now 
any non-zero algebraic integers contained in F, is that aye: +-* + * + nen AO 
where «; = 0 or any root of unity contained in F. The proof of this generaliza- 
tion will not be given in complete detail, but will be briefly outlined at the 
end of this paper. 

In Section 1 we shall present some introductory Lemmas and in Section 2, 
the proof of Theorems I and II will be presented. 


1. Notations. U denotes a large positive variable. ¢,,¢2,- * - denote 
absolute constants. p, qg are rational primes. ¢, is a primitive g-th root of 
unity. 


LemMA 1. Let +, satisfy (2),g >2, and If en) 
is any one of the 3"—1 n-tuples referred to in the statement of Theorem I 


and if hy,- -+,hn are any non-negative integers then 
n 

k=1 


Proof. Suppose first that g = p or 2p where p is an odd prime. Since 
£,? = + 1, the assumption that (3) is false leads to a relation 


p-1 
= 0, where 6; = axex, (j=0,---,p—1), 
j=0 keS; 
and §; is a (possibly void) subset of the set of numbers {1,---,n}. The 
sets So,- - +, Sp, are non-overlapping and their union is the set {1,---, mn}. 


Thus, because of (2), there is an 7 such that 6;540 and, for every ’ A1, 
by 4b; On the other hand, £, is a root of either a *+---+2+1 


or — gP-2 1. both of which are irreducible polynomials 
over the rational field R. It follows that a 
contradiction. 

To complete the proof of the lemma, let g = pi%p.%--: - ps” or 
2p,4p,%- --ps4* where the p’s are distinct odd primes and the d’s are 
positive integers. Assume by induction on d,+--:--+d, that the lemma 


holds for g’ = g/p, (> 2). Since £,% — the assumption that (3) is false 
leads to a relation 


>> Bild? 0, where B; (j 0, 1), 
j-0 keS; 

the f’s being non-negative integers and the sets So,---,S»,-. having a 


meaning similar to that in the first part of the proof. 
By the inductive hypothesis there is an 7 such that 6; 0 and, for every 


t 
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v 1, By AB; On the other hand, the irreducible equation satisfied by £, 
in the field R(Z,-) is either + ----+2+1=0, (d; = 1), or 
(d,>1). Thus By) = Bi =: a contradiction. 


Lemma 2. If 34g then, for any non-negative integers hi, ho, hs, 
(4) + + A 0. 


Proof. Assume there exist hi, he, hs such that + + —0. 
Divide through by ¢,"s, yielding 


(5) + 1—0, 


for 2 integers k,, k.. Taking the imaginary parts of both sides of (5) yield 
that sin(2zxk,/g) + sin(2zk2/g) =0. This implies =—hk, or ki + 9/2 
(mod g) where only the former is possible if 2 f g. 

Now taking the real part of (5) yields cos (2ak,/g) + cos(2ak2/g) = —1 
or, on substituting k,==—k, or k, + 9/2 (mod gq), yields that 


2 cos(2xki/g) = —1, or + cos(2a(ki + 29/9) ) =—1. 


This last equation is clearly impossible. The former equation implies that 
3 | 9, which is contrary to our hypothesis. 


THEOREM III. Jf +, Gn satisfy condition (2), then for a given 
m there exists no non-trivial rational solutions of (1) provided we can find 
a rational prime p such that 


(6) m divides p—1, mr=p—1, 
(7) 
(8) $(r) < a* log p, 


+-+-+]|a,|), and $(r) isa Euler function. 


where «= log(| a: | + | ae 


Proof. (cf. [4], H. S. Vandiver). Assume there exists a p which 
satisfies (3), (4) and (5), and that (1) has a rational solution such that 
X,X,: p). Without loss of generality, assume (X1, °°, 
X,) =1. Then consider (1) in the field R(Z,). 

As p=1(modr), the ideal factorization of p is (p) =P,P.:- +P, in 
R(é,-), where s=¢(r), and =p. Hence, the group of m-th 
power residues of the multiplicative cyclic group of residues (mod P,) has 
(p—1)/m =r elements. One sees that the elements %,/, 7 =0,1,---,r—1 
are incongruent (mod P,). So ,/ form a subgroup of r elements in a multi- 
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plicative subgroup of residues (mod P,). Hence, these two subgroups must 
coincide. 

As a fortiori, +: - -+a,X,"=0 
(mod P;) or, by the coinciding of the two subgroups, ++ dn, = 
(mod P,) for some n-tuple of integers (t1,---, tn). Hence, p = 
divides + + But, 


| R( +° S (| | An |) 


Thus pS(la.|+---+|a.|)%, which is a contradiction to our hypo- 
thesis unless a,¢,41 + - - - + This case, however, is impossible by 
Lemma 1. 

Hence, we have shown that X,X.: - -X,==0 (mod p). Hence, p divides 
one of the variables, say Y,. However, proceeding in the same way with the 
truncated equation a,X," + +: we will see that p will 
divide each X;, i= 1,2,---,n. This is a contradiction to (X,, Xo,: +, Xn) 
=1. This proves Theorem 1. 


Corottary. If n = 3, a; = = 1, m square free, and a prime p 
exists that satisfies (6), (7), (8) in Theorem III, 3f r, then (1) has no non- 
trivial solution relatively prime to m. 


Proof. Using the proof of Theorem III and Lemma 2, we immediately 
infer that there exists no solution of + -+ (mod p) and 
X,X.X3 540 (mod p). Hence, if there exists a rational solution XY,” + X.™ 
+X," 0, then p | X,X.X;. 


If g denotes any prime factor of m, and (X1,X2X3,m) =1 we have, by 
using Furtwangler’s criterion on Fermat’s Equation (cf. Landau [2]), that 
for any p | X,X.X5, pv'==1 (mod As p==1 (mod m), p=1 (mod q). 
Therefore, p==1 (mod q?). As m is square free, p= 1 (mod m?) ; therefore 
p—1= m’. 

By hypothesis, ¢(7) < log p/log3. Thus r< (log p/log3)*. Now 
p—1—mr < (log p/log 3)*m. Hence, m < (log p/log 3)? or p—1 
< (log p/log 3)*. 

This last inequality is clearly contradictory and this completes the proof 
of the corollary. 


2. To prove Theorems I and II, we shall derive a set of integers m 
which satisfy Theorem III and such that almost all integers are divisible by 
at least one element of our set. 

Denote by A(n) the least prime diivsor of n. 
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Lemma 3. If y denotes Euler’s constant, 


= e-VU (log log log U)-* +- O(log UV), 
neJ,(U) 


where J,(U) denotes the rational integers lying between U and 2U which 
have all their prime factors > log log U. 


Lemma 4. If d < U4, then 


= 1—e7¢(d, log log U)U (log log log U)-? + O(log U) 
neJo(U) 
where (d, V) =d]|[ (1—1/p), and J.(U) ts the set of integers n between 


p<V 
U and 2U, and n=1 (mod d). 


Lema 5. For any constant c,, and U sufficiently large, 


> 1>¢.U (log log log U)-, 
log U<r<2logU J3(Usr) 
where J;(U,r) denotes the set of primes p < c,U log U, p=1 (modr), and 
A((p—1)/r) > loglog U. The constant c. depends only upon the choice 
of 


Lemmas 3, 4, and 5 are quite elementary in nature. The proofs of them 
are very similar. We shall give here only a proof of Lemma.3. 


Proof of Lemma 3. Let d be any square free number < log log U, and 
let f(d,U) denote the number of integers which lie between U and 2U and 
which are divisible by d. Then f(d,U) =U/d+O(1). If w(d) denotes 
the Moebius function 


neJy(U) U<n<2U d|(n.h) 


where h= J] 7, as this last inner sum is 1 if m has no prime factors 
p=loglog U 


S log log U, and zero otherwise. Hence, 


ned,(U) d\h d\h 


(u(d)U/d + O(1)) = u(d)/d + 


= vil (1—1/p) + O(h) = (log log log 4+ O(log U) 


4 
( 
( 
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by using Merton’s Theorem on prime numbers. This proves Lemma 3. The 
proof of Lemma 4 is almost identical to the above. Lemma 5 is again the 
same as above using the Siegel-Walficz theorem on primes in arithmetic 
progressions. 

Lemma 4 implies that to the primes < c,U log U there corresponds at 
least c,U (log log log U)-* numbers m < U, m = (p—1)/r, A(m) > log log U. 
However, this correspondence is not necessarily unique, and possibly many 
primes could correspond to the same m. With this in mind we prove 


Lemma 6. Let H(U,g) denote the number of integers m, U << m < 2U, 
\(m) > log log U, and such that there are exactly g distinct primes p1, p2,* + * 5 Dg 
where pj==1(modm), and pj<ecUlogU for Then 
H(U, 9) = O(9°°U log log log U). 


Proof. The function g-* in the above Lemma could easily be replaced by 
a function of g which tends to zero far more rapidly as g increases, but this 
improvement is not needed for this present paper. 

To prove Lemma 6 we shall derive an upper bound on the number of 
times that (r:m +1), (mr.-+-1), -+1) can simultaneously be primes 
where A(m) > log log U, US mS 2U, 1=1, 73 S log U. 

Now for the moment regard 1, 72,73 as fixed and m varying as we 
described. Then the problem is to derive an upper bound on the number of 
elements 


(9) (mr; + 1) (mr, + 1) (mrs + 1) 


which have no prime factors = U4. This corresponds to the slight generaliza- 
tion to the twin prime problem where there the number we sieve is (m)(m + 2). 
In our problem we have a polynomial in m composed of 3 linear factors. 
Utilizing the general method developed by Selberg (cf. [3], especially pp. 
291-292), we can easily prove that there are less than 


c4U (log log log (log U)“*y (11) 


element of (9) with 7, 72,73 fixed which have no prime factors < U3, where 


¥3(r) = (1—1/p). 
p\r 
Hence, summing over 1, f2, 73, we have that the number of elements of 


log U 
(9) is less than c,U (log log log U)-* (log Now 
r=1 


2 
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= (1 + 1/p) < IT (1 + 1/p) IL (4 


pir pir 
=7°/6 (1+ 1/p) > 1/d. 
p\r 


Hence 


log U log U log U log U 
Sl/d—csd1/d 1 Sc; Dd log U/d? < log U. 
r=1 r=1 dir d=1 d\r d=1 
r<log U 
Hence, the number of elements of (9) which have all their prime factors 


< Ui is < c-U (log log log U)-+. However, if ¢,.” is the binomial coefficient, 


this number is equal to }c,7H(U.g) where g > 2. Therefore 
g=3 


(10) (U, 9) < col (log log 
g=3 


Lemma 6 follows immediately from (10). 

Lemmas (4) and (5) showed that to the primes corresponded various 
numbers m. Lemma 6 shows conversely that to each m there cannot corre- 
spond too many primes. 

Therefore, if we define the set M(U) to be all integers m < U, 
A(m) > log log U, and m satisfies the conditions of Theorem ITI, then 


(11) 1>e,U (log log log 
meM(U) 


3. In this section we shall establish that almost all rational integers 


are divisible by an integer of the set M where M=— |) M(U). 
Let D(M(U)) = D(U) denote the density of integers not divisible by 


any integer of our set M(U). Let U, be some large real number. 
LemMa 7. There exists a constant 0 < <1 such that D(U,) < 


Proof. As D(U,) denotes the density of integers not divisible by any 
meM(U,), 1—D(U,) denotes the density of integers which are divisible 
by some me M(U,). Hence 
(12) 1—D(U,)=  3(m), 

meM(U ) 
where 5(m) denotes the density of integers divisible by m but by no other 
integer of our set M(U,) except a divisor of m which might be contained in 
M(U,). Now, the density of integers ¢ which have no prime factors between 
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log log U, and U,, is well known to be > }(log log log U,) (log U,)"*. Now 
the set mt is divisible by m and no other element of our set M(U,) except 
possibly a divisor of m, as all numbers of M(U,) have all of their prime 
divisors lying between log log U, and U,, and hence to divide mt implies, 
by our definition of the integers ¢, that it would divide m. Therefore, 


(13) = 4 (log log log U,) (log 
By (12), (13) and (11), 
1— D(U,) = 4 (log log log U,) (log U,)* 1/m>e,>0. 


meM(U,) 


Letting cs = 1— cy, we have proved Lemma 7. 

If U, is another large constant, then by Lemma 7, D(U2) < ¢s also. 
If U. > exp{exp{2U,}}, then the elements of M(U,) are relatively prime to 
the elements of M(U.). As all prime factors of W(U.) are greater than 
log log U. > 2U;, and all prime factors of the elements of M(U,) are less 
than Uj. 

As D(M(U)) denotes the density of integers not divisible by any element 
in M(U), 


D(M(U,) U DUl(U2))) = D(M(U,)) -D(M(U2)) < 


Similarly, defining U; = exp{exp{2U.}}, Us = exp{exp{2U3}},- - -, 
gives that 


D(U M(U;)) =IL D(M(U;)) < 0 
j=1 j=1 
asn—>oo. Hence, 


(14) D(M) <lim D( M(U,)) =0, 
j=1 


where )(M) denotes the density of integers not divisible by any element 
of M. 

Conversely (14) may be interpreted as saying that almost all integers 
are divisible by some element of our set M. If an integer n is divisible by 
an m, me M, we see that the equation (1) has no non-trivial solution for m, 
and hence, no non-trivial solution for n. This completes the proof of 
Theorem I. 

To prove Theorem II we would need to add to our conditions on M 
that the (p—1)/m be relatively prime to 3, and that m be square free. 
These additional assumptions could easily be incorporated in Section II, and 
present no real difficulties. 

To establish the generalization of Theorem I to an algebraic number 


n n 
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field F we merely sum in Lemma 5 over the rational primes which are norms 
of prime ideals in /. Theorem III can be bodily carried over by changing 
the definition of « to R) log(|a,| +--+ where (F: R) denotes 
the degree of F over the rational number. The remainder of the proof is 


almost identical. 
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ON CURVES DEFINED BY BINARY NON-CONSERVATIVE 
DIFFERENTIAL SYSTEMS.* 


By Puitip HARTMAN and AUREL WINTNER. 


f a prime denotes d/dt and 


a b 
(1) 
is a constant matrix, then the determination of the solution paths (a(¢), y(t) ) 


of 
(2) x’ =ax + by, y =cx-+ dy 


in the real (x, y)-plane, hence the description of their asymptotic behavior as 
too, is elementary. From the second of these points of view, the so-called 
geometrical theory of real, binary differential systems, as initiated (in the 
case of power series f, g) by Poincaré and extended (to non-analytic cases) 
by Bendixson and Perron, deals with conservative but non-linear “small 
perturbations ” of (2), namely, with a system of the form 


where both functions f, g are real-valued and continuous for small 
(4) 

and satisfy 

(5) f(x, y)/r—>0, g(x, y)/r—>0 as 0, 

while 

(6) ad — bc 0. 


But there is another standard “small perturbation” of (2), one which is 


non-conservative but linear and homogeneous, that is, of the form 


(7) —a(t)e+B(t)y, +8(E)y, 


where a, 8, y, § are real, continuous functions of ¢ satisfying 
(8) as 


* Received November 16, 1953. 
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In [2], known results on the asymptotic behavior of the solution paths 
(x(t), y(t)) reaching to the origin, that is, satisfying 


(9) r(t) as too, 


were collected and refined for the problem (3)-(5). The present paper will 
transfer those considerations from (3)-(5) to (7)-(8) and to certain generali- 
zations of both (7)-(8) and (3)-(5); ef. Theorems (*) and (t+) below. 

A question considered in [2] for (3)-(5) is whether or not a solution 
path (a(t), y(t) ) reaching to the origin (as has a tangent there, that 
is, whether or not a continuous determination of 6(1), where 


(10) 6 = arctan y/x, 

is such that 

(11) lim 6(¢) = 6) exists 
t- 


(as a finite limit). For the case in which such a tangent exists, there was 
considered in [2] the question whether or not the half-line 6 = @, is the limit 
of tangents, that is, whether or not 2*-+ y’°0 holds for large ¢ and a 
continuous determination of y(t), where 


(12) y(t) = arctan y’/2’, 

is such that 

(13) lim exists and is 6)(mod 7). 
t- 0 


In the case of the linear system (7)-(8), these questions concerning 
(10), (11) and (12), (13) can be raised for all solution paths (and not only 
for those reaching to the origin). In view of (4), the differential equation 
satisfied by (10) is 


(14) 6’ = y(t)cos? 6 + (8(t) —a(t) )cos sin 6 — B(t)sin? 6. 


Thus it cannot depend on (9) whether (11) does or does not hold; in fact, 
if a sufficiently large constant is subtracted from both a and 8, then (9) 
holds for all solutions and (14) remains unchanged. 

The answer to the question concerning (12), (13) for the linear system 
(7)-(8) is trivially in the affirmative when (6) holds. 

Consider the following classification of the possible matrices (1): The 
matrix (1) has 


(i) real, distinct characteristic numbers; 
(ii) non-real characteristic numbers ; 
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(iii) equal characteristic numbers, multiple elementary divisors ; 
(iv) equal characteristic numbers, simple elementary divisors. 


There will be no statements concerning the case (iv). 


(*) Let a(t), B(t), y(t), 8(t) be continuous functions on 0St <0 
satisfying (8). In case (i), every solution 0=6(t) of (14) satisfies (11) ; 
in case (ii), every solution 6=6(t) satisfies 


(15) |A(t)| as to; 
in case (iii), either all solutions satisfy (11) or all satisfy (15). 


Furthermore, in the cases (i) and (iii), the only possible limits (11) 
are the zeros of 
(16) c cos? 6 + $(d—a)sin? 6 — b sin? 6 = 0. 


In ease (i), each of the two distinct zeros (mod 7) is the limit (11) for some 
solution ; in case (iii), (16) has only one zero (mod 7). 

The assertion of (*) concerning (ii) is trivial. That concerning (i) is 
contained in sharper results of Perron [3], pp. 148-160. It will be clear 
from the proof of (*) that both alternatives can be realized in the case (iii). 


Proof of (*). Although the assertions of (*) concern merely the solu- 
tions 6(t) of the differential equation (14), it will be convenient to involve 
in the proof the trivial differential equation 


(1t) /r = a(t) cos? 6 + (B(t) + y(t) )cos 6 sin 6 + 8(t)sin? 


for r==r(t). The arrangement of the proof will make it possible to imme- 
diately extend the results concerning (3) and/or (7) to corresponding state- 
ments concerning certain non-linear, non-conservative binary systems. 


In view of the remarks following (14), it can be supposed that-the real 
parts of the characteristic numbers of (1) are negative. It is also clear that 
an affine transformation of the (x, y)-plane leaves the assumptions and asser- 
tions of (*) unchanged. Hence it can be supposed that (1) is in a normal 


form satisfying 
(18)  acos?-+ $(b + ¢)sin? 6+ d sin? —const. < 0 for all 6. 


Accordingly, if a(t), y= y(t) is a non-trivial solution of (7), then 
dr/dt <0, by (17), and (9) holds. Thus r= r(?) has an inverse ¢ = ¢(r), 
and t(r) ~o,asr—>0. Put 


f(a, y) = (a(t) + (B(t) —b)y, g(a, = — + (S(t) — dy, 
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where t= ¢(r) =t((2?+ y?)4). Then (5) holds by virtue of (8), since 
t(r) >0o as r—>0. Clearly, c—-2(t), y=y/(t) is a solution of (3). It 
follows from the results in [2], pp. 123-125, on the system (3)-(5) that 6(¢) 
satisfies (11) in case (i), and (15) in case (ii), finally either (11) or (15) 
in case (ili). 

It remains to be shown that the decision in the alternative of the case 
(iii) is independent of the choice of the solution. To this end, it can be 
supposed that (1) has the normal form in which 


(19) a=—1, c=—1, d=—1; 


in particular, y(t) 40 for large ¢. Consider for large ¢ the linear, second 
order differential equation 


(20) + h(t)2’ + q(t)2—=0, 
where 

(21) p(t)=y"(t), h(t) =8(t) —a(t), g(t) =— A(t). 

If z=2(t) is a non-trivial solution of (20), then a continuous determina- 


tion of 
(22) 6 = arctan {z/(pz’)} 


satisfies (14). Conversely, if 6 = @(¢) is a solution of (14), then a quadra- 
ture of (22) leads to a solution z = z(t) of (20). (Actually, the quadrature 
in question is first just formal, because of the infinities of cot 6(¢), corre- 
sponding to the infinities of log | z(¢)|; so that the integration constants must 
be chosen on successive ¢-intervals, bounded by the points ¢ of infinity of 
cot 6(¢), so as to insure that z(t) has a continuous derivative.) 

At a given t-value, 0(¢) becomes 0 (mod) if and only if z(t) =0, 
and, at such a value of ¢, #’(¢) >0. Hence it is clear that (15) holds if 
and only if one and/or every non-trivial solution z= z(t) of (20) has 
infinitely many zeros (as t->0). 

Consequently, in the normalized case (19) of (iii), the first or the second 
alternative of the cases (15) of (11), where 6. —0 (mod 7), holds for every 
solution according as (20) is or is not oscillatory. This proves (*). 


Remark. The argument just completed contains a proof of the following 
theorem (cf. [1], pp. 391-392): If (20) ts non-oscillatory for large t and 


p(t) > 1, h(t) > 0, q(t) as t>0, 
then, for every non-trivial solution z= 2(t) of (20), 


(23) z’(t)/z(t) +0 as t-00. 
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' A careful perusal of the proof of (*) shows that (*) can be extended as 
follows : 

| (+) Let f(x, y, t), g(x, y, t) be continuous functions for x? + y? S const., 
| 0<t<o satisfying 

(Sbis) f/(2? + > 0, g/(a’ + > 0 as (a, y, t) > (0, 0, 0). 

j Let the real parts of the characteristic numbers of the matrix (1) be negative. 


Let be sufficiently near, but not equal to, (0,0) and let x(t), 
y= y(t) bea solution (for small non-negative t) of the initial value problem 


(3bis) 2 y=cx+dy+ Q(z, y,t) 


and y(0) = Then the solution x=—-2x(t), y=y(t) can be 
continued over the half-line OSt <a. The function r—r(t) ts positive 
and satisfies (9). A continuous determination of (10) satisfies (11) when 
(1) is in the case (i), and (15) when (1) ts in the case (ii), finally either 
(11) or (15) when (1) is in the case (iii). Furthermore, when (11) holds, 
6= 6) ts a zero of (16); in addition, 2’* + y” 0 for a large t and a con- 
tinuous determination of (12) satisfies (13). 


In contrast to the linear case of (*), the alternative in case (iii) can 
now depend on the solution; ef. [2], p. 123. 

As to the first part of (+f), that concerning the continuability of 
(z(t), y(¢)) and (9), cf. [4], pp. 816-817. 
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UMBILICAL POINTS AND W-SURFACES.* 


By HarrMan and AurEL WINTNER. 


A surface will be called a special Weingarten surface (special W-surface) 
if its Gaussian and mean curvatures, K and H, are connected by an identity 


(1) F(K,H) =0 
in which F satisfies the following conditions: 


(*) The function F(K,H) is defined and of class C? on the portion 
K = H? of the (K,H)-plane and satisfies 


(2) when K =H’. 


Actually, the assumption that F is of class C? will be needed only on a 
set of points (K,H) on and near the parabola K = H*. In terms of the 
principal curvatures k,, k2, the relation (1) can be written in the form 


(3) W (ki, kz) == 0), 
where W(k,, = F (kiko, $(k, + Correspondingly, (2) means that 
(4) OW /ok,-OW/dk2 >0 if ky 


In fact, it is readily verified that the product of the partial derivatives in (4) 
is (4Fy + HF x)? = 0 if kj and so the inequality in (2) implies that 
in (4). 

This fact makes it clear that the above definition of a special W-surface 
is essentially that of Chern [1], who, without specifying the smoothness of 
W, requires that a relation of the type (3), (4) be satisfied. The assumption 
that F(H,K) is of class C' or C? implies that (4) is of class C* or C”’, 
respectively. On the other hand, the assumption that W(k,,k.) is smooth 
does not imply that F(H, K) = W(H + (H*— K)3, H — (H?— K)3) is 
smooth at points where K = H®. It is for this reason that a special W-surface 
was defined above in terms of a relation of the type (1), rather than one of 
the type (3). This definition also circumvents the hypothesis of H. Hopf 
[5], p. 234, that « = dk,/dk, exists at the points of the surface where k, = kz 
(that is, K =H”). 


* Received November 23, 1953. 
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UMBILICAL POINTS AND W-SURFACES. 


The main theorem to be proved is the following: 


(I) Let S be a (small piece of a) special W-surface of class C*. Then, 
unless S is part of a plane or of a sphere, the umbilical points (if any) are 
isolated and thew indices are negative. 


An umbilical point of S is a point where H* = K (20). According to 
this definition, a flat point (that is, a point where H = K = (0) is an umbilical 
point. It is understood that “index” in (1) refers to the net of the directions 
of principal curvature. The terminology (net, index) is the same as in [6], 
where self-contained proofs of what is needed are developed. 

The use of elliptic partial differential equations in the proof of (I) will 
show that, by virtue of a theorem of Nirenberg [7], p. 106, the surface S 
has local parametrizations z = z(a, y) in which the function z = 2(z, y) is of 
class C* (in fact, the third order partial derivatives of z satisfy uniform 
Holder conditions) in a vicinity of the umbilical points. 

The assertion (1) is contained in a theorem of Hopf ([5], p. 236) if the 
assumption that S is of class C? is strengthened to the assumption that 8 is 
analytic in a vicinity of its umbilical points. On the other hand, Hopf’s 
proof is not applicable even if it is assumed that S is of class C®. 

If a net of directions (say, principal directions of curvature) on a 
closed orientable surface of genus zero has only a finite number of singular 
points, then the index of at least one of those points is positive ; cf. [6], p. 753. 
Hence, an immediate corollary of (1) is the following: 


(II) If a closed orientable surface S of genus 0 is a special W-surface 
of class C?, then S is a sphere. 


This theorem was proved by Chern ([{1], Theorem 1, p. 279) under the 
additional assumption that the Gaussian curvature K on S is positive through- 
out. Chern’s smootliness assumption is that S be of class C*; actually, a 
perusal of Chern’s proof seems to indicate that his proof implicitly assumes 
that S is of class C*; cf. his use of d?k, on p. 283. Hopf ([5], p. 234) points 
out that a generalization of Chern’s theorem can be proved along the lines 
of Hilbert’s proof (cf., e.g., [8], pp. 122-123) that a closed surface of 
constant Gaussian curvature is a sphere. Since this proof requires that, 
near a non-umbilical point, S be of class C? in terms of a line of curvature 
parametrization, it would also seem to require implicitly that S be of class 
C*; cf. the results of [4] for the possibly analogous situation in the problem 
of asymptotic parametrizations. 

If S in (II)-.is assumed to be analytic, then (II) is contained in a 
theorem of Hopf [5], p. 237. 
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Then z is of class C* and satisfies the differential equation (1), in which 


and p = 22, ¢ = 1 = = t= Zyy. A simple calculation shows that 
if the left side of (1) is considered to be a function ®(p, q,1, s, t) of p,q, 1, 8, ¢ 
by virtue of (5), then 46,6,—®,’, up to the factor (1-+ p*+ q’)?, is 
Fy? + 4H FyF x + 4K Fx’, which is + HF xr)? when K = H*. Hence, 
(2) means that the partial differential equation (1) is of elliptic type near 
an umbilical point. Correspondingly, a theorem of Nirenberg [7], p. 106, 
implies that if (7, y) = (0,0) is an umbilical point of 8, then z= 2z(z, y) 
is of class C* (and, in fact, its third order partial derivatives satisfy a uniform 
Hélder condition) in a vicinity of (z,y) = (0,0). 


and that (2, y) = (0,0) is a flat point, that is, that K — H —0 holds at 
(z,y) = (0,0). Then (K,H) (0,0) satisfies (1). Hence, the plane 
z==2(0,0) is a solution of the partial differential equations (1), (5). The 
results of [3], pp. 464-469, imply that if the two solutions z= 2z(z,¥), 
z==2(0,0) are not identical, then the second order derivatives of the 
difference z(x,y) —2z(0,0) satisfy asymptotic formulae of the form 


where n is a non-negative integer, z= pcos 6, y=psin@ and ¢,, C2 are con- 
stants for which c,* + ¢.20. Since (z,y) = (0,0) is an umbilical point, 
the integer n is positive. 


Consequently, the Gaussian curvature of § is negative for small positive 
= 2? + y’, and so (x,y) = (0,0) is an isolated umbilical point. 


by the net of asymptotic directions (the angles of which are bisected by the 
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Proof of (1). Let S be represented as z= 2z(a,y) for small 2? + y?, 


K = 


H = ((1+ @?)r—2pqs + (1+ p*)t)/2(1+ p?+ 9°)! 


Flat points (K =0). Suppose that 
p(0, 0) oT q(0, 0) =0 


Zaz = p"(C, cos nO + sin nO) + 0(p"), 
= p"(C2 cos NO — sin nO) + 0(p"), 0) 
Zyy = p"(— c, cos nd — cz sin nO) + 0(p"), 


In particular, (7) implies that 


— = — (Cx? + Co”) + 0(p™) as p—> 0. 


Since the net of the directions of principal curvature can be replaced 
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directions of principal curvature), it follows that the index of (0,0) with 
respect to the net of the lines of curvature is the same as that with respect 
to the net of asymptotic lines, the net defined by 


(9) + + Zyydy? = 0. 


Since the omission of the o(p”)-terms of (7) in (9) is equivalent to a 
deformation of the net (9), it follows that the index of (0,0) with respect 
to the net of lines of curvature is —n < 0. 

(Actually, the asymptotic formulae (7) are not needed to show that the 
index of (0,0) with respect to (9) is not positive. To this end, it is sufficient 
to know that K(2,y) < 0 for small 2? + y? > 0; cf. [6], p. 756.) 


Spherical points (K >0). Let (6) hold and suppose that K > 0. 


Then if 
(10) N =N(2,y) = (44,9) (2 + p? =1) 


is a continuous unit normal to S, say V(0,0) = (0,0,1), then the spherical 
mapping (z,y) — (A,) is one-to-one. If z is of class C? or C%, then the 
supporting function 


(11) w= = + yp + 


is of class C? or C*, respectively, in a vicinity of (A, ») = (0,0); ef. [2], 
p. 810. In particular, the function w(A,») is of class C® if (2, y) = (0, 0) 
is a spherical point of S, that is, an umbilical point (H? = K = 0) which is 
not a flat point (HW = K —0). 

The function F(1/V,U/2V) of (U,V) = (2H/K,1/K) is of class C? 
for U?=4V>0. The equation (1) becomes a partial differential equation 
for w(A,) by virtue of the Weingarten formulae 


where A, is the Laplace operator with respect to the third fundamental form, 
(12) | aN |? = {(1—p?) dr? + 2apdrdp + (1 —d*)dp?} 


and A». is the corresponding Monge-Ampére operator; cf., e. g., [2], p. 311. 
If (z,y) = (0,0) is a spherical point of S, then the assumption (2) 
means that the partial differential equation (1) for w is of the elliptic type 
at, hence near, (A, 2) = (0,0); ef., e.g., [2], pp. 330-331. 
If the Gaussian curvature of at (z,y) = (0,0) is K(0,0) = 1/R’, 
then the sphere corresponding to the constant supporting function w==R 
is a solution of (1). Hence, the results of [3], pp. 464-469, imply that if 
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the origin on the z-axis is chosen so that z(0,0) —R (hence w(0,0 = 8), 
then either w(A,~) =F or the difference w(A,u) —R has second order 
partial derivatives which satisfy asymptotic formulae, analogous to (7), with 
n=O (in fact, those which result by replacing z,y,z2 in (7) by A,p,u, 
respectively). Hence, quadratures show that the partial derivatives w), w, are 
O(x"**) and that is R + O(y"*?), where x= (A? + and 

Let a= B(A, nu), B—=B(A, pn) and denote the first, second 
and third fundamental matrices of S in its (A,)-parametrization. Thus, 
y is the matrix of the quadratic form (12) and 


(13) y= 


ef. [2], p. 304. If 8=—8(A,p) is the symmetric matrix the elements of 
which are the components of the second covariant derivative of w with respect 
to the quadratic form (12), then? 


(14) B=8+4 wy. 


The asymptotic estimates for the first partial derivatives of w show that 
8(A,) is the Hessian matrix, say 7—7(A,p~), of w, up to an additive 
correction of as x = (A? 0. Hence, the asymptotic formula 
for w shows that (14) can be written, as y > 0, in the form 


(15) OG")- 
Thus, by the asymptotic formula, analogous to (7), for the elements of », 
det (8 — Ry) = — + + 0(x"), as x0. 


In particular, det(B— Ry) <0 if x >0 is sufficiently small. Hence, the 
characteristic numbers k,, k, of satisfy ki << 1/R < k, for small x > 0. 
Since by (13), the numbers k,, are the principal curvatures. 
Consequently, these curvatures are distinct for small x > 0, so that (A, g) 
= (0,0) is an isolated umbilical point of 8. 

This consideration shows that the integer n is positive; for otherwise 
(A, #) = (0,0) were not an umbilical point of 8. 

Since (13) implies that 8 — y/k = Ba-1(ka — B)/k and since the direc- 
tions (dA, du) of principal curvature are the characteristic vectors of the 
(singular) matrices k,a — B, k2a — B, they are also the characteristic vectors 


1In formulae (66) and (67) of [2], p. 312, the partial derivatives w,, should be 
replaced by the covariant derivatives V,,w. 

The discrepancy of a minus sign between the formula (14) of the above text and 
the formula (66) of [2] is due to a different orientation of the normal vector. 
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of B—y/ki, B—y/k2. In view of ki <1/R < ke, the net of directions of 
principal curvature can be replaced by the net of directions (dA, du) for 
which the quadratic differential form belonging to B— Ry vanishes (the 
bisectors of the angles of the latter net are’ the directions of principal 
curvature). It is seen from (15) that the latter net can be deformed into 
the net of directions (d\,du) for which the quadratic differential form 
belonging to » vanishes, that is, into the net defined by 


(16) Wy Ad? + 2Wp,drAdp + Wypdp? = 0. 


Hence, the index of (A,u) = (0,0) relative to the net of directions of 
principal curvature is —n <0. This proves (I). 


Remark. Let (II*) denote that weakened form of (II) in which it is 
assumed that the Gaussian curvature on 8 is positive (as in Chern’s theorem 
[1], p. 279). Then (II*) follows from the theorem (0) of the Appendix 
of [2], pp. 330-333. 


Under more restrictive conditions of differentiability on F and 8, the 
result (II*) is contained in a theorem of Pogoreloff (1948). Under the 
assumption that / and S are analytic, (II*) is implied by a result of 
Alexandroff (1939). It should be mentioned that, in the analytic case, the 
use of index theory in differential geometry is due to Cohn-Vossen (1927) 
and the combination of indices with the theory of elliptic partial differential 
equations to H. Lewy (1938); cf. also Hopf and Samelson (1938), Alex- 
androff (1939) and Zhitomirsky (1939). For references to the papers men- 
tioned, see [2], p. 334 and [8], p. 476. 


Remarks on the Appendix in [2]. In theorem (0) of [2], p. 330 (in 
which the last word should be satisfy instead of satifies), it was assumed that 
the surface S is of class C*. Since the function F occurring in the. elliptic 
partial differential equation #0 is assumed loc. cit. to be of class C?, 
it follows from Nirenberg’s above-quoted result ([7%], p. 106), which has 
appeared in the meantime, that it is sufficient to assume only that S is of 
class C?, for § will then be of class C? by necessity. 


Another improvement on the Appendix of [2] is as follows: On pp. 332- 
333, the asymptotic formula for det(wi,—w) was used to show that the 
congruence points are isolated. The proof that the index of a congruence 
point is negative (is omitted but) follows at once from the asymptotic 
formulae for wi, — « (analogous to (7) above) supplied by the result of 
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[3]. (That the index is just non-positive, instead of being negative, can also 
be concluded from a theorem of Hopf and Samelson, [6], p. 756, top.) 

The paper referred to by [3] on p. 334 of [2] is the paper [3] in the 
bibliography below. 
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SOME FUNDAMENTAL FORMULAS FOR HERMITIAN 
MANIFOLDS WITH NON-VANISHING TORSION.* 


By M. Bootuey. 


1. Introduction. In this paper is developed for the case of complex- 
analytic manifolds with a Hermitian metric the useful technique originated 
by Chern which consists in referring to the principal fibre bundle B over a 
manifold M the tensor calculus of M. The first part of the paper is an 
exposition of general facts about complex manifolds which are already well 
known, including a brief discussion of the method of Chern in the complex- 
analytic case. In sections 10-13 some basic formulas are worked out including 
those for d6, *6, 50 for a differential form 6 of M referred to B. These are 
generalizations of the formulas given by Guggenheimer for the case of a 
Hermitian-Kiahler metric. The final paragraphs give applications of these 
formulas which use Bochner’s Lemma and which are similar to the applica- 
tions given by Bochner in the case of a Hermitian-Kihler metric. 

This study was undertaken at the suggestion of S. S. Chern to whom the 
author is deeply grateful for helpful comments and discussions during its 


progress. 


2. Complex manifolds. Let M be a topological manifold of dimension 
2n and let {U} be a family of open sets covering M in each of which is given 
a system of complex coordinates u’,- - -,w". Further assume that wherever 
two coordinate neighborhoods U,V overlap the corresponding change of 
coordinates is given by complex-analytic functions vt = vi(u,- - -,u*) with 
non-zero Jacobian. The family of coordinate neighborhoods is then said to 
define a complex-analytic structure on M. For functions defined in some 
neighborhood of pe AM, not necessarily the same for each function, the concept 
of being analytic at p is then invariant and we let @(p) denote the family 
of all functions which are analytic at p. The complex-analytic structure 
automatically defines a real-analytic structure: corresponding to complex 
coordinates we have real coordinates where? 


* Received October 9, 1953. 
* We adopt the following summation conventions: latin indices i, j,k, ete., run from 
1 to n; greek indices a, 8,7, etc. from 1 to 2n. For indices from 1 to n a star (*) 
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ui =i +iri*. The change from real coordinates z* to real coordinates y* 
corresponding to vi =y/ + iy” is thus necessarily given by real-analytic 
functions with positive Jacobian. Any real-analytic function may be written 


as a formal power series in w',---,u",u**,---,u*" by replacing 2 by 
+ u*/) and by so we may and shall use -, 
u*1,---,u*" in lieu of - wherever real-analytic coordinates are 


required. We let ¥(p) denote the family of all complex-valued functions 
f =wu- iv whose real and imaginary parts are real-analytic, i.e., the family 
of all functions having expansions in powers of +, uw", 
local coordinates at p. Then at p both df/dui and df/du*) are defined for 
fe ¥(p), and the subset @(p) of ¥(p) consists of exactly those functions f 
for which 6f/du*/=0, j7=1,---,n. We let @*(p) denote the functions 
whose complex conjugates are analytic at p, i.e. the functions f for which 
éf/dui =0 for j=1,---,n. Fp) is a linear space over the complex 
numbers and ((p), @*(p) are subspaces. Complex conjugation, f > f* is a 
conjugate automorphism of ¥(p) carrying Z(p) onto @*(p) and conversely. 


3. The vectors tangent to M. The definition of a tangent vector at 
peM will be the complex analog of the invariant definition given by Chevalley 
[4] for real-analytic manifolds. A vector at p is any linear map v of # (p) 
into the complex numbers which on a composite function f = f(q',- - +, 9"), 
fe satisfies the chain rule 


v(f) = +: + . 


from which it follows easily that the collection of all such vectors forms a 
vector space of dimension 2n, W?"(p) over the complex numbers. If 


u',- + *,u" are coordinates at p, then every fe F is a composite function 
f=f(w,---,u",u*,---,u*") and this gives rise to a natural basis 
of W" associated with the coordinates and charac- 


terized by fa(w®) = Sag. Although we do not do so, it is suggestive to use 
the notation f; = 0/duj and f*; = 0/du*/, since if ve W*" and f = f(u',:--, 
u*i,- --,u®") ¥, we have 


v(f) = v(u*)df/du* = v(u*) fa(f). 


affixed to the index automatically adds n, and for those from n + 1 to 2n subtracts n. 
Complex conjugates are denoted by a star (*) rather than a bar (-) and for symbols 
representing complex numbers the placing of a star on the index is equivalent to taking 
the complex conjugate. For example we write u*i or ui* for the conjugate of w. 
(This notation is adopted for typographical convenience. A better notation is to replace 
all stars, including those on indices, by bars, see [6]). Repeated indices are to be 
summed. 
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We remark that since the coordinates u/ are valid in a neighborhood of p, 
they furnish a natural basis for the tangent space W?"(q) at each point g in 
the neighborhood, thus f, —0/du*% actually is a field of bases over the 
coordinate neighborhood. If the coordinate neighborhood V overlaps with 
U, we have 0/dv% = (du®/dv*) (0/du8), that is, go (du®/dv*) fp. 

The interesting point is that W" carries further structure in the form 
of certain maps and subspaces which may be invariantly defined in our 
schema. First we may define a conjugate automorphism v— v* of W" by 


fed. 


We shall refer to this map as the conjugate automorphism of W?". For the 
natural basis f,,- -, fn, +, f*n defined just above we have f*; = fie. 
Second we may define subspaces V" (and V*") of W®" as those vectors of 
W2" whose value on (@*(p) (and @(p) respectively) is zero. Then we may 
prove the following statement. 


Proposition 1. W°" is the direct sum of its subspaces V" and V*", 
each of which is of dimension n. The conjugate automorphism carries V™ 
onto V*" and conversely. In any natural basis relative to local coordinates 


Proof. It is clear from the definitions that the conjugate automorphism 
carries V" onto V*" and V*" onto V”, so that both subspaces have the same 
dimension. If any vector v is in both V" and V*", then v(u/) = 0 = v(u*), 
j=1,---.,n for local coordinates w1,- --,u" at p; and therefore by the 
chain rule v(f) = 0 for every fe ¥, so that v0. It remains to show only 
that V" + V*" spans W*"; but it is easily seen from’ their definitions that 
fi-- +, fae V" and f*,,- - -, f*ne V*" and these vectors form a basis of W?". 
We shall call the vectors of V" pure complex vectors, since, in the sense that 
were we to restrict ourselves entirely to complex-analytic functions in defining 
the differentiable structure on M, M being then a manifold of complex 
dimension n, we could proceed by analogy with an n-dimensional real-analytic 
manifold, and define just the n-dimensional vector space V” over the complex 
numbers as tangent vectors to M. However, for most purposes restriction 
to complex-analytic functions is too severe, and we are impelled to consider 
real-analytic functions. It is in this latter context that it seems natural, 
though not essential, to introduce the vector space W2". 

A vector willbe called real if its restriction, as a mapping, to the real- 
valued functions of F(p) is real-valued, that is if its value on every real- 


* It is convenient to denote vectors of V*" by y*, i.e., image of ye V". 
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valued function in ¥(p) is a real number. The real vectors in W?" form a 
subset R®" which is closed under addition and multiplication by real scalars 


and is therefore a vector space over the real numbers. > 
Proposition 2. A vector v of W*" is in R*” tf and only tf v* =v; the 
dimension of R?" is indeed 2n; and each vector v of W*" may be uniquely F tu 
written in the form v =v, + iv, where v, and v, are real. 
Proof. Let ve R™ and fe F(p), then f =f, + tf. where f:, fs F (p) 
and are real so that w(f,), v(f2) are real numbers. Since v is linear, § 
v(f) =v(f.) + iv(f.). On the other hand 
v(f) = [v(f*) ]* = [o(f:) — (fe) ]* = + 
Then v* —v. Suppose on the other hand that v* = v and that fe F(p) is 
a real-valued function. Then [w(f)]* v*(f*) =v*(f) =v(f), that is 
the value of v on f is a real number, whence ve R*". Now the dimension of § 
R" is = 2n since the 2n vectors 
are in R2" and form a basis of W®" and must therefore be independent over (6 
the real numbers. On the other hand if av,-+----+ a%v,=0 is a com- 
bination of real vectors v,,- - -, v, with complex numbers as coefficients, then, | is 
since v;==v*;, the conjugate automorphism applied to both sides gives — on 
--+ g*ry,=-0 so that the same combination with only the real (7 
parts or with only the imaginary parts of the @’s is zero. From this and f 
the fact that every set of more than 2n vectors of W2" is dependent over the f wh 
complex numbers, we see that the dimension of R°*™~=2n. Hence it isf bs 
exactly 2n and the vectors above form a basis. It is worth remarking that the hi 
above basis of corresponds exactly to the natural basis - - ,0/da", (8 
0/dx™,- - -,0/dx"" corresponding to the real-analytic coordinates - if 
-+,2™ mentioned in section 2, and 
0/dw = 0/dai + 10/02", 0/du*) = 0/dri — 10/02". are 
a | 
Finally let » be an arbitrary vector in W*". It is easy to see that pr 
v=}3(v-+ v*) + 4(v—v*) is a unique representation of v as a com 
bination v, + iv, of real vectors v, and v». (9 
Finally we define the automorphism Y (see Eckmann and Guggenheimer 
[5] and Guggenheimer [6]) introduced by Hopf. If ve V", then let Wo = wf yp: 


and, if ve V*", let Wo =—iv. Since W2" = V+ V*" this defines © for f 
The following statements about © are clear. 


an arbitrary vector of W?". 
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PROPOSITION 3. W commutes with the conjugate automorphism: Wv* 
= [Wvv]* and V", V*" are the invariant subspaces associated with the two 


| proper values of + % of ©. 


4, Inner products. Let v-w be a symmetric, bilinear, complex-valued 


| function on W*, so that 


(1) for v,we Wn 


(2) [aw + Bo']-w —a[v-w] + 


+ Bw’ | = alv-w] + Blv-w’] for « complex numbers, 


| and further satisfying 


(3) [v-w]* = v*-w* 
(4) v-v20, with equality if and only if v0, and 


(5) v:w=0 if v and w are both in V" or both in V*". 


| Then this function defines a real-valued inner product on R2". Moreover, 
the function (v,w) defined for vectors of V" by 


(6) (v,w) = v-w* 


isa Herimitian scalar product on V". Conversely a Hermitian scalar product 
on V" allows us to define a function with properties (1)-(5) on W*", namely 


(7) [v + v*]-[w + w*’] = (v, w’) + (w, 0’) 


| where v + v*” and w+ w*’ are the unique representations of (arbitrary) 
} vectors of W?" as the sum of a vector of V" and a vector of V*". Let 
fas f¥n be a basis of and let gag = fa'fg. Then we have 


(8) jg = 0 — and = jie = 


If at each point p of WV a function v-w as described is given on W2" such 


| that as functions of local coordinates relative to the natural basis the gag 
| are real-analytic, then M is called a Hermitian manifold. Clearly it is also 


a Riemannian manifold. We remark merely that relative to such an inner 
product the map W of section 3 is a rotation on R?", so that 


(9) vw v-w. 


Relative to such a product on W*" is defined a natural isomorphism from 


| W" to W’2" with w’, the image of w, defined by 


(10) 


<v, w’> = v-w for ve 
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W2" and let é*f, be a vector of W*" with components If is 
the image in W’*" of €*f, we have 

= = fi "fa = and, 
(11) 

In tensor notation these are standard formulas for lowering indices. By 
virtue of (8) they may be combined into 


(12) Pa = gapt?. 


5. Tensors on M. A tensor T at p of type (r,s) is an element of the 
Kronecker product = W”* of r 
copies of W*" and s copies of W’". If a tensor of type (r,s) is given at each 
point of M, we have a tensor field T(p) on M. Let u*,- --,u" be local 
coordinates with - fn, f*n and ¢',- p*" the 
corresponding dual bases in W*" and W”". Then we have 


The components 7-8, 4, are in our case assumed to be real-analytic 
functions of the local coordinates u',- - -,w" and their conjugates. Cor- 
responding to the subspaces V" and V*" of W*" and their duals, we 
have a direct sum decomposition of W*’, into subspaces. For example 
W?, = W*" is the direct sum of 2* subspaces such as 
@ V", V" V*" V", etc. Tensors in these subspaces have, 
relative to a natural basis, all components other than those of the form 77j«j+;, 
Tij*x, ete. respectively, equal to zero. The elements of these subspaces are 
called pure tensors, and in studying linear operations on tensors one may 
conveniently restrict oneself to pure tensors, as will be done below. Tensor 
components written with latin indices may always be assumed the component: 
of a pure tensor. 

An important subset of W’, consists of linear combinations with real 
coefficients of elements of W", of the formu, ®--:-@Qu,-QH@-::-@W# 
where u; are elements of C and 6! elements of R’" C This 
subset is naturally isomorphic to the Kronecker product 


over the real numbers of r copies of R?" and s copies of R’" which will be 
thought of as thus imbedded in W’,. Bochner [1] has given the simple 
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condition which follows, that a tensor T of W’, lie in R’,, i.e. be a real tensor. 
For definiteness consider again the space W*, = W?" @ W*" @ W2" and let 
T = T**v, & vg & vy be an element thereof. We define on W%, the con- 
jugate automorphism T — T* by 


T* ax v*s & v*,. 


Now every vector v, has a unique decomposition vg = Ua + iW, Ug and wy 
real vectors, and also — Raby 4 with R*87 and real numbers. 
Thus T may be written 


— + i897) (4g + inva) @ (up + itvp) @ (wy + ivy). 
Using the linearity in each of the “ factors” it is seen that 
T =P + iQ, where P, Qe Rh?" @ R*" & R** 


to which T itself belongs then if and only if @=0. In this notation T* 
becomes 
T* = P— 1Q, 


so that T — T* is the condition for T to be in R*" @ R*”" & R**, i.e. to be 
real. J is said by Bochner to be self-adjoint in this case. 

If an inner product is defined on W*" we may use the corresponding 
natural isomorphism of |W?" to W’" to raise and lower indices of the com- 
ponents of tensors. Thus relative to dual bases 74, 4, is carried to, 
g. where 


If the basis chosen in W*" is unit orthogonal in V", then gj; = 8;; and 


We remark that using multi-linearity of elements of a Kronecker product 
it is easy to derive the classical formulas for change of components of a tensor 
under change of basis. If fn. ++, f¥n and g1,°° B%1,° °°, Bn 
are bases of W?" related by fi = Plig;, f*: = P*/g*; and T is, for example, a 
tensor of type (3, 0) 

T @ fp ® fy 


then we have 


0 that the components of T relative to ga are T°®vP\,P#gP",. This is 
the usual formula in the case of change of coordinates from u to v with 


e 
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P%,, = 6v/du". This formula has been used classically to define a tensor and 
for brevity we shall revert to the classical definitions for tensorial differential 
forms. For example a tensorial differential form of order p and type (3, 0) 
is given if in each local coordinate system wu’,- - -,w”" are given (2n)* differ- 
ential forms 0°47 of order p in du‘, du*‘ transforming according to the rule 
given above for the change of components of T. A tensor of type (r,s) isa 
tensorial form of order 0 and type (r,s). 


6. Affine connections. There are two methods of computing a metric 
affine connection on a Hermitian manifold M, that is to determine for each 
local coordinate system u’,- --,u" a set of (2)? linear differential forms 
68, =T%,,du” defining by df, = O%afg the differentials df, of the natural 
basis vectors f,,: fn, in a manner independent of local 
coordinates. This latter means that if v’,- --,v" is a coordinate system 
(where we have * then f, = with P®, = dv8/du* and ga = QFafg with 
Q8, = du8/dv"), then we may compute df, using either 6, of the u-coordinates 
or y4, of the v-coordinates, and both must give the same result: 


(1) df, = d(PFage) = (dPPa + 8s, where dga 

whence we must have 

(2) 68, = + P’,6",) 

This is the classical formula for the change of components: of an affine con- 
nection with change of coordinates. We impose the condition 

(3) df*; = [df,]* (i.e. = 

as part of the definition of affine connection on a complex manifold. The 
connection is called metric if 


(4) Al fe] = fa + 


If we define a real displacement dp of pe M by dp = du'f; + du*/f*; and 
a pure complex displacement by dp =—du/f; then we may compute the 
torsion by 


(5a) d(dp) = — duidf; — du*idf*; = + 0% 
in the case of a real displacement or by 
(5b) d(dp) = — duidf; = 6°%;du'f, 


8 Pt, = /du/, P* = = P**. and =0=P**. since the v’s are 
analytic functions of the u’s. Similar formulas hold for Q6,. Note that P*8, = PF... 


In 
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in the case of a pure complex displacement. We say that the connection is 
without torsion (in the pure complex sense to be understood) if d(dp) as 
defined in (5b) is zero. 

Having made these definitions we consider the first of the two methods 
of computing a metric affine connection. In this method we ignore the 
complex-analytic structure and consider V as a real-analytic manifold with 
a Riemannian metric (albeit a metric with special form when referred to our 
complex-analytic local coordinates). Then it is standard that there exists a 
unique metric affine connection without torsion in the real sense, (5a). In 
fact we have 68, B8,,dz” = g8VB,, where 


(6) Bov,y = + — Ogay/du"). 
These formulas simplify somewhat due to the identities (8), section 4, to give: 
(7) Bis = dut — Ogi+;/Ou*) = B* 
Bij xe = 4 (0gix+/du? 09 jxx/Ou') = B* js 

This affine connection, being the standard one on a real Riemannian 
manifold defines dv only for real vectors v, however it is easily extended to 
all of W*" since every element of W°" may be uniquely written in the form 
a+ ib, where a and b are real vectors. Thus we define d(a + ib) = da + 1db. 

Of more interest for complex manifolds is an affine connection which 
makes essential use of the complex structure, the second method referred to. 


By analogy with the Riemannian case we would like to think of M as a mani- 
fold of m complex dimensions, restrict ourselves to the pure complex vectors 


of W?", i.e. to the subspace V", and use only the w',- --,w" and not 
u*t,- +,u*" to obtain thus the forms = by which we define 
df; = 6/,f; for the basis vectors f,,- --,fn of V". Given such a connection 


we could extend it to V*", using df*;==[df;]* as definition of df*;, and 
hence by linearity to W?". The condition that the connection be metric gives: 
= afi + fir or 
Agijs = + whence 
(8) Cine = = g* Agi» 
and following df*; [df;|* we define 
C* ne = and = All other are zero. 


In general the two connections arrived at here are not the same. The 
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condition that they coincide is exactly that the connection be without torsion 
(5b). This is equivalent to the Kahler condition on the metric: dgix+/dui 
= 09j,+/du‘. However, the main interest below is in a preliminary investi- 
gation of this connection when the torsion does not vanish, i.e. the non- 
Kahler case. 


7. The bundle of frames on M. A frame pe,:-: -en,e*,: ata 
point p of a Hermitian manifold MW will mean an ordered set e,,: - -, e, 
e*,,- -,e*, of 2n vectors of W°", the vectors -,e, in V”, with order 


coherent with that orientation of W*" induced by the orientation on M, and 
satisfying e;-e*; = 8);. The collection of all such frames at all points of M 
constitutes the bundle of frames B, a fibre bundle over M with the unitary 
group U(n,C) as structure group. Since each frame at p is uniquely 
determined by the vectors e,,- --,@,, this bundle is, except for notation, 
the same as that defined by Chern [8]. If ut,- - -, wu" are local coordinates 
in a neighborhood U of M and f,,- - -, fz» the corresponding field of natural 
bases of W°", $,- - >," their duals, then the vectors of a frame are given 
by ej = X*;f, and e*; = Y**,f*,, or, i.e., = with = 0 = 
With change of coordinates the Y*, transform contravariantly in the upper 
index. The duals to e’, e*/ are given by & = = or 
e* == where X/,Y";, and so we also have f; = Y*,e,, 
f*; = Y**je*,, and so on. The variables - -, uw", X¥*); give a system 
of local coordinates in x*(U) in B, where 7: B > M is the natural projection, 
although the variables Xi; are not independent but satisfy the relations 
= whence B has complex dimension n(n+1)/2 and real 
dimension n(n + 1). 

At this point we introduce one of the more important notions in the 
paper, namely what will be loosely spoken of as referring a tensor of M to B. 
Arguments given in section 5 allow us the convenience of restricting ourselves 
to pure tensors. To be definite we will consider a tensor T of type (2, 1) 
having components in the local coordinates - -, wu". 


Relative to the basis of W*" determined by a frame pe,- -@,e*,: 
we have 


where, of course JT’, are given by 
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determining thus n* functions defined in *(U) C B. It is easy to see that 
the functions T’*, as defined in B are actually independent of the local 
coordinates on M used on the right side of (3) and thus are functions on 
all of B, depending in fact only on the tensor T itself. They will be called 
the components of T in B. They may be considered as components of T 
in a sort of universal coordinate system defined over all of M, and a coordinate 
system for which the vector basis is moreover unit orthogonal throughout. 
We shall see how it is possible to use these components, functions on B, just 
as we use the components of T in a local coordinate system ; thus “ referring ” 
the tensor calculus of / to B. This is a great advantage especially for global 
considerations as one is freed from the nuisance of local coordinates and, 
moreover, since the components of Tim are with respect to unit orthogonal 
bases, it is possible to raise or lower indices merely by placing a bar over 
the index moved. Thus 7", corresponds to 7'nm and to T’™*"*, ete. This 
is the method of Chern mentioned in section 1. Below its ramifications are 
worked out for the case of complex-analytic manifolds and a few applications 
are given. 

In the same fashion that tensors may be referred to B, tensorial forms 
may also be referred to components in B. Let /; be local components of a 
pure tensorial form of order p and type (1,1); then 0’; has as components 
in B the n? forms Q/; on B determined in local coordinates u‘, X4, X¥*); by 


(4) = 


In particular the complex displacement dp referred to in section 6 is given 
by dp = du‘f,, the du‘ being therefore tensorial forms of type (1,0) and 
order 1. They have the forms wt = Y‘,du/ as components in B, and, since 
dp:dp* = ds*, the differential of are length, we have ds? = (w'w*?) +--- 
+ (w"w*"). If v is a vector field on M, then the absolute differential dv is a 
tensorial form. Let-v = Vf, in local coordinates then dv has components 
DVe = dV* + §%,V® relative to these coordinates. As components in B we 
get DVY = (dV* + 6%,V8)Y%,. In particular, if we are given in some 
neighborhood of M a field of frames e;, then de; = (dX); + 05,X*;) f; which 
relative to the frames is 


(5) de; = we; = (dX); 6/,.X*;) 


Now if we allow X/;, ué to vary independently then the o!; are forms defined 
in each coordinate neighborhood of B. But since on every local cross-section 
of B, i.e. field of frames on I, they define de;, it is clear that they define 
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forms on B independent of local coordinates, which assertion is easily checked 
directly. The 1, w**, w/;,o*/,; span the space of n forms on B. Relations 
among them will be considered in section 8. As is already perhaps apparent, 
the w/; will play the role of @; in our calculus on B of tensors of M. 

Now given as above a tensor T = T",,f; ® f*; ®& ¢* on M, the absolute 
differential DT of T is a tensorial differential form whose components in the 
local coordinates are 


(6) = dT*, + + — 61, 

Lifting this tensorial form we get on B 

(7) DT, = DT", 

On the other hand the expression by which we will now define DT*",, namely 
(8) DT, = aT*, oT, + oT, — TH wh 


is easily seen to give the components of DT relative to a field of frames on M 
and hence, allowing the frames to vary independently as above, to define n° 
forms in B. Further, using dT‘; = d(T",Y',Y*/,X",) and (5) we see that 


(9) DPT DT", 


These formulas show that we can refer to B the operation of covariant 
differentiation merely by using o; rather than @’;. The component differential 
forms of DT in local coordinates are of the form T*”;),du* where T*";|, are 
the components of the covariant derivatives; and hence DT”; are seen to be 
given by expressions DT‘*, = involving only the os, and not w/,’s 
or their conjugates. Of course T#",), and T*";),. are tensor components of a 
tensor, the covariant derivative of T, whose components in B are Ty), and 
Ti**,) i. We digress to note here that if T is a pure tensor with components 
T'", relative to a given coordinate system then T* is also a pure tensor with 
components S**,« where S#4,. == T*‘i*,, and from property 3, section 6 of the 
affine connection it follows that 


(10) Sti = T*i 10 and | = 


The content of equation (9) is that for tensors on M the operations of referring 
to B and of taking covariant derivatives are interchangeable, the forms 
being used to take covariant derivatives relative to the components of 7 in B. 
This leads to an answer to the following question: where are given a@ priori 
in B the components in B of a tensor of M? 


t 
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Proposition 4. Let 7", be n® functions on B. These are the components 
in B of a tensor on M vf and only if * 


(11) + + Til* TF" = 0(w*). 


Proof. 'The necessity is proved above and the sufficiency follows from 
the sufficiency proof of the following proposition, as does in fact the entire 
proposition if we regard tensors as tensorial forms of order 0. 


Proposition 5. Necessary and sufficient conditions that a set of n® 
differential forms Q*"), of order p on B are the components in B of a tensorial 


form of M are * 


(12) 
and 
(13) dQ™,, + ot OM), 4 — 0.04", Rog]. 


Proof. The necessity is a matter of direct verification and is similar 
to the remarks above for a tensor 7'‘#*, and so will not be repeated. Before 
proving the sufficiency we make some preliminary statements about forms of 
M and B and prove a lemma. We continue to let 7: B—M denote the 
natural map carrying each frame pe,- - -e,e*,- - - e*, into its base point p, 
and let + again denote the induced map of forms on M/ to forms on B. By 
considering local coordinates u’,- + -,w" in a neighborhood U on M and 
un, in on B we see that the image in B of a form 
of M must involve only du* and not dX/; or dX*/;, or, using w%, 04, o*;, then 
only w* and not o/; or w*; Thus a necessary condition that ¢, a form of B, 
be the image, ¢ = z(y), of a form of M is ¢e R[w*]. Further, in the local 
coordinates of 7-*(U) any form +(y) must have as coefficients, when expressed 
relative to du*, functions of u1,---,u",u*1,---,u*" for which an equi- 
valent condition is that the exterior derivative involves only the du*’s and 
not dX%,, dX*/, so that the following lemma holds. 


Lemma. A form ¢ of B is the image under x of a form wy of M if and 
only if 
(14) ge R[w*], dd R[o*]. 


Now consider again 0", satisfying (13). A tensorial form on M which has 


‘A differential form ¢ of B will be said to be congruent to zero mod we, ¢ = 0(we), 
if @ is in the ideal generated by the wa’s, a= 1,...,2n, whereas ¢e R[we] will 
denote that ¢ is a polynomial in the wa’s. For ¢ linear the two are equivalent. As 
previously remarked restriction to tensors of type (2,1) is for definiteness only. 
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components 0", in B would in local coordinates in r*(U) on B necessarily 
be given by 
(15) Que") = Xm 
and if the coefficients of the forms 2”**, are functions of u’, u** alone, then 
it is easy to see that 2”"", are in fact the components in the coordinates wu! of 
a tensorial form on M. By the lemma this holds if do”™",e R[w*]. Now 
4 Xm Vi + Xen dV 
Using (13) and the equations (5) 
we get 


which completes the proof. 


8. Equations of structure. Chern [3] has proved that given a Her- 
mitian manifold /, its bundle of frames B and the forms o',- - -, w” defined 
locally by o! = Y',dw in the notation of the previous section, then there 
exists a unique set of forms o'; on B satisfying 
(1) = 0 

If we denote by Q‘ the forms A‘j/o* he has further shown that the forms 
Q/; defined by 

(3) = doi, — 

are of the form ° 

These he has called the curvature forms. By taking the derivatives of (2) 
and (3) and using these equations themselves one gets 

(5) dai Q == wi; 


whence, since the right hand side of each equation is in R[w*], we have 


5 We have altered slightly the notation of Chern writing as R,;,,,. the quantity for 
which he has written — Ri 
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from Proposition 5 that Q', 0; are the components in B of tensorial forms 
of M. The corresponding tensorial forms of M of course are the torsion and 
curvature forms as will be shown. 

First we must show that the w’; whose existence, uniqueness, and role in 
n the local equivalence problem are demonstrated in [8] are the same as those 
if we defined in section 7 by 
(7) = (dX, + ONY) 
that is, we must show that our /; satisfy (1) and (2). Then it will appear, 
by solving for 6’, in the equation above, that the 64 we determined by 
assuming the connection to be metric, to satisfy df*; = [df;]*, and to be of 
the form 6/; = (C’,du" are the same essentially as those determined by (1) 
and (2) above, which throws more light on why this second type of affine 
connection is natural for Hermitian manifolds. We have 
(8) dwt = dY*;dw = — 0% 
and using (7) we get 

dwt = — Gow’, hence 

d (9) dwt = + A with 
(10) = $ (VE — XG) 
Reference to (5b) of section 6 will show the forms Q' = A‘y;,w%w* to be exactly 
the components in B of the torsion 6‘;du’ = C+;,du*dw as defined there. It 
remains to show that the forms /; defined by (7) satisfy (1). We have 
(11) w) + w*t; = (dX*; + + (dX**; + X* V* 
The condition e;-e*; 8;; may be written 
(12) = 833. 
Differentiating we get- 
) If we use gx = Y?;,Y*?,, this becomes 
(14) + dX*1j,Y*t, + = 0, 
and we may write 
(15) + = — + + X*L gtk, 
Then substituting for @,, 6**, the expressions found in section 6, (8), we get, 
using gi* — 
(1) + wt; = 0 
as desired. 


ye 
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If we denote by A*; the curvature forms in the local coordinates w, u* 
we can show in much the same fashion as above that if these are given locally 
in M by the formulas 


(16) Ak, = — 6" 0"), 


then the components in B of these forms are in fact 0*; as defined in (3). 
The equations (16) allow us to compute A*; explicitly. We have from section 


6, (8), 
(17) = = Ogij+/0u! 
so that 

AK, = d(C*iydu!) — 


(18) AR, = (00% — ChimC*n:) du™du! + 0C%,/du*™ du*"du'. 
Using the notation 

(19) AK; = REnidu™du! + du'du*™ 

where + Ri*im = 0 we have 


RE mi = $ — OC* im — ChimO* + 
(20) 


Substituting from (17) in (20) and using d(g‘i*g;;+) =0 we get 
(21) Rém—=0 and 

(22) RE mie = — — (Og*" /Ou*™ ) 
which may be written 

(23) 

(24) = — +- (Ogige/Ou*"). 


10. The Bianchi identities. We have defined 9‘, the torsion forms, by 
(1) = Ab 
and have noted section 8, (5) that 
(2) — = 0/5. 


If we take the exterior derivative of both sides of equation (1) and sub- 
stitute for dQ‘ in (2) we are led to 
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which gives us the Bianchi identities 


(4a) — = 2A | 


Concerning (4a) we remark that if A‘;,);—0 then the curvature tensor 
has the same symmetry properties as in the case of a Kahler metric. This 
seems to be one of two rather natural conditions that can be imposed on the 
torsion, the other being A*;,—0. Unfortunately examples of non-Kahler 
manifolds are not plentiful, and at present there seems to be no example 
of a compact manifold for which either of these conditions is satisfied. In 
particular if A‘;,);.—0 we may conclude that the three “ Ricci tensors ” 
coincide : 
Rijs = Ransije 


(5) == Rinenje 
Tijs = Rijenne. 


On the other hand, if A”;, then That these three tensors 
need not be the same is shown by the following example. 

We consider the manifold M == S* X S°, the Cartesian product of a 1- 
sphere and a 3-sphere. Hopf [7] has shown that this space is a complex- 
analytic manifold as follows. Let w',u? be complex coordinates in the 4- 
dimensional Euclidean space, and @ the discontinuous group on E* — {(0, 0)} 
*==2u°. Identification of 


, 


generated by the transformation = 2ut; 
points congruent under @ gives 8' X S*® plus a complex-analytic structure 
inherited from #*. A natural metrie is given by 


(6) ds? = (du'du** du?du*?) /X, d= wu* 
so that 


An easy computation then gives 
Cty, — Abin == — /20 
(8) Rijexre = — /d8, Rijs = 2 — 
Size = — = 8;;/X?. 


11. Closed forms of M. We will call any form on B which is the image 
under + of a form on M a form of M. Necessarily any such form ¢ satisfies 


(1) ¢e R[o*]. 


| 
| 
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Let ¢ be a form satisfying (1). From section 5, ¢ is a sum of pure forms, 


to which we shall then restrict ourselves: 


where Skew in and skew in jg. We evaluate d¢: 


q 


If df=0O then the term in braces must be a linear combination of o*’s 
i.e. =:0(*) and hence in particular the terms in braces not involving 
A**,; are =0(w%), 18 a pure tensor of M: 


Dp 
r=1 


= | + Nig. ip | 


For any function A on B (or on M) da splits into the unique sum d,A + dd 
where is a form in w+, and is a form in w*t, and (3) may be 
written 


Dp 
r=1 
l 


q 
g=1 


= 


If dé = 0, we see from Lemma 1], section 7, that d= 7y, i.e. @ is a form 
of M. Moreover since d and z are interchangeable and z is an isomorphism 
into it follows that dy) —0. If we substitute (3a) and 3b) in (2) we get 
the conditions 


Dp 
(4a) 
r=1 
g=1 


q q 
* *K 
s=1 


Giving in summary 
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Proposition 6. If ¢, a pure r-form of B of type (p,q) is in R[o*], 
then db =0 implies ¢ is a form of M, i.e. 6 =m, and that dy=0. The 
necessary and sufficient conditions that a form of M be closed are (4a) 
and (4b). 


12. The forms *® and S80. We let 


for a fixed set of distinct indices 1,- - -i, and a fixed set of distinct indices 
*Jq and define 


tations of 1,---,n. Then 


where we denote the volume element by © 

(4) 

We define *6 for general forms by linearity. It is worth noting that 
(5) = (—1)0, 


Given a pure form @ of type (p,q) we can write for #0 


1 1 
where €p, », is zero unless form a permutation of 1,---,m and 


then is + 1 if it is an even permutation, —1 if it is an odd permutation. 
If we introduce the notation 


(7) Pky... 


1 1 
* * 
(n— p)! (n—q)! 


we have 


(8) 
1 1 


p! q! 
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This notation allows us to write 


(9) «6 = 1) x, on hi w* In-q_ 


n-g 


and thus using our computation for d, and assuming Ni..ipity...j%q ATE Com- 
ponents of a tensor of M, i.e. 6 a form of M, we get 


n-p 
I 
r=1 
n-q 
g=1 


Following this by a straightforward but long computation we compute 
50 = *d*6 to be 


p-1 


q-1 
s=1 


The condition that 56 = 0 is then that each term in parenthesis is zero. We 
remark that since it is clear that * commutes with z, i.e. *my =-a*y, and 


since we have noted that dz = md, we have 


Proposition 7%. Jf 6m), then 86=0 if and only if More- 
over if Oe R[w], then 860 is sufficient to ensure that 6 is a form of M. 


18. The interchange formulas. We consider the components in B of 
a pure tensor of I, Xi,...i,;*,...;2,, these components being functions in B. The 
condition that these are components of a tensor in M/ is, according to section 7, 


Proposition 4, 


q 
r=1 s=1 
which expresses the fact that the left side is =0(o%). Aiy...apies...7%g]a ate the ‘ 
components in B of the covariant derivative of the tensor. Taking the : 


exterior derivative on both sides of (1) and collecting terms we get 


4 
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p k q 
(2) 
p 
g=1 r=1 
m- 
1, 
By adding terms whose sum is zero we see that (2) may be written 
k k 
r=1 
r=1 
g=1 
k 
r=1 g=1 
we k 
This gives at once 
nd (4) Dp q 
3 
r=1 g=1 
i. From this, by equating coefficients, we obtain the formulas for the inter- 
d q 5 
change of the order of covariant differentiation indices: 
of 
he R 
r=1 
g=1 
h 
(5c) — = “ike 
. Note that (5a) is the same as in the Kahler case, i.e., the case where 
7 Q'—=0. Of course, since covariant differentiation is a linear operation, and 


since the order of the indices was not used, these formulas may be extended 
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to any tensor, including those with contravariant indices, since it is easy to 
show that the lowering and raising of indices commutes with differentiation. 


14. Bochner’s lemma.* We now prove a slightly extended version of 
the lemma used by Bochner [2]. In the sequel the lemma will be applied 
in the same fashion as in his paper only to the more general case of a 
Hermitian metric with torsion. The presence of torsion, it will be seen, 
enormously complicates the formulas, so that only the simplest cases will be 
discussed. 

Let @ be a real-valued scalar function on M, defining then a function, 
also denoted ®, on B. We have 


(1) dd = 4 = 
Our formula for 86 then gives 


where 4; and +; are the second covariant derivatives of ® and satisty 
Dj == js; and Dj D* whence 


If we take * of (3) we get 

where @ = i%w!- - -w"wo*!- - -w*" is the volume element of M. Then by 
Stokes’ Theorem, since the integral of an exact differential over a cycle 
vanishes, 

(5) + + == (), 


From (5) we get at once 
BocHNER’s LEMMA. If 0 on M, then 
+ + = 0. 

Corotuary. If = 0, in particular if the metric is a Kahler meiric, 
then = 0 implies = 0. 
15. Pure forms of type (p,0) and analytic forms. We consider a 
differential form 6 of M which is pure and of type (p, 0) 

= 


* Hereafter M is supposed compact. 
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Such forms include the analytic forms since a form is said to be analytic if it 
is pure of type (p,0) and if, when expressed in local coordinates ui, u*/ 
of M it depends only on wu’,---,w”, i.e. has analytic functions as com- 
ponents. Writing down the expression for the covariant derivative in local 
coordinates gives at once the 


LeMMA. 6 is analytic if and only if Xi,.4,|ne = 0. 


The conditions d#—0 and 860 simplify slightly to (1) and (2) 
respectively 


r=1 r=1 
p 
+ 44, = 0 
tr 
(1b) = 0 
p-1 
r=1 


From which we can draw the immediate conclusions: 
THEOREM 1. If 6 is pure of type (p,0) and dé = 0, then @ ts analytic. 


However the converse is not in general true as it is in the Kahler case, 
where an analytic form is necessarily harmonic in fact. In the case of non- 
vanishing torsion we cannot even conclude that a closed analytic form is 
co-closed (i. e. satisfies 890). However we do have the following theorem 
for a very special case: 

THEOREM 2. Jf 6=Ajw! ts an analytic 1-form and if, moreover, A‘ jx 
satisfies Ak, 0, then and the form is harmonic if dé = 0. 

Proof. If 6 is analytic, then using the Lemma, (2) reduces to 2A,A“p% 
which is zero by assumption. 

Let us define & = A*,A; for 6 = Ajw* an analytic 1-form. Then we have, 
USING 0, A* + and using the interchange formulas 
= A* AT + Since = O we get from Bochner’s Lemma 

THEOREM 3. Jf satisfies and if the form is 
positive definite, then there exist no analytic 1-forms on M. 

We may attempt a similar analysis in the case of a pure form of type 
(p,0), as above. Define 4 Then, 
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Assume 6 closed, or analytic, so that (1b) prevails, then 

Using the interchange formula, = + le 
Again, by Bochner’s Lemma we get 


THEOREM 4. If satisfies = 0 and if is positive definite, 
then there are no analytic forms on M nor are there any closed pure forms of 
type (p,0) on M. 


16. Pure forms of type (1,1). We consider next a form 0 = Ajjew'w*!, 
The conditions that it be harmonic, i.e. that d@=0O and 800 are 


respectively 

(1b) — Aire — = 0 
(1c) + = 0 

(1d) Nine + = 0. 


Again we form ® = )j;+\;j« and compute ®;;+ for the case where @ is harmonic. 


Let y denote the first two terms of this expression, then using d@ = 0, 66 = 0 
and the interchange formulas and the Bianchi identities we get: 


(A* ine | Aj + A*ijeAnje [xe Ain) 
It is clear that ®,,5=0 if ~=0. In the case A’;, —0 we have 
— (Rinexje + Rajeine)- 


In this case we can state the following weak theorem, which is of some interest 
since in the Kahler case 8,jo‘w*/ is a harmonic forms and 84), = 0. 


THroreM 5. If A’, and 
+ — (Rinexje + Rejeine) 


is positive then there are no pure constant harmonic form of type (1,1). 
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17. On the non-existence of a harmonic vector field. We consider a 
differential form of order 1, which we assume self-adjoint and harmonic 


= Nw! + Ajew*?, A*, == dé = 0 = 86. 


Any real harmonic vector field on M would give rise to such a form and 
conversely. The conditions d6 0 and 86 =0 give respectively 


(1a) — An = 

(1b) — = 0 

(1c) — Ane A * iy 

(2) Anne + Ane + = 0. 


We let = A*,A; = 
= + + Ae Ass Ke 


Let y denote the first two terms on the right. Then using d# =0 and the 
interchange formulas we get 


If we assume A",, 0 we get, using further d@ = 0, 66 = 0 and the Bianchi 
identities 


THEOREM 6. If and is positive definite, then there 
are no constant harmonic vector fields on M. 


None of the theorems so far stated involves Rijs = Ransije; they use Tij« 
which as we have seen in section 10 need not equal Rij. Suppose Rij to be 
positive definite, then R;;+ defines a metric tensor on M. It is Hermitian 
by section 8 (4) and the form Rijsw'o*) = O*,. That this form is closed and 
hence the metric Kihler was noticed by E. Emry of Ziirich in an unpublished 
thesis. It follows at once from section 8 (6). Thus in this case stronger 
results known for Kahler manifolds follow. 
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SUR LES MODULES DE REPRESENTATION DES ALGEBRES DE 
LIE RESOLUBLES.* 


par J. L. Koszut.? 


1. L’effacement des classes de cohomologie. Soit g une algébre de Lie 
sur un corps k et soient J/ et N deux modules de représentation de g (on dira 
également deux g-modules). Tout homomorphisme ¢: M—WN définit une 
application linéaire ¢’ de l’espace H*(g, M) des classes de cohomologie de g 
a valeurs dans M dans l’espace H*(g, N) des classes de cohomologie 4 valeurs 
dans N.? 


Définition. Soit MJ un g-module de dimension finie sur k. Une classe 
de cohomologie ce H*(g, M) sera dite effagable si il existe un g-module NV 
de dimension finie sur k et un tsomorphisme ¢: M > N tel que ¢’-c = 0. 


Ce travail tend essentiellement & déterminer les classes effagables et a 
étudier certains modules de représentation en rapport avec cette détermination. 


1) Une classe de cohomologie de degré 0 non nulle n’est jamais effagable. 
En effet, une classe ce H°®(g, M) est un invariant du module M et son image 
par ¢’ est invariant du module N. 


2) Toute classe de cohomologie de degré 1 est effacable. En effet, une 
classe ce H'(g,M) définit une classe d’extension de M par le g-module 
trivial k. On vérifie immédiatement qu’en prenant pour N une de ces exten- 
sions et pour ¢ l’isomorphisme identique de M dans N, on a ¢’-c=0. 


3) Toute classe de cohomologie de degré 2 est effacable. C’est la un 
résultat profond, du & Iwasawa, qui exprime en termes de cohomologie l’exis- 


“Received November 4, 1953. 

* Les résultats de cet article ont fait l’objet d’une note “Sur les représentations des 
algébres de Lie résolubles,” Comptes Rendus, 236 (1953), pp. 2371-2372. Il m’a été 
signalé que le Théoréme I du présent travail figure dans la thése de G. F. Leger 
(University of Illinois, 1951) avec une démonstration se rattachant au travail de G. 
Hochschild, “ Lie algebras and differentiations in rings of power series,” American 
Journal of Mathematics, 62 (1950), pp. 58-80. 

* Pour les définitions et propriétés fondamentales de la cohomologie des algébres de 
Lie, cf. [1] et [2]. 
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tence d’une “splitting algebra” pour toute extension abélienne d’algébre 


de Lie.* 


4) Pour quune algébre de Lie g sur un corps k de caractéristique 0 
soit semi-simple, il faut et il suffit que, quel que soit le g-module M de dimen- 
sion finie sur k, H*(g,M) ne contienne pas de classe effagable non nulle. 
En effet, si g est semi-simple, pour tout isomorphisme ¢ de M dans un g-module 
N de dimension finie, il existe un homomorphisme y de WN sur M tel que yd 
soit ’identité de M. I] en résulte que ¢’ est un isomorphisme. Inversement, 
si pour tout g-module M de dimension finie H*(g, M) ne contient pas de classe 
effagable autre que 0, ceci signifie d’aprés 2) que H*(g,M) = (0), donc que 
tous les g-modules de dimension finie sont complément réductibles. 


5) Soit g une algébre de Lie sur un corps k de caractéristique 0. Si 
les classes de cohomologie de degré 3 de g & valeurs dans k sont effacables, 
alors g est résoluble. Soit en effet g — b+ r une décomposition de Levi de g 
(r étant le radical de g et b une sous-algébre semi-simple supplémentaire de r 
dans g). Tout g-module Z peut étre considéré comme b-module et il existe 
un homomorphisme canonique du complexe des cochaines de g a valeurs dans 
L sur le complexe des cochaines de b a valeurs dans Z. Cet homomorphisme 
définit pour tout entier p= 0 une application pz de H®(g, L) sur H?(b, L).! 
Si b A (0), alors H*(b, k) ~ (0) et il existe donc une classe c e H*(g, k) telle 
que p,-c 0. Si il existe un isomorphisme ¢ de & dans un g-module WN qui 
réalise l’effacement de c, cet isomorphisme étant compatible avec les struc- 
tures de 6-modules va définir une application ¢’,: H*(b, H#(b, N) qui 
vérifie la relation = Par conséquent, p; c serait une classe effacable 
non nulle de H*(b,/), contrairement 4 4). On a done b= (0) et g est 
résoluble. 


Le Théoréme suivant est l’un des principaux résultats que l’on a en vue: 


THEOREME I. Si g est une algébre de Lie résoluble sur un corps de 
caractéristique 0, alors, quel que soit le g-module M de dimension finie, toutes 
les classes de H®(g,M) sont effacables pour p > 0. 


Sa démonstration sera donnée a la fin du paragraphe 8. Compte tenu de 
5), on voit que l’on obtiendra une maniére de caractériser les algébres de Lie 
résolubles par les propriétés d’effacement de leurs modules de représentation. 


* Cf. [3]. 
“Ceci résulte de l’existence d’un homomorphisme de g sur b inverse 4 gauche de 
lisomorphisme } > g. 
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2. Le module V(g) et son interprétation. Soit A une algébre asso- 
ciative avec unité de dimension infinie sur un corps k. Etant donné un 
module 4 gauche M/ sur A, on dira qu’un élément de M est de caractére fim 
si le sous-module qu’il engendre est de dimension finie sur /. Pour que ue M 
soit de earactére fini, il faut et il suffit que son annulateur (idéal & gauche 
des ae A tels que aw) soit de codimension finie dans A. II revient au 
méme de dire que son annulateur contient un idéal bilatére de codimension 
finie. Les éléments de caractére fini de M constituent un sous-module. Si h 
est un homomorphisme de J dans un A-module & gauche N, image par h 
dun élément de caractére fini est un élément de caractére fini de N. On 
définit de maniére analogue les éléments de caractére fini pour un module a 
droite sur A. 

Soit E(g) Valgébre enveloppante de Valgébre de Lie g. Les g-modules 
ne sont autres que les modules unitaires 4 gauche sur l’algébre associative 
E(g). Pour tout g-module M, on définit une structure canonique de g-module 
dans espace des applications k-linéaires de dans M en 
posant (af)-e=f-(er) pour feM(H(g),M), ee L(g) et Liso- 
morphisme défini par = ea pour ae M et 
ee L(g) est alors compatible avec les structures de g-modules. On démontre 
que, pour tout g-module M et tout entier p > 0, H®(g, (L(g), M) = (0). 
Toute classe de cohomologie dans H?(g, M) est done annulée par ¢y lorsque 
p> 0. On observera que, pour M ~ (0), 2(F(g), M) est de dimension infinie 
sur k, si bien que ¢y ne réalise pas l’effacement des classes de H?(g,M). 
On va cependant voir que cet isomorphisme $y est utile dans le probléme que 
Yon a en vue. 

Notons %,(H#(g), le sous-module des éléments de caractére fini du 
g-module 2(H(g), considéré comme module a gauche sur ; c’est le 
g-module des applications k-linéaires de H(g) dans M qui sont nulles sur un 
idéal bilatére de codimension finie dans Z(g). Si M est de dimension finie 
sur k, on a MC &(L(g),M). On notera encore ¢’y lapplication des 
espaces de cohomologie associée 4 l’isomorphisme M M). 


THEOREME 2. Quel que soit le g-module de dimension finie M, les classes 
effacables de H*(g,M) sont les zéros de Vapplication 


¢'u: H*(g, M) > 


°Ce résultat est lié au suivant: il existe un complexe acyclique pour les degrés ~ 0 
et muni d’une structure de module libre sur E(g) tel que, pour tout g-module M, 
Hom;,,,(C, M) soit le complexe des cochaines de g & valeurs dans M. On en déduit 
que le complexe Hom,,,)(C,2(H(g),M)) qui s’identifie au complexe des applications 
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Supposons en effet qu’une classe ce H?(g, M) soit un zéro de ¢’y. 
Soit f un cocycle de g & valeurs dans M appartenant a la classe c. Le cocycle 
¢u f de g a valeurs dans 2y(H(g), M) est un cobord, done de la forme dg 
ou g est une cochaine de degré p—1 de g a valeurs dans &,(H(g), M). Soit 
N le sous-module de &;(H(g),M) engendré par les valeurs de g et l'image 
de ¢y. Etant engendrable par un nombre fini d’éléments de caractére fini, 
N est de dimension finie sur k.6 L’isomorphisme ¢y: M— N réalise visible- 
ment l’effacement de c. 

Supposons inversement que c soit une classe effagable de H*(g, M) et 
que y soit un isomorphisme de M dans un g-module N qui en réalise l’efface- 
ment. Soit p une application k-linéaire de N sur M telle que py soit l’identité 
de M. Les applications y et p définissent des homomorphismes de g-modules 
Ye: &(E(g), M) >&(E(g),N) et per & (L(g), N) &(E(g),M) tels 
que pe soit Videntité de &(£(g),M). Par suite est un isomorphisme 
de H*(g,2,(E(g),M)) dans H*(g,2,(£(g),N)). On a d’autre part 

On est ainsi conduit 4 étudier la cohomologie de g 4 valeurs dans le g- 
module 2;(F(g),M). Celle ci se raméne a la cohomologie de g a valeurs 
dans &2,(F(g),%) muni de la structure canonique de g-module. En effet, 
M étant supposé de dimension finie sur /, on a un isomorphisme naturel du 
produit tensoriel (sur k) Q(£(g),k) @M sur Q(E(g),M), qui est com- 
patible avec les structures de g-modules si dans le produit tensoriel on définit 
e(f®u) pour reg, feX(F(g),k) et we M. Cet isomorphisme 
applique done &)(H(g),k) M sur M) et il existe un isomorphisme 
naturel de H’(g,&,(F(g),k)) @M sur pour tout p. 
Le module 2;(H(g),&) sera noté V(g). 

Soit A(g) Vidéal bilatére de Valgébre enveloppante de g qui est. engendré 
par les éléments de g.? Sa codimension est 1 et tous les idéaux A(g)" (r entier 
> 0) sont de codimension finie. Etant donné un g-module M, les éléments 
de VM dont l’annulateur contient un idéal de la forme A(g)* seront appelés 
des éléments de caractére nilpotent de M. Pour qu’un élément wu de M soit 
de caractére nilpotent, il faut et il suffit qu’il soit de caractére fini et que la 
représentation de g dans M induise dans le sous-module F(g)u une représen- 


k-linéaires de C dans M est acyclique pour les degrés ~ 0. Les démonstrations de ces 
résultats, qui ne seront pas utilisés ici, figureront dans le livre de H. Cartan et S. 
Kilenberg (Princeton Mathematical Series). 

* Une cochaine de g 4 valeurs dans M étant une application multilinéaire alternée 
de g dans M, ses valeurs constituent un sous-espace de dimension finie dans M. 

7On identifiera toujours une algébre de Lie avec le sous-espace de son algébre 
enveloppante qui lui correspond. 
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tation par des endomorphismes nilpotents. On aura accessoirement 4 utiliser 
le sous-module des éléments de caractére nilpotent de 2(H(g),k). Il sera 
noté W(g); c’est le module des fonctions k-linéaires sur #(g) nulles sur un 
idéal de la forme A(q)’. 

On donnera dans ce qui suit une interprétation des cochaines de g a 
valeurs dans les modules V(g) et W(g) lorsque g est Valgébre de Lie d’un 
groupe de Lie G, le corps de base étant alors le corps des réels R. D’une 
maniére précise, on supposera g identifiée & V’algébre de Lie des champs de 
vecteurs invariants par les translations 4 droite sur G. Soit L Valgébre des 
fonctions différentiables de classe infinie 4 valeurs réelles sur G. Soit 0(x) 
la transformation infinitésimale du groupe des translations & gauche associée 
au champ x. En possant = 6(x)-F pour tout xeg et Fe L, on définit 
dans LZ une structure de module 4 gauche sur H(g). Soit 2’ le champ de 
vecteurs invariant 4 gauche qui coincide avec x au point neutre « de G et soit 
6(x’) la transformation infinitésimale du groupe des translations 4 droite qui 
correspond 2’. On a 6(x’)6(y’) — 6(y’)0(2’) = —0([a, y]’) et par suite, 
on définit dans Z une structure de module & droite sur H(g) par la condition 
Fr = 6(2’)-F pour «eg et Fe L. On observera que (eF)e’ = e(Fe’) quels 
que soient e, e’e H(g), Fe L et que d’autre part, les fonctions zF et Fa ont 
méme valeur au point «. 

Pour tout L, on notera la fonction R-linéaire définie sur 
par (A°F)-e= (eF)(e). Ona (A: (aF)):e = (exF) (ec) = (A°F)- (er). 
L’application 4: L > 2(#(g), R) est done compatible avec les structures de 
module 4 gauche sur H(g). On définit dans 2(#(g), FR) une structure de 
module 4 droite sur /(g) commutant avec la structure 4 gauche envisagée 
jusqu’ici en posant (he): e’=h-(ee’) quels que soient he X(H(g), R), 
eee H(g). On a (A:(Fr)):e = (e(F2)) (e) = ((eF)2x) (c) = (reF) (e) 
= (A:F)-xe; par suite, Vapplication A est également compatible avec les 
structures de modules’a droite sur #(g). Le noyau de X est l’idéal des fone- 
tions qui sont nulles ainsi que toutes leurs dérivées au point «. 

Soit L; C L la sous-algébre des fonctions sur G@ dont les translatées a 
gauche (ou, ce qui revient au méme, les translatées & droite) sont dans un 
espace vectoriel de dimension finie sur R. Ces fonctions étant analytiques et 
de caractére fini pour la structure de module & gauche sur E(g) définie dans 
L, \ applique isomorphiquement L; dans V(g). Soit Ln C Ly la sous-algébre 
des éléments de LZ; qui sont de caractére nilpotent pour la structure de module 
i gauche sur (gq), ou, ce qui revient au méme, dont l’image par d est dans 
W(g). Tl est clair que W(g) est également le sous-module des éléments de 
&(E(g),R) qui sont de caractére nilpotent pour la structure de module a 
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droite sur E(g). Les éléments de Z, peuvent donc aussi se définir comme 
étant les éléments de Ly qui sont de caractére nilpotent pour la structure de 
module a droite sur E(g). 

Supposons maintenant G simplement connexe. Tout élément he V(q) 
engendre un sous-module H(g)h C V(g) de dimension finie sur R. Puisque 
G est simplement connexe, 4 la représentation linéaire de g dans H(g)h qui 
correspond a la structure de module a gauche sur H'(g) est associée une repré- 
sentation linéaire s—>r(s) du groupe G dans l’espace vectoriel E(g)h.& La 
fonction F'(s) définie sur G par F(s) = (r(s)-h)-J, ot J désigne Punité de 
E(g), est un élément de Ly dont V’image par A est précisément h. Dans ce 
cas donc, A applique isomorphiquement J; sur V(g) et L, sur W(g). 


THEOREME 3. Sovt g l’algébre de Lie d’un groupe simplement conneze G. 
Le complexe des cochaines de g a valeurs dans V(g) est canoniquement tso- 
morphe au complexe des formes différentielles sur G dont les translatées sont 
dans un espace vectoriel de dimension finie. Dans cet isomorphisme, les 
cochaines de g a valeurs dans W(g) correspondent aux formes différentielles 
sur G dont les translatées sont dans un espace vectoriel de dimension finte ow 
les transformations infinitésimales du groupe des translations a drotte sont 
nilpotentes. 


Soit w(C,, C2,- - -C,) une fonction L-linéaire alternée, 4 valeurs dans 
L, de p champs de vecteurs de classe infinie sur G, autrement dit, une forme 
differentielle de degré p sur G. Si les translatées 4 droite (ou ce qui revient 
au méme, 4 gauche) de cette forme sont dans un espace vectoriel de dimension 
finie, alors, pour des C; invariants 4 droite, w(C,,C2,- - -,Cp) sera dans L. 
Par restriction des formes aux champs invariants 4 droite, c’est 4 dire 4 g, 
on définit done une application linéaire canonique A, de lespace des formes 
différentielles dont les translatées sont dans un espace de dimension finie dans 
Vespace des cochaines de g 4 valeurs dans L; qui s’identifie 4 V(g) par l’iso- 
morphisme A. Le module des champs de vectenrs sur @ étant un Z-module 
libre admettant pour base une base de g sur R, A, est un isomorphisme sur. 
D’autre part, la différentielle d’une forme w(C,, C2,- - -,Cp) peut s’exprimer 
par la formule 


+ ~ (— 1)#0(C;) C,, Ci, 
ot 6(C;) est la transformation infinitésimale correspondant au champ (; 


® Cf. Chevalley, “ Theory of Lie Groups,” Princeton, p. 113. 
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Comme cette formule est aussi celle qui donne le cobord d’une cochaine de g, 
Visomorphisme A, est compatible avec les structures de complexes. I fait 
eorrespondre aux formes qui sont de caractére nilpotent vis a vis des trans- 
formations infinitésimales du groupe des translations 4 droite *® les cochaines 
de g a valeurs dans W(q). 


3. Remarques préliminaires. Jnvarints des modules V(g) et W(g). 
Pour qu’une fonction #-linéaire f sur H(g) soit un invariant du g-module 
Q(E(g),k), il faut et il suffit qu’elle soit nulle sur Vidéal A(g) C E(g) 
Soit 8 Phomomorphisme de Valgébre sur ayant pour noyau A(qg). 
En associant 4 tout ce k la fonetion e—c(B-e) de H(g) dans &, on obtient 
un isomorphisme de / sur le sous-module des invariants de 2(/(g),%) qui 
est contenu dans W(g) C V(g). Il lui correspond des applications naturelles 
H*(g,k) > H*(g, W(g)) ~ H*(g,V(g)). Pour le degré 0, ce sont des 
isomorphismes de H°(g,k) sur H°(g, W(g)) et sur H°(g, V(g)).% 

Cas d’un produit direct d’algébres de Lie. Soient g, et gz deux algébres 
de Lie sur un corps &. On identifiera Valgébre F(g: X g2) au produit 
tensoriel (sur H(g,) H(g2) au moyen de l’isomorphisme qui prolonge 
Yapplication (x,y) de gi X dans E(g,:) @L (gz) (les 
unités de H(g,) et L(g.) étant notées I). Soit + Visomorphisme de 
(91), k) dans (g: X gz), &) qui associe 4 f, @ fs la 
fonction e, © (fortes) pour e,e (gi) et H(gz). Les espaces 
et Q(H(g2),%) etant des modules 4 gauche sur £(g,) et H(g2) 
respectivement, leur produit tensoriel est muni d’une structure de module 4 
gauche sur H(g,) @H(g.). L’isomorphisme est un homomorphisme de 
modules 4 gauche sur @ E(g2) et applique done V(g:) V(g2) dans 
V(9: X gz). Quoique V’image de ne soit qu’un sous-module de 2(E(91 X g2), k), 
on vérifie que r-(V(g:) ® V(gz)) = X gz). Cet isomorphisme définit 
done un isomorphisme canonique du produit tensoriel du complexe des co- 
chaines de g; 4 valeurs dans V(g,) et du complexe des cochaines de g. a 
valeurs dans V(g.) sur le complexe des cochaines de g; X gz i valeurs dans 
Vig: X gz). On en déduit le 


°En général, l’algébre des formes de caractére nilpotent vis 4 vis des translations 
& droite est distincte de celle des formes de caractére nilpotent vis & vis des translations 
& gauche. La structure multiplicative dans V(g) et W(g) qui résulte de V’isomorphisme 
\ pourrait se définir algébriquement dans le cas général en prenant l’application trans- 
posée de l’isomorphisme diagonal de E(g) dans E(g) @ E(q). 

*° Si g est une algébre semi-simple sur un corps de caractéristique 0, l’isomorphisme 
de k dans W(g) C V(g) induit un isomorphisme de H?(g,k) sur H®(g,V(g)) et 
H?(g9,W(g)) pour tout entier p. 
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LEMME 1. Si g, et go sont des algébres de Lie sur un méme corps k 
il existe un isomorphisme canonique du produit tensoriel (sur k) 


H*(9:, V(9:)) @ H*(g2, V(g2)) sur H* (gi X go, V (91 X G2)). 


4. Cas des algébres de Lie abéliennes. 


TuioriME 4. Si g est une algébre de Lie abélienne, alors H?(g, W(g)) 
= H*(g,V(g)) = (0) pour tout entier p> 0. 


Soit (t= 1,2,---m) une base de g et soit la base duale 
dans le dual de g. L’espace W(g) s’identifie 4 V’algébre des polynémes en 
les 2’;, la structure de g-module dans W(g) étant celle qui associe 4 chaque 4; 
la dérivation par rapport 4 2; dans W(g). On sait que, dans ces conditions, 
H*(g, W(g)) = (0) pour tout p > 0.7" Pour démontrer le second résultat, 
il suffit, en vertu du Lemme 1, de |’établir dans le cas ot g est de dimension 1 
sur k, auquel cas l’énoncé se raméne 4 H*(g, V(g)) = (0). 

Soit f(z) un cocycle de degré 1 de g 4 valeurs dans V(g) et soit wu un 
élément non nul de g. Puisque f(u) e V(g), c’est une fonction linéaire sur 
Valgébre H(g) des polynémes en w qui est nulle sur un idéal de codimension 
finie, done sur un idéal engendré par un polyndme P(u) ~0. Soit h la 
fonction linéaire sur H(g) définie par et pour 
p> 0. Elle est nulle sur l’idéal engendré par uwP(w). C’est done un élément 
de V(g) ou encore, une cochaine de degré 0 de g a valeurs dans V(g). Son 
cobord dh est un cocycle de degré 1 associant 4 tout «ew une fonction 
(dh) (x) sur E(g). Ona ((dh)(x)) =h: (wer) = cf(u)- wv 
==f(z)-u®. Par conséquent, dh = f, ce qui achéve la démonstration. 

On donnera dans ce qui suit une généralisation du Théoréme 4 qui sera 
utilisée aux paragraphes 7 et 8. 

Soient g une algébre de Lie quelconque et IM un g-module. On notera 
ay Vapplication linéaire f f-I de 2(F(g), sur M, désignant Punité 
de L(g). On définit deux représentations linéaires p,(x) et p2(Z) 
de g dans l’espace 2(#(g), M) en posant 


+f-(er), (p2(4) *f)-e =f: (ex) 


quels que soient fe X(H(g),M), ee et reg. La seconde représen- 
tation est celle qui correspond a la structure de g-module envisagée au 
paragraphe 1. Ces représentations se prolongent en des homomorphismes 
€—>pi(e) et e—>pe(e) de Palgébre H(g) dans l’algébre des endomorphismes 


11 Le raisonnement classique, par récurrence sur l’entier n, sera généralisé dans le 
Théoréme 6. 
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de ’espace (L(g), M). On va montrer que ces représentations sont équi- 
valentes. 


LemMME 2. II existe un automorphisme 6 et un seul de Vespace 2(E(g), M) 
tel que 

1° = p2(x)6 pour tout reg, 

2° = ay. 


Un endomorphisme 6 de l’espace M) vérifiant ces conditions 
est nécessairement donné par 

lorsque ee H(g) et fe X(H(g),M). Réciproquement,, l’endomorphisme @ 
défini par (6-f)-e = (p:(e)-f)-I vérifie les conditions du Lemme. I] reste 
4 démontrer que @ possede un inverse. Soit » lV’isomorphisme de l’algébre 
E(g)dans Valgébre © E(qg) qui est défini par les conditions =I @I 
et pour tout On notera e é lautomorphisme 
involutif de espace H(g) qui conserve l’unité et transforme tout mondme 
morphisme de l’espace ‘2(H(g),M) défini par (¥-f)-e = &(f-e%) lorsque 


feQ(E(g), M) et pre Ona p-(exr) (ex) Se, + De, @ (ez). 
i 4 i 


Comme ev == — il en résulte que, pour tout xe g, on a pi(@) = Yp2(Z). 
Dautre part, ayy == ay et par suite = ay. Siveg eteeLH(g), 
on a done 


(YO-f) (ex) = (pi(x)yO-f) -e —x((yO-f) = (YOpi(x) — -e). 
Il en résulte que l’ensemble des éléments e E(q) tels que (y6-f)-e = pour 
tout f constitue un idéal a droite de H(g). Puisqu’il contient J, 


cest done que y@ est. Videntité. La relation p2(x)Op = O~p2(z) montre 
Wauire part que Oy est Videntité. Par suite y est l’inverse de 6. 


LemME 3. Le sous-espace des invariants de 2(E(g),M) pour la repré- 
sentation p, est appliqué isomorphiquement sur M par Vapplication ay. 


Soit en effet f un invariant de 2(#(g), ) pour la représentation p:; 
$f est un invariant de 2(#(g),M) pour la représentation p2. C’est donc 
une fonction linéaire de H(g) dans M nulle sur Vidéal A(g). Si ay-f —0, 
alors (0-f) -I = ay0-f = et par suite 6-f —0, donc f D’autre 
part, quel que soit ae M, il existe une fonction fe &(H(g), M) et une seule 


* Cf. (7). 


‘4 
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telle que =a et f-A(g) = (0). Cette fonction est un invariant de 
&(H(g),M) pour la représentation p2. Par suite 6-'-f est un invariant de 
pour la représentation p, et Von a ay6-'+f = =a. 


Lemme 4. Si g est abélienne et si M est de dimension finie, Vespace 
V(g) @M identifié a un sous-espace de (L(g), M) est stable par 6 et 6". 
L’application ay induit un isomorphisme de Vespace des wvariants de 
V(g) @M pour la représentation p, sur Vespace M. St de plus la repré- 
sentation de g dans M est une représentation par endomorphismes nilpotents, 
ces résultats subsistent en remplacant V(g) par W(q). 


Désignons par (pi), (pz) et (ps) les trois structures de g-modules définies 
dans L=(E(g),M) par les représentations T—>p2(z) et 
&—> = pi(x) —pe2(x). Si g est abclienne, les endomorphismes p;(2), 
p2(y) et ps(z) commutent entre eux quels que soient z,y,zeg. Il] en résulte 
que, si un élément de ZL est de caractére fini pour deux de ces structures, il 
Vest également pour la troisiéme. Si M est de dimension finie, tout élément 
de ZL est de caractére fini pour (p;) et les éléments de caractére fini pour 
(p:) sont done aussi les éléments de caractére fini pour (p.). Puisque 
6p1(«) = p2(x)6 pour tout reg, le sous-espace V(g) M qui est le sous- 
espace des éléments de caractére fini pour (p2) ou (pi) est donc stable 
par 6. De méme, la relation 0-%p2(x) = pi(z)6* montre qu’il est stable par 
6*, Tout élément de Z invariant pour (p,) étant de caractére fini pour cette 
structure est done dans V(g) @ M. La seconde partie de lV’énoneé résulte 
ainsi du Lemme 3. Dans le cas ot la représentation de g dans M est une 
représentation par endomorphismes nilpotents, on obtiendra des résultats 
analogues pour W(g) © WV en remarquant que les éléments de /, sont tous 
de caractére nilpotent pour la structure (p;) et que les invariants de L pour 
la structure (p,) sont de caractére nilpotent pour cette structure. 

On utilisera dans la suite la situation suivante: b est un idéal de l’algébre 
de Lie g sur k; U est un g-module arbitraire; A est un homomorphisme de U, 
considéré comme b-module, dans un b-module X. En composant l’homomor- 
phisme canonique du complexe C(g, U) des cochaines de g 4 valeurs dans U dans 
le complexe C(b, U) des cochaines de b 4 valeurs dans U avec Phomomorphisme 
de C(b,U) dans C(b,X) qui prolonge A, on obtient un homomorphisme 
a:C(g,U) +C(b, X) induisant une application o*: H*(g, U) H*(b, X). 
On utilisera dans H*(b, U) la structure de g/b-module défini comme suit. 
Dans le complexe C'(6, U), on définit une structure de g-module en posant 
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pour f de degré p dans C(b,U), reg et Y1,42,°°-Yypeb. Comme 
a(d-f) = d- (af) pour tout fe C(b, U) et xe g, cette structure se retrouve dans 
H*(b,U) ow elle induit une structure de g/b-module car zc —0 pour tout 
ce H*(b,U) lorsque ve b. 


LEMME 5. Pour que o* soit un tsomorphisme de H*(g, U) sur H*(b, X), 
il suffit que les conditions suivantes soient vérifiées: 


1° Hi(g/b, H*(b,U)) = (0) pour tout 7 > 0, 


2° Vapplication »’: H*(b,U) > H*(b, X) applique isomorphiquement 
le sous-espace des invariants du g/b-module H*(b,U) sur H*(b, X). 


Ce Lemme est une conséquence des résultats sur les suites spectrales 
relatives 4 un idéal établis par Hochschild et Serre.'® Soit A/ le sous-com- 
plexe de C(g,U) dont les éléments de degré p sont les cochaines de g a 
valeurs dans U qui sont nulles lorsque p—j-+ 1 variables au moins sont 
dans b. A la filtration de C(g,U) par les A‘ est associée une suite spectrale 
E, (s=0,1,2,---). D’aprés le Théoréme 4 de [2], il existe pour tout 
entier 7 = 0 un isomorphisme canonique de l’espace HJ, des éléments de HL, 
ayant j pour degré de filtration sur l’espace Hi(g/b, H*(b, U)), la structure 
de g/b-module de H*(b, U) étant celle qui a été décrite plus haut. Le sous- 
complexe A* est le noyau de l’homomorphisme C(g,U)—~C(b,U). Cet 
homomorphisme induit done une application de #, dans H*(b, U) qui, com- 
posée avec A’, donne une application canonique o*,: #, > H*(b,X). Cette 
application est nulle sur #/, pour 7 >0. On vérifie d’autre part que, aprés 
identification de 4 H°(g/b, H*(b, U)), c’est & dire & espace des invariants 
du g/b-module H*(6, U), o*, coincide sur H°, avec la restriction de ’. Les 
hypothéses du Lemme signifient done que o*, applique isomorphiquement 2, 
sur H*(b, X). Ceci entraine que o* applique isomorphiquement H*(g, U) 
sur H*(b,X) daprés un résultat bien connu de la technique des suites 
spectrales. 

On particularisera les hypothéses de ce Lemme sous deux formes. Pour 
commencer, on supposera U == V(g/b), X =k et 2» sera ’Phomomorphisme 
canonique a: V(g/b) —> kk (les structures de b-modules étant celles qui corres- 
pondent aux représentations nulles). On notera dans ce cas o, ’homomor- 
phisme de C(g, V(g/b)) dans C(b,&). On envisagera également le cas:ou 
U=W(g/b), X =k, d étant alors la restriction de a 4A W(g/b). Il conduit 
aun homomorphisme o,,: C(g, W(g/b)) C(b, k). 


8 Cf. [2], paragraphes 2 et 3. 
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THEOREME 5. Si b est un idéal de Valgébre de Lie g sur k tel que g/b 
soit abélienne, alors o*, est un isomorphisme de H*(g, V(g/b)) sur H*(b,k). 
Si de plus g est une algébre de Lie nilpotente, alors o*, est un isomorphisme 
de H*(g,W(g/b)) sur H*(b,k). 


Puisque la représentation de b dans V(g/b) est la représentation nulle, le 
g/b-module H*(b, V(g/b)) s’identifie au produit tensoriel V(g/b) @ H*(}, k) 
et application ’: H*(b, V(g/b)) H*(b,&) devient Vapplication v@a 
— (a-v)a pour ve V(g/b) et ae H*(b,k). Puisque g/b est abélienne et 
que H*(b,k) est de dimension finie, la condition 2° du Lemme 5 est donc 
vérifiée en vertu du Lemme 4. II résulte également du Lemme 4 que les 
représentations de g/b dans V(g/b) ® H*(b,k) définies par p:(x) (v @a) 
et p2(x)*(v@a) —av@a sont équivalentes. Si 
ealcule Hi(g/b, V(g/b) © H*(b,&)) en prenant dans V(g/b) © H*(b, k) la 
représentation p;, comme on doit le faire dans la condition 1° du Lemme 35) 
on obtiendra done un espace isomorphe 4 celui qu’on obtiendrait en pre- 
nant la représentation ps. Pour cette derniére, on a un isomorphisme de 
Hi(g/b, V(g/b) @ H*(b,k)) sur Hi(g/b, V(g/b)) H*(b,&) pour tout 
entier 7=0. Or g/b est abélienne, done H/(g/b, V(g/b)) = (0) pour tout 
j>0 (Théoréme 3). La condition 1° du Lemme 5 est done également 
vérifiée, ce qui prouve l’assertion sur o*,,. Lorsque de plus g est nilpotente, 
la représentation de g/b dans H*(b,k) est une représentation par endo- 
morphismes nilpotents et les Lemmes 4 et 5 permettent de démontrer de la 
méme maniére l’assertion sur o*,. 


5. Filtrations des algébres enveloppantes. Soit g une algébre de Lie 
sur k et soit b une sous-algébre de g. L’isomorphisme b —g se prolonge en 
un isomorphisme de £(b) sur la sous-algébre de H(g) engendrée par l’unite 
I et les éléments de b C g. On va dans les paragraphes suivants identifier 
E(b) avec cette sous-algébre de H(g). L’inclusion C E(g) définit 
dans E(g) une structure de module & gauche sur H(b). TI résulte du 


Théoréme de Poincaré-Birkhoff que si 2, 2%2,- - est une base d’un 
supplémentaire de b dans g, les monémes de la forme - ow les 


pi sont des entiers => 0, constituent une base de E(g) sur E(b). 

On suppose désormais que b est un idéal de g. Soit A Vidéal bilatere 
de H(b) engendré par b et soit N le noyau de ’homomorphisme canonique 
de L(g) sur E(g/b). On peut encore définir N comme l’idéal & gauche (ou 


14 Cf. G. Birkhoff, “ Representability of Lie algebras and Lie groups by matrices,” 
Annals of Mathematics, 38 (1937), pp. 526-532. 
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droite) engendré par 6 dans H(g). On a done N=EH(g)A = AE(qg) et, 
plus généralement, N? = 2(g)A’ = A?E(g) pour tout entier p>0O. On 
posera A°= et N°— 


LEMME 6. Stunt donnée une base de E(g) considéré comme module a 
gauche sur H(b), pour quun élément de E(g) soit dans N?, i faut et il 
suffit qu’tl soit combinaison linéaire a coefficients dans A” des éléments de base. 


La condition est suffisante car A? C NP, Elle est nécessaire car, si 
ce N®? == APE(g), on peut écrire c= Daye; avec A? et H(g). En 


4 
exprimant chaque e; comme combinaison linéaire 4 coefficients dans E(b) 


des éléments de base, on obtient pour ¢ une combinaison linéaire 4 coefficients 
dans A? de ces éléments de base. 

A la famille @idéaux N? correspond une famille de sous-modules C, dans 
le g-module 2(/(g),%), chaque C, étant le sous-module des fonctions nulles 
sur V?, Ona (0) =C, CC,:--C,C---, mais ne sera pas 
la réunion des Cy. On étudiera dans ce qui suit les modules quotients Cp)/C5-1. 

A toute fonction feC, (p>0), on associe l’application linéaire 
F: A? définie par (F-c):e =f-(ce) quels que soient ce A? 
et ee H(g). Si Von définit dans espace des applica- 
tions k-linéaires de A?-' dans 2(147(g), &) une structure de g-module en posant 
(wxeg,ae Vapplication devient un homo- 
morphisme de g-module. Puisque f-N? = (0), F est nulle sur A? et peut 
done étre considérée comme élément du sous-module 2(A?1/A?, 2(F(g), k)). 
D’autre part, si ee N, alors (F-c)+e =0 pour tout ce A? *. Par conséquent, 
F-c est dans 2(F(g/b),%) considéré comme sous-module de 2(#(g), 
On a done obtenu ainsi pour tout entier p>0 un homomorphisme de 
g-modules 

Cp > 2 (EF (g/b), 


Si feC>., alors T,-f=0. Réciproquement, si feC, et si 0, 
alors, pour tout ce N?-', done de la forme > aje; avec a;e A? et ee E(g), 


4 
ona = > f:(ae:) = = ((Tp'f) = 0, ot a’; et désignent respec- 


tivement les images canoniques de a; et ¢; dans les quotients A?1/A? et E(g/b). 

Par suite fe Cp... Pour tout entier p > 0, Cy, est donc le noyau de Ty. On 

va maintenant démontrer que pour tout Fe 2(A?*/A?,2(H(g/b),k)) il 

existe un feC, tel que Soit en effet (i =1,2---n) une 

base d’un supplémentaire de b dans g. Tout élément ee N? s%écrit d’une 

maniére et d’une seule e = > aym; oti les aje A? et ot les m; sont des 
j 
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mondmes de la forme - - (Lemme 6). On peut done définir 
une fonction f’: +k en posant f’-e (F'-a’;)+m’; ou a’; désigne image 


canonique de a; dans A?*/A® et m’; Vimage canonique de m; dans L(g/b). 
La fonction f’ est évidemment nulle sur N?. Soit f une fonction dans 
2(L(g),%) ayant f’ pour restriction & N?*; c’est un élément de Cy dont 
Vimage par T, est F. On notera désormais y, l’isomorphisme de C,/C,_, 
sur 2(A?*/A?, 2(H(g/b), &)), déduit de T, par passage au quotient. 


7. Filtration des modules V(g) et W(g). Les notations étant celles 
du paragraphe précédent, on posera Vp(g) = V(g) et Wy(g) = CoN W(g) 
pour tout entier p > 0. 


LemME 7. Le module W(g) est réunion des modules W,(g). Si g est 
une algébre de Lie résoluble sur un corps de caractéristique 0 et si b en est 
Vidéal dérivé, alors V(g) est réunton des V,(q). 


En effet, si fe W(g), c’est une fonction linéaire nulle sur un idéal 
bilatére de la forme A(g)* dans H(g), donc a fortiori nulle sur VN" puisque 
NCA(g). D/’autre part, on sait que si g est une algébre de Lie résoluble 
sur un corps de caractéristique 0, pour tout g-module M de dimension finie 
sur ce corps, les endomorphismes de Vespace M qui correspondent a des 
éléments de Vidéal dérivé de g sont dans le radical de Valgébre enveloppante 
de M.1° Ceci signifie que, si D est un idéal bilatere de codimension finie dans 
EH (g), VYimage inverse dans H(g) du radical de l’algébre associative (g)/D 
contient 6, done également NV. II en résulte que tout idéal bilatére de co- 
dimension finie dans #(g) contient un idéal de la forme N” ce qui prouve 
que tout élément de V(g) est dans un sous-module V,(g). 

La structure de g-module de 2(L(g/b), &) étant définie par sa structure 
canonigue de g/b-module et ’homomorphisme de H(g) sur H(g/b), les élé- 
ments de caractére fini (respectivement nilpotent) du g-module Q(H(g/b), k) 
coincident avec les éléments de caractére fini (respectivement nilpotent) de 
&(£(g/b),%) muni de sa structure canonique de g/b-module, c’est a dire les 
éléments de V(g/b) (respectivement W(g/b)). Les quotients A?-*/A? étant de 
dimension finie sur k, les éléments de caractére fini (respectivement nilpotent) 
du g-module £(A®-1/A?, 2(£(g/b), &)) sont donc les éléments du sous-module 
2(AP-1/A?, V(g/b)) (respectivement 2(A?*/A?, W(g/b))). D’un autre cété, 
V,(9g)/Vp-.(g) s’identifie 4 un sous-module du module des éléments de carac- 


15 Cf. C. Chevalley, “Algebraic Lie algebras,” Annals of Mathematics, 48 (1947), 
pp. 91-100. 
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tere fini de U,/Cp.. De méme, Wy(g)/Wp.(g) s’identifie & un sous-module 
du module des éléments de caractére nilpotent de C,/Cp1. L’isomorphisme yp 
du paragraphe précédent applique done isomorphiquement Vp(g)/Vp4(g) dans 
V(g/b)) et Wo(g)/Wps(g) dans 2(A?*/A?, W(g/b)). 

LEMME 8. Si b est un idéal de codimension 1 dans une algébre nil- 
potente g, alors, pour tout p>0, yp applique isomorphiquement 
sur W(g/b)). 


Le seul point restant 4 démontrer est que tout élément de caractére 
nilpotent dans le g-module C,/Cy. est dans Wy(g)/Wpi(g). Soit f un 
élément de C, ayant pour image dans C,/C,. un élément de caractére 
nilpotent. Ceci signifie que f est une fonction dans 2(#(g),&) nulle 
sur N? et sur un idéal de la forme N®*A(g)™ ot m désigne un entier 
=(. On va démontrer que, quels que soient les entiers p et m=O, 
il existe un entier r= 0 tel que N?*M A(g)” C N?+ NPtA(g)™. Tl en 
résultera bien que la restriction de f & N?* est aussi la restriction d’une 
fonction nulle sur A(g)", done que fe We(g) + Cpu. 
Soit un élément de g—b. Quel que soit lVentier r > 0, tout élément 
ee A(g)” sécrit, modulo N?, comme somme de mondmes ayant au moins 
r—p-+1 facteurs «, les autres facteurs étant dans 6 et en nombre au plus 
égal & p—1. Puisque les x? constituent une base de #(g) considéré comme 
module & gauche sur H(b) (cf. paragraphe 5), on pourra écrire chacun de 
ces mondmes sous la forme > c,a% avec cy e H(b). Pour cela, on devra effectuer 

qa 


un certain nombre de fois Vopération consistant & permuter un facteur x 
avec un facteur ye b, ce qui fait apparaitre chaque fois un mondme supple- 
mentaire ayant un facteur % en moins, mais dont l’un des facteurs ye 6 aura 
été remplacé par [2, y] =ad(x)-f. Puisque g est supposée nilpotente, il existe 
un entier s tel que ad(x)*==0. Les monédmes dont on part comportant au 
plus p—1 facteurs dans b, on voit que le nombre des facteurs 2 restera au 
moins égal & r—p-+1—s(p—1). Ainsi, tout élément ee A(g)* est égal, 
modulo NV? & une somme de termes de la forme c,r4 avec q = r—(s — 1)(p—1) 
et cge H(b). Side plus ee N*?-, alors les c, sont dans A?! (Lemme 6). Par 
conséquent, en choisissant r= m-+ (s—1)(p—1), on a A(g)" 
C N, + NP*A(g)™, ce qui achéve la démonstration. 

LEMME 9. St b est Vidéal dérivé d’une algébre de Lie g, alors yp 
applique isomorphiquement Vp(g)/Vp+(g) sur 2(AP*/A?, V(g/b)). 

Soit (2%) («= 1,2,---m) une base d’un supplémentaire de 6b dans g. 
On notera S(i,i+ 7) (i=1,2,- -n;r=0,1,- - le sous-espace de 
E(g) engendré par les mondmes de la forme ot les 
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Pir * Sont des entiers = 0. On posera pour abréger S = 8(1,n) 
et S(t) —S(t,7). L’homomorphisme canonique ¢ de E(g) sur E(g/b) 
applique isomorphiquement S sur /(g/b). On utilisera dans la démonstration 
du Lemme 9 le résultat préliminaire suivant: 

LEMME 10. Pour tout sous-espace U C S dont Vimage par € est un 
idéal de codimension finie dans E(g/b), tl existe un sous-espace U’ C U 
tel que 

1° est un idéal de codimension finie dans 


2° U’g CU’+ BU. 

On remarquera pour commencer que, si we S(1,1-}- 7). alors quel que 
soit 9, 

(A) zu—uzebS(t,1+7). 
Pour vérifier cette inclusion, on supposera que wu est un mondme et l’on pro- 


cédera par induction sur le nombre de ses facteurs en posant u = w’xi,, avec 
Ona 


ZU — Uz 
= + (28 ing — i+ 8) + C +17) 

d’aprés Vhypothése inductive. 

D’autre part, si we 7) et sit Si-+ alors 

(B) + 7). 
On supposera de méme que wu est un mondme et l’on procédera par induction 
sur le nombre de ses facteurs en posant w—w’a,, avec OSsSr et 
weS(i,i+s). On a trivialement si jzi+s. Si 
j<its, on écrira 

UL; = W — + + 8) + S(t 
d’aprés V’hypothése inductive. Ce sous-espace est contenu dans 

S(i,i+ts) C 8Q,i+r) 4+ b8(4,i+7r) 

d’aprés (A). 

Ceci étant, soit 8 Vapplication linéaire du produit tensoriel (sur &) 
g ® dans le sous-espace bS(i) C H(g) définie par B-(z®u) = zu — uz 
pour zeg et we S(t). Posons K; =U S(t); c’est un sous-espace de co- 
dimension finie dans S(i). Puisque b est de dimension finie, bK, est de 


codimension finie dans bS(i), et il existe done un sous-espace de codimension 
finie dans g ® S(7) dont l’image par 8 est dans bK;. Puisque g est de dimen- 


C 
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sion finie, il existe done un sous-espace K’; de codimension finie dans $(1) tel 
que B:(g @ K’;) C bK;. Les deux sous-espaces K; et K’; de S(t) étant de 
codimension finie, leur intersection contient un polyndme P; 0 en a. Soit 
U’; le sous-espace S(1,i—1)P,S(i,n) de S.1° Son image par £ est l‘idéal 
de E(g/b) engendré par ¢-P;. Puisque Pye U, on a done 
dot U. D/autre part, U’ig CU’; +0U. En effet, si zed, alors 
U'z2e8(1,t—1)PibS(i,n) Waprés (A). Or, pour tout yeb, on a 
yP;,— Py ebK; C bU, done Pib C OU et par suite, ¢-U étant un idéal, 
C bU Waprés (A). D/autre part, si t= 7 Sn, on a 


= S(1,i—1)PS (4, n)a; 
C + 8(1,1—1)PibS (4, n) 
daprés (B), done Enfin, si 7 < 4, 
U’ ia; C S(1,i—1)PiaS (i, n) + 1) (4, n) 
Waprés (A). Puisque bK; on a done 


d’aprés (A) et ce sous-espace est contenu dans U’;-++ bU d’aprés (B). Soit 
alors U’ = > U’;; son image £-U’ dans H(g/b) est Vidéal engendré par les 


Puisque H(g/b) est Valgébre commutative des polyndmes en 
Vidéal £-U’ est de codimension finie dans H(g/b). D’autre part, U’ C U et 
U’g C U’ + BU par suite des inclusions analogues vérifi¢es par chaque U’,. 
Ceci termine la démonstration du Lemme 10. ; 
Revenons maintenant a la démonstration du Lemme 9. II reste & vérifier 
que tout élément de caractére fini dans C,/C,, est Vimage d’un élément de 
earactére fini de C,. Soit f un élément de C, ayant pour image dans C,/Cp1 
un élément de caractére fini. Ceci signifie que f est une fonction dans 
R(H(g),k%) nulle sur N? et sur un idéal de la forme N?*H ot H est un 
idéal bilatére de codimension finie dans E(g). On va démontrer que, quel 
que soit l’entier p et l’idéal H de codimension finie dans /'(q), il existe un idéal 
bilatére H’ de codimension finie dans E(g) tel que H’ C N? NH. 
Il en résultera bien que la restriction de f & N?* est aussi la restriction a 
NP dune fonction nulle sur H, done contenue dans Vp(qg). 
Soit U, intersection de S et de H. Le Lemme 10 permet d’y associer une 
suite de sous-espaces U;, (k =0,1,- - -) possédant les propriétés suivantes 


Pour i== 1, S(i,i—!, repr’sentera le corps des sealaires. 
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2) quel que soit k, ’image canonique de U;, dans H(g/b) est un idéal 
de codimension finie, 
3) quel que soit k= 1, C + OU x4. 


k=p 
Ceci étant, soit L, A?*U, (k =0,1,2,---p). Le sous-espace L = > 
est un idéal droite de E(g). En effet, 
Lg = AP*U Li AP*bU = Lx 4- Cc Lx 


pour k= 1, et pour k—0, on a Lo —=A?U, = A'S =—N?. D/autre part, 
L,-+ L, est de codimension finie dans H(g). En effet, si (¢;) est une base 
d’un supplémentaire de U, dans S, tout élément de E(g) s’écrira modulo 
Ly, == E(b)U, comme combinaison linéaire coefficients dans H(b) des 
Puisque pour a;¢ A”, ajt;e N? = Ly, et que A? est de codimension finie dans 
E(b), Vespace L, + Ly est bien de codimension finie dans E(g). A fortiori, 
Vidéal L est de codimension finie dans E(g). On ‘va enfin montrer que 
élément de L; (k =0,1,-- +p) s’écrit comme combinaison linéaire 4 coeffi- 
cients dans A? d’éléments linéairement indépendants de U;,, et s’il est dans 
N’-**! ces coefficients sont dans A?**! d’aprés le Lemme 6. On a donc 
Dy C C Ly pour tout k > 0. D’autre part, L, C N?* pour 
0=kSp. Ilen résulte que L,N N C Lp. et (Lp + Lp1)N N C Ly. done 
(Lp + Loi) NN? C N? C Lp» et par suite (Lp + Lps + NN? 
C De proche en proche, on démontrera ainsi que (Lp + 
L,, done que LN N?*CL,+ On a done obtenu un idéal 
a droite Z de codimension finie dans H(g) dont l’intersection avec N?* est 
dans N? + N?*H. Puisque tout idéal & droite de codimension finie contient 
un idéal bilatére de codimension finie, le Lemme 9 est ainsi démontré. 


7. Trivialité de H*(g,W(g)) lorsque g est nilpotente. 


THEOREME 6. Si g est une algéebre de Lie nilpotente, alors H?(g, W(g)) 
= (0) pour tout p> 0. 


La démonstration procéde par induction sur la dimension de g. Le 
Théoréme est vérifié pour g de dimension 1 (Théoréme 4). Supposons le 
vérifié pour les algébres nilpotentes de dimension inférieure 4 la dimension 
de Valgébre nilpotente g. Soit b un idéal de codimension 1 dans g et soit 7 
Vhomomorphisme de W(g), considéré comme b-module, dans W(b) qui 
associe 4 tout fe W(g) C 2(£(g), k) sa restriction 4 H(b) C E(g). L’homo- 
morphisme 7 se prolonge en un homomorphisme 7 du complexe de cochaines 
C(b,W(g)) dans C(b,W(b)). En composant V’homomorphisme cano- 
nique C(g,W(g)) >C(b,W(g)) avec 7, on obtient un homomorphisme 
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é:C0(g, W(g)) ~C(b, W(b)). Compte tenu de l’hypothése inductive, il 
suffit de démontrer que cet homomorphisme induit un isomorphisme de 
H*(g,W(g)) sur H*(b,W(b)). On utilisera pour cela des filtrations de 
ces deux complexes. Dans C(g, W(g)), on prendra la filtration par les sous- 
complexes C(g, W,(g)) correspondant a la filtration du module W(g) par 
les Wp(g) relatifs 4 b (Lemme 7). Dans C(b, W(b)), on utilisera la filtra- 
tion par les sous-complexes C'(b, W,(b)) ott chaque W,(6) est le sous-module 
des fonctions linéaires sur #(6) nulles sur Vidéal A?. Pour tout p=0, ona 
n° Wo(g) C W,(b) et par suite €-C(g, Wp(g)) C C(b, Wp(b)). D’aprés le 
Lemme 8, l’isomorphisme y, permet d’identifier C(g, Wp(g))/C(g, Wo-+(g)) 
=O(g, avee C(g,2(A'/A?, W(g/b)) qui est aussi le 
complexe 2(A?*/A”,C(g, W(g/b)) des applications k-linéaires de Vespace 
Av1/AP dans C(g,W(g/b)). Dautre part, C(b, Wp(b))/C(b, 
s’identifie au complexe 2(A?1/A?, C(b, k)). Aprés ces identifications, ’homo- 
morphisme déduit de € par passage au quotient devient V’application de 
C(g, V(g/b)) dans 2(A?-*/A?”, C(b, k)) qui est définie par ’homo- 
morphisme ow: C(g, W(g/b)) > C(b,k) qui a été étudié au paragraphe 4. 
Compte tenu du Théoréme 5, cet homomorphisme induit un isomorphisme de 
la cohomologie du premier complexe sur la cohomologie du second. I] résulte 
done du “ Lemme des cing ”’ que € induit bien un isomorphisme de H*(g, W(g)) 
sur H*(b, W(b)). 


8. Trivialité de H*(g,V(g)) lorsque g est résoluble. 


THEOREME 7. Si g est une algébre de Lie résoluble sur un corps de 
caractéristique 0, alors H®(g,V(g)) = (0) pour tout entier p> 0. 


La démonstration consiste & se ramener au Théoréme 6. Soit b Vidéal 
dérivé de g. C’est une algébre de Lie nilpotente. Tout élément de V(g) 
étant contenu dans un sous-module V,(g) relatif 4 Vidéal 6 (Lemme 7), sa 
restriction 4 H(6) sera nulle sur un idéal A? C N? et c’est done un élément 
de W(b). On a ainsi un homomorphisme canonique 7 de V(g) considéré 
comme 6-module dans W(b) et cet homomorphisme se prolonge en un homo- 
morphisme 7: C(b, V(g)) C(b, W(b)). En composant V’homomorphisme 
canonique C(g, V(g)) C(b, V(g)) avec 7%, on obtient un homomorphisme 
canonique du complexe C(g, V(g)) dans le complexe ((b,W(b)). Compte 
tenu du Théoréme 6, il suffit de démontrer que cet homomorphisme induit un 
isomorphisme de H*(g,V(g)) sur H*(b,W(b)). On utilisera pour cela 
des filtrations de ces deux complexes. Dans C(g, V(g)) on prendra la filtration 
par les sous-complexes C(g, Vp(g)) (Lemme 7) et dans C(b, W(b)) 1a filtration 
par les sous-complexes C(b, W,(b)) déja introduits dans la, démonstration du 
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Théoreme 6. Pour tout p20, on a 7°Vp(g) C Wp(b) et par suite 
Vo(g)) C C(b, 
D’aprés le Lemme 9, l’isomorphisme yp, permet d’identifier 


C(g, Vn(g))/C(g, Vraal(g)) =C(9, 


avec O(g, 2(A”-*/A”, V(g/b)) qui est aussi le complexe 2(A?*/A?, C(g, V(g/b)) 
des applications k-linéaires de ’espace A?-*/A? dans C(g, V(g/b)). D’autre 
part C(b, W,(b))/C(b, W,.(b)) sidentifie au complexe 2(A?-*/A?, C(b, k)). 
Aprés ces identifications, ’homomorphisme déduit de € par passage au quotient 
devient ’homomorphisme de 2(A?-1/A®, C(g, V(g/b)) dans C(d, k)) 
qui est défini par ’homomorphisme o,: C(g, V(g/b)) > C(b,k) étudié au 
paragraphe 4. Compte tenu du Théoréme 5, cet homomorphisme induit un 
isomorphisme de la cohomologie du premier complexe sur la cohomologie du 
second. I] résulte donc du “Lemme des cing” que € induit bien un iso- 
morphisme de H*(g, V(g)) sur H*(b, W(b)). 


Démonstration du Théoréme I. Soit M un module de représentation 
d’une algébre de Lie résoluble g sur un corps de caractéristique 0. Pour tout 
entier p >0, on a M)) = H®(g, V(g))@ M = (0) daprés 
le Théoréme 7. Compte tenu du Théoréme 2, ceci prouve que les classes 
dans H?(g, M) sont effacables. 

On observera pour terminer que, compte tenu du Théoréme 3, le Théoréme 
8 signifie que, sur un groupe résoluble simplement connexe, done homéo- 
morphe a R”, la cohomologie du complexe des formes différentielles réelles 
dont les translatées sont dans un espace de dimension finie est la méme que 
celle du complexe de toutes les formes différentielles. Nous ne connaissons 
pas de démonstration directe de ce résultat. Le méme circonstance se vérifie 
trivialement pour un groupe compact; elle n’est cependant pas générale. 
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COHOMOLOGY OF RESTRICTED LIE ALGEBRAS.* 


By G. HocHscHILD. 


Introduction. We shall be concerned with restricted Lie algebras of 
characteristic p (=4 0), in the sense of N. Jacobson, [6]. The structure of a 
restricted Lie algebra Z comprises, in addition to the usual Lie algebra struc- 
ture, a map of L into itself, denoted x —> z!”!, with properties that correspond, 
in a fashion we shall make precise later, to the properties of the map 2 —> 2? 
in an associative algebra of characteristic p. The appropriate representation 
theory for such algebras is accordingly obtained by confining one’s attention 
to the restricted representations in which the transformation corresponding 
to zl is the p-th power of the transformation corresponding to z. Just as 
the ordinary representations of a Lie algebra correspond to the representations 
of its universal enveloping (or Birkhoff-Witt) algebra, the restricted repre- 
sentations of a restricted Lie algebra correspond to the representations of a 
certain homomorphic image of its universal enveloping algebra; its u-algebra, 
in the sense of Jacobson, [6]. It has been shown by H. Cartan and 8. Eilen- 
berg that the cohomology groups of a Lie algebra can be computed, as described 
in section 1 below, from any augmented free acyclic complex over its universal 
enveloping algebra. If, in this context, one replaces the universal enveloping 
algebra by the u-algebra one obtains new groups, which we shall call the 
restricted cohomology groups. One is thus led to the problem of deter- 
mining what becomes of the usual elementary interpretations of the 1- and 2- 
dimensional cohomology groups in terms of extensions of modules and Lie 
algebras when one passes from the “ ordinary ” situations to the corresponding 
“restricted ” ones. This is what we propose to do here.1 The results are 
what one would naturally expect them to be, except perhaps for those which 
deal with the relations between the “ ordinary ” objects and the corresponding 
“restricted ” ones, such as are expressed in Theorems 3. 1 and 3. 2. 


* Received October 14, 1953. Part of this work was done while the author was 
participating in the conference on Lie algebras and Lie groups, July-August, 1953, 
organized by the American Mathematical Society with the financial support of the 
National Science Foundation. 

*T am indebted to N. Jacobson for the suggestion that restricted Lie algebras can 
thus be used as experimental animals to be observed in the environment of general 
cohomology theory. 
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1. Basic notions. Let LZ be a Lie algebra over a field F. We denote 
by 7, the tensor algebra built over the F-space L.2 As an F-space, T, is 
the direct sum of subspaces 1=0,1,---, where T;°=—F, 
T12=L®L, and generally 7,‘ is the i-fold tensor product L®- - -@L, 
relative to F, of Z by itself. The multiplication in 7’, is defined by the 
canonical map > = T1'T1) = Let Iz denote 
the 2-sided ideal of 7; which is generated by the elements of the form 
«@y—ySr—[z,y] eL@OL-+ L, with x and y ranging over L, and [z, y] 
denoting their commutator in Z. The algebra Ry, = T,/I;, is the universal 
enveloping algebra of Z. We identify F with its natural isomorphic image 
(F + 1,)I;, in Ry, and similarly we identify Z with its natural isomorphic 
image + I,)/I;, in Rr. 


The natural image R,* of > T,‘ in R, is a 2-sided ideal in Ry, and, as a 


vector space over F’, RP, is the direct sum of R;* and F. We have a canonical 
projection « of R; onto F whose kernel is R;*, namely the map induced by the 
projection of onto 

The importance of R, lies in the almost evident fact that the repre- 
sentations of LZ correspond in a natural 1-1 fashion to the “ unitary” 
representations of Ry, i.e., to those in which the identity element of R, 
is mapped into the identity transformation. Accordingly, the cohomology 
theory of Z can be based entirely on the ring R;, with the projection «, as 
has in fact been done by H. Cartan and S. Hilenberg. Since their general 
theory of algebraic complexes has not yet appeared in print, we shall be 
content with only a few allusions to it, which may eventually serve to clarify 
the underlying ideas for some of our manipulations of the cohomology groups. 

We regard F as an Rz-module with “trivial operators,” i.e., if re Rr 
and ae F, we set r-a—e(r)a. The usual cohomology groups H"(L, M) for 
LZ in an L-module M coincide with the groups Extr,"(F, M), as defined by 
Cartan-Hilenberg, where M is regarded as a unitary Rz-module in the natural 
fashion. 

The usual explicit computation of H(L,M) starts with the alternating 
n-linear functions on Z with values in M as the n-cochains, and with the 
coboundary operator §, where 


(Sf) (2o, La) 


* We look upon this and related concepts from the point of view of chapter 1 of [1]. 
In particular, see p. 4 of [1] for a precise definition of the tensor algebra. 
* For the details of this description of H(L,M), see § 1 of [5]. 
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It is a consequence of the general theory of algebraic complexes that the 
following cochains and coboundary operator lead to the same cohomology 
groups, up to a canonical isomorphism. The n-cochains are now the n-linear 
functions (not alternating) on R,* with values in M, and the coboundary 
operator 8 is defined by the formula 


i=1 


Observe that this is what one obtains from the cohomology theory of asso- 
ciative algebras if one makes all the right operators 0. 

One can give an explicit cochain map which induces the canonical iso- 
morphism between the cohomology groups as defined above in the two different 
ways. For every cochain g of the second, “ associative ” type, define a cochain 
g’ of the first, “ Lie” type by the formula 


where the summation is over all permutations o of the set (1,---,n), and 
where éa is equal to + 1 or —1 according to whether o is even or odd. It 
can be verified directly that 8(g’) = (8@)’, so that the map g — g’ induces a 
homomorphism of the cohomology groups. The fact that this homomorphism 
is actually an isomorphism onto is proved by means of the general theory of 
algebraic complexes. In order to apply this theory, we must observe that, 
in the terminology of Eckmann, [2], each of the above descriptions of H(L, M) 
exhibits it as the “ equivariant ” cohomology group of some (augmented) free 
acyclic R;-complex. The first description is obtained from the following 
complex: let Cy) = Rx, regarded as an R;-module in the natural fashion. This 
is the 0-dimensional part of the chain group, regarded as an R;-module. The 
augmentation is «:C,— F, as defined above. For n > 0, denote by LZ, the 
n-fold exterior product of the F-space L by itself.t The n-dimensional part 
of our Rz-module of chains is defined as C, = R;, ®@ Lp, with its natural struc- 
ture of an Ry-module. The boundary operator d of our complex is charac- 
terized by the formula 


d(r® (a A: -A 

= (— 1)#(raj) (xo A: + (— @ ([ap, A A: -A am). 

Evidently, each C,, is a free R,-module, and it is not too difficult to verify that 
*This means that Z, is the homogeneous component of degree n of the exterior 


algebra over L, in the sense of §5 of [1]. The symbol / in the boundary formula 
below indicates multiplication in this exterior algebra. 
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d? = 0, so that we have indeed a free Rz-complex. In order to show that this 
complex is acyclic, one observes that, by means of an ordered basis of L, one 
can define a graduation of R;; the degrees being the total degrees in the basis 
elements of Z in the representation of the elements of Rz as linear combina- 
tions of ordered monomials (which are well known to be linearly independent) 
in these basis elements. One then defines an increasing filtration of the com- 
plex by means of a sequence of subcomplexes defined by the condition that, 
in the m-th complex, the n-chains shall have their coefficients in R, of degree 
no greater than m—n. The factor complex obtained from two successive 
such subcomplexes can be described quite simply in terms of the symmetric 
algebra ®° and the exterior algebra over LZ, and can be shown to be acyclic by a 
reductive verification. By a familiar argument of algebraic homology theory, 
it follows then immediately that the whole complex is acyclic. Now if I is 
any unitary #;-module, one constructs the cochain eomplex whose n-dimen- 
sional cochains are the Ry-homomorphisms of C,, into Jf, and whose coboundary 
operator § is the dual of d. The corresponding cohomology group is the “ equi- 
variant ” cohomology group derived from J/ and our complex (C,d). It is 
evident that this gives precisely the usual description of H(L, M) in terms of 
Lie cochains; one merely has to observe that the linear maps of LZ, into M 
may be identified with the n-linear alternating functions on LZ with values 
in M. 

The situation of the second, associative description of H(L, M) is a good 
deal simpler. We construct an appropriate augmented free acyclic Rz-complex 
(D, d) as follows. D, = (Co, and the augmentation is the same map e which 
we have used above. For n> 0, - -@ (one factor 
followed by n factors R;*). The boundary operator is characterized by the 
formula 


d(r@s.®-- -@sn) 
i=1 
Here we can define an explicit homotopy operator h which shows at once that 
(D,d) is acyclic. On F, h is defined as the injection into R;, which is an 


inverse of «. On D, = Ry, h is the map r>1®r*, where r* denotes the 
component r—e(r) of r in R;*. Generally, 


h(r@s,@---@s,) =1 Ort @s,@- - 
It is easily verified that dh + hd is the identity map of D onto itself, whence 


5 Loosely speaking, this is the algebra of the commutative polynomials in the basis 
elements of L, with coefficients in F. For a precise definition, see § 4 of [1]. 
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it is clear that our complex is acyclic, while it is evidently free. It is also 
evident that the corresponding equivariant cohomology group in any unitary 
R,-module M is exactly what our second, associative description of H(L, M) 
yields. 

Now, quite generally,° if C and D are two augmented complexes over an 
arbitrary operator ring , both having the same augmentation module, F, 
and if C is free over R, while D is acyclic, then the identity map of F onto 
itself can be extended to an admissible map of C into D, i.e., to a dimension 
preserving R-homomorphism which satisfies the usual commutativity relations 
with respect to the boundary operators and the augmentation maps. More- 
over, any two such admissible maps are equivalent to each other by a homotopy 
map of C into D. It follows that there is a canonical homomorphism of the 
cohomology group derived as above from D into the cohomology group derived 
from C, this homomorphism being induced by the dual of any admissible map 
of C into D. 

In the situation above, it is clear, by interchanging C and D, that the 
canonical cohomology homomorphism is an isomorphism onto. In order to 
complete the proof of our above claims concerning the two descriptions of 
H(L,M), we merely have to observe that our cochain map g >’ is the dual 
of the chain map y: C—> D defined as follows. On Co, y is the identity map. 
On Cy, with n > 0, y is the Ry-homomorphism which is characteribed by the 
formula y(r® (2, A- an)) = > @ It is not 

Co 


difficult to verify that y is an admissible map of C into D, so that its dual 
induces the canonical isomorphism of the associatively defined cohomology 
group onto the usual one. 

Before discussing the restricted cohomology groups, we must recall the 
basie facts concerning restricted Lie algebras. From now on, F' will always 
denote a field of characteristic p40. A restricted Lie algebra over F is a 
Lie algebra Z over F in which there is defined a map xz!) (of L into 
itself) which is subjected to the following conditions. Let D, denote the 
inner derivation of Z which is effected by the element xeZ, so that 
D,(y) =[2,y]. Then the conditions are: 


(1) (Dz)? = Dan; 
(2) foraeF and (ax) — gpzlel ; 


(3) (a+ = ell + + o(x,y), where o(z,y) is the element 
of L which, in R;, coincides with (x + y)? — a? — y?. 


* What follows is a slightly specialized reminder of some of the basic features of 
the theory of complexes over a ring, as formulated by Eckmann in [2]. 
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It is not evident that o(z,y) actually belongs to LZ. This is seen by 
operating in the polynomial ring R,[¢] in one auxiliary variable, t, which 
commutes with the elements of the coefficient ring R,;. Write 


(at + y)? = + yr si(a,y) tt 


1 
Then we have o(2, y) —5 si(v,y). On the other hand, differentiating with 
i=1 


respect to ¢, we obtain 
(ct + y)ia(at + = = isi(x, 


The left hand side is of the form 5 ujtu?*), Let us momentarily replace u 
j=0 

and v by independent non-commuting variables and operate in the ring Q 

of the non-commutative polynomials in wu and v with coefficients in F. If D, 


denotes the map w—> uw — wu = D,(w), we have 
. 
Di ( Suivur--5 D,?*(v) ) 0. 


In Q, this is easily seen to imply that } wivu?*) = D,P*(v). This formal 
j=0 

result must continue to hold after specialization to R,[t]. Hence we have 


= and a comparison of coefficients shows that 


Si a y) is a sum of commutators, and so belongs to L. Thus o(z,y) el. 
For future reference, it is important to observe that o(z,y) is obtained by 
formally integrating the polynomial D(.:,,)?"(x) with respect to ¢ and then 
putting #1. Note that this formal integration can be carried out meaning- 
fully, because the polynomial is actually of degree less than p— 1. 

Now let Z be a restricted Lie algebra over F, and denote by P,, the 
ideal of R;, which is generated by the elements of the form 2? — z!], with 
rel. Note that, by (1), these elements are in the center of R;. We define 
U;,=R,/P.. Ux is what Jacobson has called the u-algebra of ZL. We can 
still identify F and Z with their isomorphic images in Uz, and we have then 
x? in for all ze L. 

For restricted Lie algebras LZ, it is appropriate to concentrate one’s 
attention on the restricted Z-modules, i.e., on those Z-modules M in which 
for all me M and all re Clearly, the restricted L- 


7 It is implied in our notation that LZ-modules are simultaneously regarded as Ry; 
modules. In particular, when we write x?.m, «#? denotes the p-th power of # in R;,. 
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modules are precisely the unitary Uz-modules. Contrary to R,, the algebra 
U, is of finite dimension over F', whenever L is finite dimensional. In fact, 
if (a) is an ordered basis for L over F’, the ordered monomials 2;,%- - - 2,%, 
with and 0 Se; < p, constitute a basis for Uz over whether 
the basis is finite or not. 

Our projection « of Rz onto F induces a projection ¢ of Uz, onto F 
whose kernel is the natural image U;* of R,* in Uz. If F is regarded as a 
U,-module with wu: «= «’(u)a, then the restricted cohomology groups for L 
in the restricted L-module M coincide with the groups Exty,”(F,M) of 
Cartan-Hilenberg. One can avoid this reference to the general theory by 
using the explicit cochain complex of the associative type which is obtained 
from the one we discussed above simply by replacing R,* with U,*. Observe 
that the usual description of cohomology for Lie algebras in terms of Lie 
cochains leaves no room for such a modification, so that a reformulation was 
necessary for our purpose. 

The canonical homomorphism of Rz onto Uz, allows us to regard any 
U;-module also as an R;-module. An acyclic Uz-complex can therefore be 
regarded also as an acyclic Rz-complex. It follows from our above general 
remarks concerning algebraic complexes that there is a canonical homomor- 
phism of the restricted cohomology groups H,."(L, M) into the ordinary 
cohomology groups H"(L, 11) which is induced by the dual of any admissible 
chain map of the free acyclic R;-complex used for computing H(L, M) into 
the free acyclic Uz-complex (regarded as an acyclic R;-complex) used for 
computing H,,(L, 7). An explicit cochain map inducing this homomorphism 


is given, in the associative description, by g—g°, where g°(s1,° Sn) 
= +, 8'n), with R,* and s’; its canonical image in 


Let C(M) denote the complex of the cochains for R;* in the restricted 
L-module M, and let C°(J/) stand for the subcomplex consisting of the 
cochains of the form g°, where g is a cochain for U;* in M. Then we have 
the exact sequence of complexes (0) > C°(M) > C(M) > C(M)/C°(M) = (0), 
which gives rise to the usual exact sequence connecting their cohomology 
groups. Since our map g —> g® is an isomorphism, we may identify H"(C°(M)) 
with H,.”"(L. M), whence we have the exact sequence 


It is evident that H°(C(M/)/C°(M)) = (0), whence we conclude that the 
canonical homomorphism H'(L,M) is an isomorphism into 
(a fact which we shall verify directly in the next section). As a by-product 
of other results. we shall also obtain (in section 3) a simple description of 
H'(C(M)/C°(M)) and some information about H?(0(M)/0°(M)). 
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2. Extensions of modules. Let L be a Lie algebra over F’, and let K 
and Q be Z-modules. An extension of K by @Q is a pair (4, ¢), where £ is 
an L-module containing K, and ¢ is an L-homomorphism of FE onto Q whose 
kernel coincides with K. Two such extensions (/,¢) and (H’, ¢’) are said 
to be equivalent if there is an L-isomorphism @ of F onto E’ which leaves 
the elements of K fixed and satisfies the relation ¢’«—¢. The equivalence 
classes of the extensions of K by Q constitute a vector space ext(K,Q). 
The definition of the vector space structure of ext(K,@) is exactly the same 
as the one given (for the case of associative algebras) on p. 924 of [4]. 

Denote by M the F-space of all F-linear maps of Q into K. We 
make M into an L-module by setting, for me M, re L, geQ, (x: m)(q) 
=2x-m(q)—m(a-q). It is essentially well known that ext(K,@Q) is iso- 
morphic with the ordinary 1-dimensional cohomology group H'*(L, M), 
regarded in the natural fashion as a vector space over F. We recall only 
the way in which one attaches a certain cohomology class to each extension 
(E,¢) of K by Q. We can find a linear map y of Q into F such that gy 
is the identity map on Q. Then we obtain a linear map f of Z into M by 
setting f(x) (q) =2z-¥(q) —y(a-q). One verifies directly that f is a Lie 
1-cocycle for Z in M, and that its cohomology class depends only on the 
equivalence class of (Z,¢). It follows immediately from the definition of 
the vector space structure in ext(K,@Q) that this yields a homomorphism of 
the F-space ext(K,@Q) into the F-space H*(L,M). The fact that this is 
actually an isomorphism onto is proved in exactly the same way as Theorem 
1.2 of [4]. 

Now let us consider the analogous situation in the case where L is a 
restricted Lie algebra, and K and Q are restricted Z-modules. An extension 
(E,¢) of K by Q is then called a restricted extension if EF is a restricted 
LI-module. The equivalence classes of the restricted extensions of K by Q 
constitute a subspace ext,(K,Q) of ext(K,Q). Next we observe that our 
module M is now a restricted L-module. In fact, by a familiar computation, 
using that for commuting maps wu and v (here, u and v are the endomor- 
phisms of M consisting in composition of an element of M with the z-operator 
on K or Q, respectively) one has (w—v)?=—u?—v?, we find that 
(a?-m)(q) =2?- m(q) — m(2?-q), whence, since K and Q are restricted, 
m, which proves our claim. 

Let (£,¢) be a restricted extension of K by Q, and let f be a Lie 
1-cocycle derived from (H£,¢) as above; i.e., with an evident extension of 
the notation used before, f(z) = 2x-y, where y is a linear map of Q into F 
which is inverse to ¢. We have xz?'- f(x) —2?-y, and since F and @Q are 
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restricted, it follows by the argument just above that 2?*- f(x) —f(a!!), 
for all re L. Conversely, let f be any Lie 1-cocycle for Z in M such that 
g-1- f(x) =f(r!), for all ee L. We construct a corresponding extension 
(Z,¢) of K by Q as follows. The underlying vector space of HF is the 
direct sum QY+ KK, ¢(¢g-+k) =gq, and the JL-operators are defined by 
Then the equivalence class of 
is indeed the element of ext(K,@) which corresponds to the cohomology class 
determined by f. Futhermore, a direct computation shows that 


(q+ k) g 4 4 Sad f(a) 
j=0 


If we apply the identity S uous = D,?-*(v) to the sum on the right, we 
j=0 

find that it is equal to (2?'-f(xr))(q) =f(x™!)(q). Hence we conclude 
that is a restricted extension. 

In order to interpret this result in the framework of cohomology theory, 
we consider the canonical homomorphism of H,'(L,M@) into H*(L,M). 
We know that it is induced by the cochain map g — g° which we have defined 
at the end of sectionl. This cochain map is evidently an isomorphism into, 
and in dimension 0 it is simply the identity map of M onto itself. It 
follows that the induced homomorphism maps H,,'(Z, M) isomorphically into 
H'(L,M). Accordingly, we shall identify H,(ZL, MZ) with the corresponding 
subspace of H'(L, 1/). There remains to decribe this subspace in terms of 
Lie 1-cocycles. The result is the following: 


THEOREM 2.1. The canonical homomorphism of H,(L, M) into H(L, M), 
where M is any restricted L-module, maps H,,\(L, M) tsomorphically onto that 
subspace of H'(L, M) whose elements are represented by the Lie 1-cocycles f 
which satisfy the relation f(x) =f (z'?!). 


Proof. Let f be a Lie 1-cocycle whose cohomology class belongs to 
H,'(L,M). Then there is an associative 1-cocycle g, defined on Uz*, such 
that the cohomology class of g° coincides with that of f. This means that 
there is an element me M such that f(z) = «-m + g(z), for allze L. Hence 
art. f(x) 2? -m + - g(x) = - m + g(x?) = since 2? = in 
U,. Conversely, suppose that f satisfies the relation of the theorem. By the 
equivalence of the associative cochain complex with the Lie cochain complex 
it follows that f is the restriction to LZ of an associative cocycle h defined 
on R,*. We have then, for re R,; and ze L, 


h(r(ae —alel)) = (a? - f(x) —f(rl)) = 0. 
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Thus, h vanishes on Pz, and is therefore the natural image g° of an associative 
cocycle g defined on U;*. Clearly, the cohomology class of g is mapped into 
that of f by the canonical map of H,?(L, .M/) into H*(L,M). This completes 
the proof. 

We may now state our result concerning module extensions as follows: 


THEOREM 2.2. Let K and ¥ be restricted L-modules, and let M be the 
restricted L-module consisting of the linear maps of Q into K. Then the 
canonical isomorphism of ext(K,Q) onto H*(L,M) maps ext,(K,Q) onto 


3. Extensions of Lie algebras. Let M be an abelian Lie algebra, L 
an arbitrary Lie algebra. An extension of M ‘by L is a pair (£,¢), where E 
is a Lie algebra containing M as an ideal, and ¢ is a homomorphism of FL 
onto L with kernel M. This situation defines on M the structure of an 
L-module, with Z operating on M via F, in the natural fashion. An auto- 
morphism of such an extension is an isomorphism @ of EF onto EF which 
leaves the elements of M fixed and satisfies the relation ¢¢—¢. Then 
the element a(e) —e, with ee HL, depends only on ¢(e), and we may write 
a(e) =e-+f(¢(e)), where f is easily seen to be a Lie 1-cocycle for L in M. 
Conversely, every 1-cocyele gives rise to an automorphism of (/,¢) which 
may be defined by the last relation. Thus the automorphisms of (£, ¢) 
constitute a vector space isomorphic with the space Z*(L,M) of the Lie 
1-cocyeles for Z in M. In particular, if me M, we obtain a corresponding 
automorphism e— e+ [e,m], which we call the pseudo-inner automorphism 
effected by m. Clearly, this gives a rather superficial interpretation of 
H*(L, M) as the space of all automorphisms of (#,¢), modulo the subspace 
of the pseudo-inner automorphisms effected by the elements of M. Our 
purpose in bringing this up is that we wish to illustrate, in the simplest case, 
a certain difficulty which arises in treating restricted extensions of Lie algebras. 
Now we assume that J/, LZ, and F are restricted Lie algebras, and that ¢ is a 
restricted ” homomorphism of onto ZL, i. e., d(e'!) = (d(e))™!, for every 
ee EH. The pair (£,¢) is then called a restricted extension of WM by LZ. It is 
clear that M is a restricted Z-module in this case. It is here appropriate 
to study the restricted automorphisms 2 of (H#,¢). If, as above, we write 
x(e) =e-+f(d(e)), the condition that « be restricted is easily seen to 
amount to the condition that =f(a!l) — (f(r))®!, for all 
rel. In fact, for ee # and meM, condition (3) on the p-map gives 
(m + + as is seen from our determination within 
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E of the term o(m,e¢). Our assertion concerning f follows at once from this. 
This result suggests that, in order to obtain interpretations for the restricted 
cohomology groups in connection with Lie algebra extensions, we should 
consider restricted extensions in which the kernel M is “strongly abelian,” 
in the sense that J’! — (0), as well as [M,M]— (0). If M is strongly 
abelian the above shows that the canonical image of H,,1(Z, M) is precisely 
that subspace of H*(Z, 1/) which corresponds to the restricted automorphisms 
of (E, ¢). 

In general, two restricted extensions (H,¢) and (H’,¢’) of M by L 
will be called similar if there exists an ordinary Lie algebra isomorphism y 
of E onto E’ which leaves the elements of M fixed and for which ¢’y = ¢. 
lf, furthermore, y can be so chosen that y(e!?!) = (y(e))”!, for all ee £, 
we shall say that and ¢’) are equivalent. 

We place ourselves in the situation where we are given an L-module M, 
and we consider the set ext(M/, LZ) of the similarity classes of those extensions 
of M by L which induce the given Z-module structure in M. As is well 
known, ext (.V/, L) carries the structure of a vector space over the base field F, 
and there is a canonical isomorphism of ext(J/, Z) onto H?(L,M).° If L is 
a restricted Jie algebra and M is a restricted Z-module we can consider the 
subset exty(-V, 1) of ext(M,Z) which consists of the similarity classes of 
the restricted extensions. The relation of similarity is so weak that, in con- 
sidering exto(.M, L), the actual p-maps in the extended algebras have little 
influence, although the requirement that they should exist cannot be neglected. 


The resulting simplification is expressed by the following lemma. 


LEMMA 3.1. Every restricted extension of M by L is similar to one tn 
which M is strongly abelian. 


Proof. Let (2, ¢) be a restricted extension of M by L. It follows at once 
from condition (1) on the p-map that J/) must be contained in the center 
of £. It we denote by VW” the subspace of J consisting of all elements that 
are annihilated by L in the given Z-module structure of M, this can be 
expressed by the inclusion M'’!C M/. The restriction to M of the p-map in 
fis a “ p-semilinear ” map of M into .//, in the sense that we have, for ae F 
and L. (am, + am.) == + This is seen at once 
from conditions (2) and (3) on p-maps, noting that M is abelian. We can 
evidently extend this map to a p-semilinear map & of F into M4. It is then 


®* The constructions which are needed in this connection will appear in our proofs 
below. The 0-element of ext(M,Z) is the class of the trivial extension which contains 
Las a subalgebra, 
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Thus, 4 vanishes on Pz, and is therefore the natural image g° of an associative 
cocycle g defined on U;*. Clearly, the cohomology class of g is mapped into 
that of f by the canonical map of H,*(L, M/) into H*(L, M). This completes 
the proof. 


We may now state our result concerning module extensions as follows: 


THEOREM 2.2. Let K and Y be restricted L-modules, and let M be the 
restricted L-module consisting of the linear maps of Q into K. Then the 
canonical isomorphism of ext(K,Q) onto H*(L,M) maps ext,(K,Q) onto 
H,)(L, M). 


8. Extensions of Lie algebras. Let MM be an abelian Lie algebra, L 
an arbitrary Lie algebra. An extension of M by L is a pair (L,¢), where HL 
is a Lie algebra containing M as an ideal, and ¢ is a homomorphism of F 
onto L with kernel M. This situation defines on M the structure of an 
L-module, with Z operating on M via #, in the natural fashion. An auto- 
morphism of such an extension is an isomorphism ¢ of E onto EH which 
leaves the elements of M fixed and satisfies the relation ¢a—¢. Then 
the element «(e) —e, with ee FH, depends only on ¢(e), and we may write 
a(e) =e-+f(d(e)), where f is easily seen to be a Lie 1-cocycle for L in M. 
Conversely, every 1-cocyele gives rise to an automorphism of (£,¢) which 
may be defined by the last relation. Thus the automorphisms of (£, ¢) 
constitute a vector space isomorphic with the space Z!(L,M) of the Lie 
1-cocycles for Z in M. In particular, if me M, we obtain a corresponding 
automorphism e— e + |e, m], which we call the pseudo-inner automorphism 
effected by m. Clearly, this gives a rather superficial interpretation of 
H*(L,M) as the space of all automorphisms of (#,¢), modulo the subspace 


of the pseudo-inner automorphisms effected by the elements of M. Our 


purpose in bringing this up is that we wish to illustrate, in the simplest case, 
a certain difficulty which arises in treating restricted extensions of Lie algebras. 
Now we assume that M, LZ, and £ are restricted Lie algebras, and that ¢ is a 
* restricted homomorphism of onto ZL, i. e., d(e'?!) = (¢(e))™!, for every 
ee The pair (2, ¢) is then called a restricted extension of M by L. It is 
clear that M is a restricted Z-module in this case. It is here appropriate 
to study the restricted automorphisms « of (H,¢). If, as above, we write 
x(e) =e-+f(d(e)), the condition that a be restricted is easily seen to 
amount to the condition that f(x) = f(all) — (f(x))™!, for all 
cel. In fact, for ee H and meM, condition (3) on the p-map gives 
(m + mie) + + as is seen from our determination within 
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F of the term o(m,e). Our assertion concerning f follows at once from this. 
This result suggests that, in order to obtain interpretations for the restricted 
cohomology groups in connection with Lie algebra extensions, we should 
consider restricted extensions in which the kernel M is “strongly abelian,” 
in the sense that — (0), as well as Z]— (0). If M is strongly 
abelian the above shows that the canonical image of H,,7(L, M) is precisely 
that subspace of H'(L, 1) which corresponds to the restricted automorphisms 
of (E, 

In general, two restricted extensions (#,¢) and (H’,¢’) of M by L 
will be called similar if there exists an ordinary Lie algebra isomorphism y 


ce 


of EF onto E’ which leaves the elements of M fixed and for which ¢’y = ¢. 
lf, furthermore, y can be so chosen that y(e!!) = (y(e))'”!, for all ee #, 
we shall say that (#,¢) and (H’, ¢’) are equivalent. 

We place ourselves in the situation where we are given an L-module M, 
and we consider the set ext(M, L) of the similarity classes of those extensions 
of M by L which induce the given Z-module structure in M. As is well 
known, ext (.V, L) carries the structure of a vector space over the base field F, 
and there is a canonical isomorphism of ext(J/, LZ) onto H*?(L,M).8° If L is 
a restricted Lie algebra and M is a restricted Z-module we can consider the 
subset exty(-VU/, 1) of ext(J/,Z) which consists of the similarity classes of 
the restricted extensions. The relation of similarity is so weak that, in con- 
sidering exto(.V, LL), the actual p-maps in the extended algebras have little 
influence, although the requirement that they should exist cannot be neglected. 


The resulting simplification is expressed by the following lemma. 


LEMMA 3.1. Kvery restricted extension of M by L is similar to one tn 
which M is strongly abelian. 


Proof. Let (2, ) be a restricted extension of M by L. It follows at once 
from condition (1) on the p-map that J/'! must be contained in the center 
of 2. Ti we denote by WV” the subspace of M consisting of all elements that 
are annihilated by L in the given Z-module structure of .W, this can be 
expressed by the inclusion M'?! CM". The restriction to M of the p-map in 
Lisa“ p-semilinear ” map of J/ into 1/4, in the sense that we have, for ae 
and L. (a,m, + am.) == + This is seen at once 
from conditions (2) and (3) on p-maps, noting that M is abelian. We can 
evidently extend this map to a p-semilinear map / of # into M4. It is then 

*The constructions which are needed in this connection will appear in our proofs 
below. The 0-element of ext(M,L) is the class of the trivial extension which contains 
Las a subalgebra. 
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easily seen that the map e—e = el?) —k(e) still satisfies the conditions 
for a p-map. If £ is equipped with this new p-map, we have a restricted 
extension of M by L which is similar to the given extension by the identity 
map of £ onto £, and in which M is strongly abelian. 

A first consequence of this result is that ext)(M,L) is a subspace of 
ext(M/, LZ). In order to prove this, we must recall the definition of the vector 


space structure in ext(J/,Z). The subtraction in ext(M,L) is the map | 


which is induced by the following composition of extensions (#,¢) and 
(E’, ¢’). Let D denote the subalgebra of the direct sum of H and E”’ which 
consists of all elements (e,e’) in which ¢(e) = ¢’(e’). Let J be the ideal 
of D consisting of the elements (m,m), with me M. In D/J, we identify 
(M, M)/J with M by means of the homomorphism (m, m2) — m, — mz whose 
kernel is exactly J. The homomorphism (e, e’) > ¢(e) induces a homo- 
morphism y of D/J onto L, and (D/J,w) is the extension of M by L which 
represents the difference of the similarity classes of (ZF, ¢) and (E’,¢’. Now 
if (£,¢) and (£’, ¢’) are restricted extensions, we can define a p-map in D 
by setting (e, == (ell, Since this maps J into itself, it induces 
a p-map in D/J, with which (D/J,w) constitutes a restricted extension of 
M by L. Hence ext,(M, L) is a subgroup of ext(1V/, L). 

The scalar multiple of the similarity class of (Z,¢@) by an element ae F 


is the similarity class of the extension defined as follows. First construct a | 


Lie algebra (£, ) whose space is the direct sum of FH and M, and where 
the commutation is defined by the formula . 


[ (€1, m1), (€2, M2) | = ([e1, €2], m2 — Mm). 


Let J, be the ideal of (#, 11) which consists of the elements of the form 
(m,—am), with me M. In (F,M)/Jq, identify (M,M)/Je with M by the 
homomorphism (m,,m,)—>am,-+ m, whose kernel is Ja. Let y be the 
homomorphism of (£, M)/J, onto Z which is induced by the homomorphism 
(e,m)—>¢(e). Then the similarity class of ((Z,M)/Ja,) is the desired 
a-multiple of the similarity class of (EF, ¢). 

We wish to show that multiplication by a maps ext)(M, L) into itself. 
By Lemma 3.1, every element of ext,(M,L) is represented by a restricted 
extension (£,¢) in which M is strongly abelian. In that case, the map 
(e,m) —> (e, — (ell, annihilates (M, M) and therefore induces 4 
p-map in (£, M)/J, (which annihilates 1/). With this p-map, M)/Ja,¥) 
is a restricted extension of M by LZ (in which M is strongly abelian). This 
completes the proof that ext)(M/, L) is a subspace of ext(M, L). 

Now denote by H-?(L, M) the image of ext.(M, LZ) in H?(L, M), by the 
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canonical isomorphism of ext(J/, L) onto H*(L,M). We wish to characterize 
H,?(Z, M) in terms of cohomology theory. Let f be any Lie 2-cocycle for L 
in M. We consider an extension of / by L whose similarity class corresponds 
to the cohomology class ¢ of f, by the canonical isomorphism. Such an exten- 
sion is the following. ‘The underlying space of the extended algebra is 
the direct sum of Z and J/, and the commutation is defined by the formula 
[(x, m), m1) ] = 2, The homomor- 
phism onto L is the map (z,m)—z. It is easy to verify that we have 


1 
0) = ([x!, ke(a1)), where = f(x, 


If we apply D(o,o)? to the commutator [ (2,0), (72,0) ] and express the fact 
that D,z,o)? is a derivation, we obtain 


2, — halts) + f(s, (2, + ay], 2). 
If we write fy(x) = f(z, y), this relation means precisely that, for each fixed 


teL, kz + fate: is a Lie 1-cocycle for in M. 
If f is a coboundary, 89, we obtain 


4=0 


Now the identity (valid in any commutative ring) 


k-1 
u® — (u—v)* = v ut(u— i 
i=0 
shows that 


If we specialize this to the case where wu is the left multiplication by x in Ry 
and v is the right multiplication by x in R;, we find that 
(1) (— 1)*(; 44 
Set k =p in this relation and substitute the result in the above expres- 
sion for k,(2,). This gives 


whence 
ke + f (21, 2!) = — g(z)). 
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It follows that the cohomology class of /,-+ fz depends only on the 
cohomology class ¢ of f. We shall denote it by c(z). 

If ce H,?(L,M), we can define a p-map in our extended algebra so as 
to obtain a restricted extension of by L. Write (2,0)! = (a!), p(z)), 
Then, since = we must have k,(z7,) = 2,) — a, p(a), 
ky + fri = 8(— p(x) ), so that = 0. 

We wish to prove, conversely, that if c(x) =0, for all xeL, then 
ce H,*(L,M). We can find an associative 2-cocycle g for Rz* in M which 
represents the given cohomology class ce H*(L,M). If 


f(t, y) = —9(y, 


then f is a Lie 2-cocycle belonging to c. If we express k,(z,) in terms of 
values of g we obtain, after some obvious simplifications resulting from the 
fact that g is a cocycle, 


= 9 (2, x). 
By the above identity (1), this becomes 
ky(@1) = 9 (2, 411) — 9 x) 
= 9 (x?, —g (#1, x?) — g(xP", 
Hence c(x) has the representative 1-cocycle g’z, where 
= 9 (a? — — (21, 


This expression for g’, shows at once that the map «—>c(#) is a p-semi- 
linear map of LZ into H*(Z, MW), and that if c(x) =0, for all we L, we can 
find a p-semilinear map o of L into M such that we have identically 
g'(%) =2,:0(x). Now we define a p-map in our extended algebra by 
setting (x, m)@) = + g(x?1,x) —o(x)). Condition (2) for 
a p-map is evidently satisfied. The verification of conditions (1) and (3) 
requires some computation. By induction on k, one finds that 


i=0 i=0 
If we write the terms of the last sum on the right in terms of values of 9, 
k-1 
we find that it is equal to g(x*, 21) — ¥ g(2‘D,**-*(2,), 7). If we take k =p 
i=0 
and use the above identity (1) we obtain 


M1) = ([x!, x,], - m, — m g(a?, 21) — 2)), 
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which is easily seen to be equal to [(x, m)"!, (2,,m,)].° Thus our map 
satisfies condition (1). 


If, in our identity (1), we put k = p—1 and note that vs. 4 is con- 


p-2 p-2 
gruent to (— mod. p, we find that =— Now 
i=o j=0 


we operate in the polynomial ring Rz[t] as in our discussion of the p-map, 
in section 1. Rewriting our above result for Dzm)*(a%, m1), with k = p — 1 
and with (2,,m,)t-++ (z,m) in the place of (z,m), and using the last 
identity with z,f + 2 in the place of z, we obtain 


where q(t) is the polynomial in ¢ with coefficients in Jf which is obtained by 
expanding the expression 


(a,t + + (a,t + + (mt + m) 
+ g((ait + x)?1, + g((axt + + + 2)). 
j=0 


It is seen at once from this expression that g(t) is the formal derivative 
with respect to ¢ of the polynomial obtained by expanding the expression 
(x,t + + m) + g((ait + + Note that this implies 
that q(t) is actually of degree less than p—1. If we formally integrate 
q(t) with respect to ¢, regarding it as a polynomial of degree p— 2, and 
then put f = 1 we obtain therefore 


+ x)P*-(m, + m) + g((t1 + 2)? (@1 + 
— — — - m — 2); 


namely those terms of the last expression in which the exponent of ¢ is 
greater than 0 and less than p. On the other hand, the component in M of 


m) + (2, m,))!! is 

(x, + x)?2-(m, + m) + + (21 + — o(a, + 2). 
Hence we see that the formal integral of q(t) is precisely the component in 
M of the element ((x, m) + (2;, m,))#! — (2, — By what 


we have seen in our discussion of section 1, this means that condition (3) is 
satisfied by our p-map. Hence we have the following result. 


THEOREM 3.1. Let L be a restricted Lie algebra, M a restricted L- 


*It is here that the relation g’,(a,) = a,-¢(@) is used. 


| 
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module. Denote by S(L,H*(L,M)) the space of all p-semilinear maps of 
L into H*(L,M). For an associative 2-cocycle g, let g’ be the function 
defined by g’2(y) = 9 (a? — x), y) —g(y, 2? Then the map g 


induces an F-linear map of H?(L,M) into S(L,H*(L,M)) whose kernel | 


is precisely the subspace H,*(L,M) that corresponds to the restricted exten- 
stons of M by L. 


We shall now look at our situation more closely from the following 
point of view. We suppose that we are given a restricted L-module UM, 
together with a p-semilinear map of M into M¥. Then M has also the 
structure of a restricted abelian Lie algebra. We shall refer to these data 
by saying that M is given as a restricted abelian Lie algebra on which L 
operates. The object to be investigated is the set ext,(M,Z) of the equi- 
valence classes of those restricted extensions of M by Z which induce the 
given Z-module structure and p-map on M. If we assign to each such 
equivalence class the similarity class to which it belongs we obtain what we 
shall call the canonical map of ext,,(M, Z) into ext(M,L). It is clear from 
Lemma 3.1 that, no matter what the given p-map in M actually is, the 
canonical map sends ext,,(M/, LZ) onto ext)(M, LZ). Furthermore, it is clear 
from our discussion of the vector space structure in ext,(M, L) that ext,,(M, L) 
can be given the structure of an abelian group such that the canonical map 
onto ext,(M/, Z) is a group homomorphism. In fact, this is done by means 
of the same composition of extensions which we have used for making 
ext)(M, Z) into an abelian group. Furthermore, our earlier discussion shows 
that, in the case where M is strongly abelian, our previous construction (for 
ext)(M, L)) yields also scalar multiples in ext,,(J/, LZ) so that ext, (JM, L) 
becomes a vector space, and the canonical map onto ext)(M, LZ) is an F-linear 
map. In the general case, we do not obtain a vector space structure in 
ext,,(M, L). 

Next we wish to describe the partitioning of a similarity class c of 
restricted extensions of M by L into a set S, of equivalence classes. Let y 
be a similarity isomorphism of a restricted extension (F,¢) onto a restricted 
extension (H’, ¢’). For ee FE and me M, we have then 


while 
(y(e + m))!! (y(e) + (y(e))! mie) + m. 
It is seen from this that y(e!”)).— (y(e))! depends only on the coset of 


e mod. M, so that we may write this difference in the form g($(e)), where 9 
is a map of L into M. Our discussion of condition (3) for a p-map shows 
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that we have here y(o(é,, ¢2)) =o(y(e:), y(éz)). It follows from this that 
q is p-semilinear. Furthermore, for r= ¢(z) eL, we have 


a-g(o(e)) = (2) (y(e!) — (y(e)) = y(Lz, + Dyer? (y(2)) 
= y([z, e#!) + y(De(z)) =9, 


so that g actually maps Z into M4. Thus g belongs to the space S(L, M”) 
of the p-semilinear maps of L into M4. Now denote by EF, the restricted Lie 
algebra which is obtained from H by changing the p-map to 


e—> el?) == —. g(p(e)). 


Then (E,,@) is a restricted extension of M by L and y is an equivalence 
isomorphism of (H,,¢) onto (H’,¢’). Conversely, if g is any p-semilinear 
map of Z into M¥, then (Z,,¢) is a restricted extension of M by L which 
is similiar to (£, ¢). 

With every g ¢ S(L, M”) we associate the transformation g* on ext,(M, L) 
which is induced by the map (F,¢) > (£j,¢). We have just seen that 
g*(S-) C S., and that S(L,M*) operates transitively on each S,. The 
transformations g* are actually group translations in ext,(M,LZ). In fact, 
let s) denote the 0-element of ext,,(4/, Z), i.e., the equivalence class of the 
trivial extension. Then, for any se ext,,(M/, LZ), we have g*(s) =s + g*(So), 
as is easily seen from the definitions. Furthermore, the maps g—>g* and 
9 9*(so) are group homomorphisms, as a glance at the definitions will show. 

Next we shall determine the kernel of the map g-—>g*. Suppose that 
¢ is an equivalence isomorphism of (E,,,¢) onto (Z,.,¢). Denote the 
p-maps in Fy, and F,, by e—-et?! and e—e®), respectively. We write 
{(e) =e + h(¢(e)), where h is a Lie 1-cocycle for LZ in M, as we did 
when we studied the automorphism of an extension. If we write out the 
relation £(e}) (£(e)) we find that we must have, for every re L, 


g2(x) —gi(x) = 2?" - h(x) + (A(x)) 


We write this as g, —g, —h’. Conversely, let h be any Lie 1-cocycle for L 
in M. We know that h is the restriction to LZ of an associative cocycle f 
defined on R,*. Hence 


f(x) —f (al) + (h(x))¥) = f(ar — all) + (h(x))™!, 


whence we see that h’ is p-semilinear and takes its values in M4. Thus we 
have a corresponding extension (Ey, ¢), and the map e—> ¢(e) =e + h(¢(e)) 
is an equivalence isomorphism of (E,¢) onto (Hy,¢). Hence the kernel 
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of the map g — g* is precisely the image in S(L, M”) of the space Z'(L, M) 
of the Lie 1-cocycles for Z in M, under the above linear map h > h’. 

By the above, the map g—>g*(so) of S(L,M¥) into ext, (M, L) has 
exactly the same kernel, while its image is precisely the kernel of the canonical 
homomorphism of ext,,(M, onto ext)(M, Thus we have the following 
result. 


THEOREM 3.2. Let M be an abelian restricted Lie algebra on which the 
restricted Lie algebra L operates. Then the linear map h—>)h’ of Z*(L, M) 
into S(L, where h’(x) - h(x) —h(a!) + the homo- 
morphism g—>g*(8)) of S(L,M*") into ext,,(M,L), and the canonical 
homomorphism of ext,,(M,L) onto exty(.V,L) constitute an exact sequence 


Z1(L, M) > 8(L, (M, L) > ext) (M, L) > (0). 


From now on we suppose that W is strongly abelian and connect 
ext,,(M, LZ) with H,.?(L,M). Suppose that an associative 2-cocycle g for 
U,* in M is given. We construct a corresponding restricted extension of M 
by Z as follows. The space of the extended Lie algebra, Fy, is the direct sum 
of Z and M. The commutation is defined by the formula 


[ (a1, m1), me) | = ([a1, M2 — + G(X, — %1)). 


The homomorphism ¢, of F, onto LZ is defined by ¢,(2%,m) =a. Finally, | 


we define a p-map in E, by setting (x, m)) = + g(a?",2)). 
In fact, in the course of our proof of Theorem 3. 1, we have already seen that 
this map satisfies all the conditions for a p-map. Next we show that the 
equivalence class of our restricted extension (E,,¢,) depends only on the 


cohomology class, in H,,?(L, M), of g. Let h be any 1-cochain for U;* in M. | 


For (z,m) set a(z,m) = It is easy to verify 
as usual that « is a similarity isomorphism of (Hg.5n, ¢g.sn) onto (Eg, $y). 
Furthermore, we have 


a( (x, == (ele), + x) + 2? *-h(x) —h(2?)) 


= (cll, + + = (a(x, m))™, 
i.e., @ is an equivalence isomorphism. 


Thus, the correspondence g ¢,) induces a map of H,,?(Z, into 
ext,,(M,L). Now it is not difficult to see that the difference of the equi- 
valence classes of (Ey,,¢9,) and (,,,¢,,) is represented by the extension 
Similarly, the scalar multiple by an element ae F of the 
equivalence class of (H,,¢,) is represented by (Kay, dag). Hence our map 
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of H,?(L, M) into ext, (M,Z) is an F-linear homomorphism. We shall see 
that it is actually an isomorphism. 

Suppose that (/,,¢,) is a trivial extension. This means that there is 
a restricted isomorphism y of Z into EF, such that ¢,y is the identity on LD. 
Write = —h(xv)). Then we have 


g(x,y) —g(y,x) =x-h(y) —y- h(x) —h([2,9]), 
and 
g = h(x) — 


The first relation means that the Lie cocycle corresponding to g is a co- 
boundary. Now this Lie cocycle is also the Lie cocycle which correspond 
to the associative 2-cocycle g° for R,* in M, and we conclude that g® is 
also a coboundary, i.e., there is a 1-cochain c for R;* in M such that 
g°(u, v) =u-c(v) —c(uv), for all wu, v in If c’ is the restriction of 
c to L then c’ —h is evidently a Lie 1-cocycle, and therefore coincides with 
the restriction k’ to Z of an associative 1-cocycle & for R;* in M. Replacing 
c by c—k, we may therefore assume that c’—=h. Then we have from the 
above 7) = - c(x) —c(a)), for every xe L. But this means that 
c(a?) = i.e., ¢ vanishes on Pz, so that c with f a 1-cochain 
for U;t in M. Clearly, we have g = df, and so we conclude that our homo- 
morphism of H,,.?(L, ) into ext,,(M, LZ) is an isomorphism. There remains 
to show that it maps M) onto ext,,(M, L). 

Let (BE, ¢) be a restricted extension of M by L. The homomorphism ¢ 
can be extended uniquely to a homomorphism ¢’ of Ug onto U;z, such that 
¢’(1) 1. The kernel of ¢’ is the ideal UgM of Ug which is generated by 
the elements of M. We claim that the identity map of M onto M can be 
extended in one and only one way to a Ug-homomorphism of UgM onto M, 
each being regarded as a Ug-module in the natural fashion. Clearly, there 
can be at most one such map. Let (m;) be a basis for MW over F. We claim 
that if um i= 0, with u;e Ue, then each u; must actually belong to 


In fact, if we multiply our relation by the product of the elements m*, 
where i ranges over all the indices which occur in the sum, with the exception 
of one of them, say 11, we obtain u,m,m.?"*- - -m,?-'=—0. If we write 
u, as a linear combination of ordered monomials in the elements of an ordered 
basis of Z which contains the m; we see at once that this last relation implies 
that uv, ¢ UgM, and this suffices to establish our claim. Hence there exists a 
map y of onto M such that = ¢’(ui)- mi, and it is evident 
i 


that y is a U-homomorphism. 
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Now let y be a linear map of Z into # which is inverse to ¢. We can 
extend y to a linear map y’ of Uz, into Uz such that ¢’y’ is the identity map 
on U,. For «x and y in U;*, we now set y) = — 
noting that y/(x)v’(y) —w (ay) lies in the kernel of ¢’, i.e., in UsM. Now 
observe that. for ec and meM, we have me whence 
y(me) =0. Hence we conclude that y maps Uz.MU,* into (0). Using this, 
it is easily verified that x- g(y, z) — g(ay, z) + g(a, yz) — 0, for all z, y and 
z in U;*, which means that g is an associative 2-cocycle for Uz* in M. 

Now we shall show that (H#,¢) is equivalent to (Z,,¢,). Define a map 
a: E,—E by setting a(z,m) =y(x) +m. Then we have 


a([ (a1, m1), m2) ]) 
= 22]) + + G(s, — G(@s, 2) 
= + M2 — + — 2])) 
= [¥(21), ] + mM, — mM, = [ a(x, mM), me) ], 


whence a is a similarity isomorphism. Furthermore, 


(a(x, m))@1 — (y m, 
while 
a( (a, m) ll) = y(alel) + xv) + 
But we have 
g(a?*, = (2? (x) 
= —y (ale!) ) ) — (at), 


whence (a(x, a((a, m)"!), « is an equivalence isomorphism. 
); | 


Hence our map H,?(L, M) > ext,,(M, ZL) is an isomorphism onto, and 
our above construction yields its inverse. We have therefore obtained the 
following result: 


THEOREM 3.3. Let M be a strongly abelian restricted Lie algebra on 
which the restricted Lie algebra L operates. Then there is a canonical 1so- 
morphism of ext,,(M,L) onto H,?(L,M) such that the following diagram 
of canonical F-linear maps is commutative : 

ext,,(M, L) H,?(L, M) 


J 
L) H,?(L, M). 
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In the case where WM is strongly abelian, we can replace Z*(L, M) by 
H(L, M) in the exact sequence of Theorem 3.2. Taking account of Theorems 
3.1 and 2. 1 also, we find that, for every restricted L-module IM, the following 
sequence is exact: 


(0) > H,(L, M) > M) > 8(L, M*) 
H,2(L, M) > H?(L, M) > S(L, H*(L, M)). 


We shall compare this sequence with that obtained at the end of section 
1, which involved the relative cohomology groups H"(C(1)/C°(M)). An 
element a ¢e H1(C(M)/C°(M)) is represented by a cochain f ¢ C(1) such that 
sfeC°(M). Thus f is a linear map of R;* into M such that u- f(v) — f(ur) 
depends only on the cosets mod. P; of u and v. Now define a p-semilinear 
map g of L into M by setting g(x) = f(#?—vz!!). It is seen immediately 
that g does not depend on our particular choice of the representative f, but 
only ona. Furthermore, a direct verification shows that ge S(L, M"). Hence 
the correspondence a — g is actually an F-linear map of H'(C(M)/C°(M)) 
into S(L, M“). We shall denote this map by y;. Similarly, we shall define 
an F-linear map y2 of H?(C(M)/C°(M)) into S(L,H*(L,M)). In fact, 
an element ae H?(C(M)/C°(M)) is represented by a cochain fe C(M) such 
that 8feC°(M), i.e. by a bilinear map f from R;* to M such that 
u-f(v,w) —f(uv,w) + f(u, vw) depends only on the cosets mod. Pz of u, v, 
and w. For we ZL, we now set =f(z?— u) —f(u, 2? — 
Then, as we know already from Theorem 3.1, the map *—g, induces a 
p-semilinear map of Z into H'(Z,M). Furthermore, it is seen immediately 
that the cohomology class of g, does not depend on our particular choice of 
the representative f, but only on a. If we denote this by y2(a), we have the 
desired F-linear map y, of H?(C(M)/C0°(M)) into S(L, H1(L,M)). Now 
let us consider the two exact sequences with the connecting maps y;, y2, and 
identity maps, as shown: 


H,\(L, M) H*(L, M) > H*(C(M)/0°(M)) 


(L, M*) 
H,2(L, M) H2(L, M) > H2(0(M)/0°(M)) 
Y2 
H,2(L, M) H2(L, M) 8(L, H*(L, M)). 


It can be verified from our definitions of these maps that this diagram 
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is commutative, and a familiar argument shows that y, is an isomorphism 
onto and that y, maps the image of H?(L,M) in H?(C(M)/C°(M)) iso- 
morphically onto the image of H?(L,.M/) in S(L, H*(L, M)). 

This suggests that the bottom sequence may be continued, preserving 
exactness, with a map S(L, M)) H,'(L,M). Such a continuation 
would be of interest in connection with the interpretations of the 3-dimensiona! 
cohomology groups with which we intend to deal in a later paper, but to 
which the next section is already an introduction. 

Finally, we observe that it should be possible to prove these cohomology 
results, without making any use of the interpretations of the cohomology 
groups, by filtering the associative cochain complex for R,* in M relative to 
the ideal P;, and considering the spectral sequence which arises from such a 
filtration. This procedure would amount to a slight modification of that of 


Kung-sing Shih, in [7]. 


4. Non-abelian kernels. In the case of groups,.it is well known that 
the enumeration of the extensions of an arbitrary kernel can be reduced to 
that of the extensions of the center of that kernel.1° The analogue of this 
reduction, for the case of associative algebras, is contained in [4], and it is 
clear that the case of Lie algebras can be dealt with in the same way. In the 
ease of restricted Lie algebras, one has to augment the ordinary procedure 
by the appropriate considerations relating to the p-maps. We shall briefly 
indicate the resulting theory for this case. The case of ordinary Lie algebras 
will then be contained implicitly in our discussion; one merely omits every- 
thing which relates to the p-maps. ; 

We begin by recalling some of Jacobson’s results on derivations of 
restricted Lie algebras. A derivation ¢ of the restricted Lie algebra J is 
said to be a restricted derivation if £(m'!) = D,,?-*(€(m)), for every me MV. 
Observe that every derivation ¢ of M can be extended uniquely to a derivation 
¢* of Ry which annihilates the base field F. One sees immediately that ¢ 
is a restricted derivation of M if and only if £* annihilates the elements of 
the form m?—m!*!, From this it is evident that if ¢ and 7 are restricted 
derivations of so are & and [f,7] Hence the space 
consisting of the restricted derivations of J/, with the usual commutation, 
and the p-map ¢— ¢'! == &, constitute a restricted Lie algebra. 

Now let Z be another restricted Lie algebra, and consider a restricted 
extension (£,¢) of M by ZL. Choose a linear map p of LZ into EF which 1s 


19 See [3], which also gives an account of similar results due to R. Baer. 
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inverse to @. With each xe L, associate the derivation £ of M which is effected 
by the element p(x) of ZF, i.e., (m) =[p(a),m]. Condition (1) on the 
p-map in & implies that ¢ is then a restricted derivation of M. Let I(J) 
denote the set of the inner derivations of Jf. It is immediate that [(J/) is a 
restricted ideal of Do(J/), in the sense that J(M)™! CI(M). The above 
correspondence induces a homomorphism of L into Do(M)/I(M). 
Furthermore, it is easy to verify that this homomorphism is a restricted 
homomorphism of Z into the restricted Lie algebra D)(M)/I(M). Finally, 
this homomorphism evidently depends only on the equivalence class of the 
restricted extension ¢). 

We suppose that we are given a restricted Lie algebra M and a restricted 
homomorphism of / into the restricted Lie algebra D)(M)/I(M). We shall 
refer to this structure by saying that we are given a restricted L-kernel J. 
If N denotes the center of J/, then N is a restricted abelian Lie algebra on 
which Z operates, in the sense defined in the previous section. Clearly, it 
means the same if we refer to this structure by saying that we have a 
restricted abelian J-kernel. We shall call this the nucleus of the restricted 
L-kernel M. 

In accordance with our earlier notation, we shall denote by ext, (M/, L) 
the set of equivalence classes of those restricted extensions of M by L which 
induce the given L-kernel structure on M. The chief tool for our reduction 
of ext, (47, L) to ext,,(V, 1), where N is the nucleus of M, is a composition 
which associates with an element we ext,,(/, LZ), and an element ve ext,(N, L), 
an element u + veext,,(J/, LZ). As the notation suggests, this is an evident 
extension of the addition in ext,(N,L). We shall describe this composition 
briefly. 


Let (£,¢) be a restricted extension of M by ZL which belongs to the 


| equivalence class u, and let (7,7) be a restricted extension of N by LZ which 


belongs to v. Let D denote the subalgebra of the direct sum of # and T 
which consists of all elements (e,¢) in which ¢(e¢) =7(t). Let J be the 
ideal of D consisting of the elements (n,-—n), where ne N. In D/J, we 
identify (M.N)/J with M by using the homomorphism (m,n) > m -+ n, 


| whose kernel is exactly /. The map (e,¢) —(e) induces a homomorphism 
¥of D/J onto L whose kernel is 14. The map (e, t) (e!, induces 


i p-map in D/J, because it sends J into itself. With this p-map, (D/J, yp) 
constitutes a restricted extension of M by LZ whose equivalence class is easily 


Seen to depend only on wu and v, and which is the desired composite wu -+ v. 


It is easy to verify that this composition has the properties suggested by the 
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notation. In particular, and Also, 
if w+ v=—u, it follows that v= 0. 
The reduction theorem can now be expressed as follows: 


THEOREM 4.1. Let L bea restricted Lie algebra and let M be a restricted 
L-kernel. Let u be any element of ext,,(M, lL), and let N denote the nucleus 
of M. Then the map v>u+v of ext,(N,L) inte ext, (M, L) ws 1-1 and 
sends ext,(N,L) onto ext, (M,Z). 


Proof. We shall prove only the second statement in detail. Let (£, ¢) 
be a restricted extension of 1 by Z which belongs to the given equivalence 
class u. Let #* be the restricted Lie algebra which is obtained from F# simply 
by changing the sign in the commutation. More precisely, we have an anti- 
isomorphism e— e* such that [e, f]* [e*,f*] and = (elel)*, 
Let (£;,¢:) be any restricted extension of M by ZL which induces the given 
[-kernel structure in M. Let D denote the subalgebra of the direct sum of F, 
and L* which consists of those elements (— e, e*) in which ¢:(e:) = ¢(e) 
and also [e,,m] = [e, m], for allmeM. D is a restricted Lie algebra (with 
the p-map defined component-wise, as before) and the map (— ée:, e*) > — ¢(e) 
is a restricted homomorphism of D into L. Using that (£;,¢,) and (E£, ¢) 
induce the same L-kernel structure on M, it is easy to see that this homo- 
morphism actually maps D onto L. The kernel K of this homomorphism 
consists of all elements of the form (—m, (m-+ n)*), where me and 
neN. Let J be the ideal of D consisting of the elements (— m, m*), with 
meM. In D/J, we identify K/J with N by means of the homomorphism 
(— m, (m + n)*) —>1n whose kernel is precisely J. Since the p-map in D 
sends J into itself, D/J is a restricted Lie algebra. If 7+ is the restricted 
homomorphism of D/J onto Z which is induced by the above map (— ¢;, e*) 
— — ¢(e), then (D/J,7) is a restricted extension of NV by LZ which represents 
an element ve ext,,(N, LZ). 

We claim that the equivalence class of (£1, ¢,) is the composite u + v. 
In order to see this, we must exhibit an equivalence isomorphism of the 
composite (G,y), say, of (Z,¢) with (D/J,7) onto (F,,¢:). An element 
g © G is represented by a triple (e’, — e,, e*), where ¢$,(e,) = o(e) = — o(e’). 
Hence e+ e’e M, and we can form (e+ e’) —e, in F;. It is easily seen 
that this element of #, does not depend on the particular choice of the repre- 
sentative triple, but only on g. Hence we may define a(g) = (e+ e’) —é¢: 
Clearly, we have ¢,;2—-y. An element me WM is represented by the triple 
(m, 0,0), whence it is clear that a leaves the elements of M fixed. Evidently, 
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a maps @ linearly onto /;. If «(g) =0 we have e, = (e + e’) e M, whence 
also ee M and e’e M. Hence ge M, and therefore g =a(g) =0. Hence « 
is 1-1. Now let g, and g, be two arbitrary elements of @ and choose repre- 
sentative triples (e’,—e,,e*) and (f’,—fi,f*) for g, and go, respectively. 
Then [9;, 92] is represented by the triple ([e’, f’], [e:, f:], — [e, f]*). Hence 
= f’] —[e.f]) + [e,f:]. On the other hand, 


1 [(e + — (fF +f) —fi] 
=[(¢+¢) &) —4, fil. 


Using that e and e,, and f and f,, effect the same derivations in M, this can 
be written 


+ Ley fil 
= (Le, —[e fl) + Le, fr] = 92])- 


Hence & is an isomorphism of G onto #,. Finally, we have 


(a(g))'! == (e + — + of(e + 


where o is the map which occurs in condition (3) on p-maps. We know that 
o(e e,) is the formal integral of the polynomial 


D (ese'yt of *(e e’). 


Since e and e, effect the same derivation in M, we may here replace e, by e. 
It follows that 


a(e + e’,—e,) =a(e+ —e) = (e”) (e e”) 


Hence (a(g))@! = ((e’)! + el”}) —e,”) This shows that « is 
an equivalence isomorphism, and Theorem 4.1 is proved. 

As in the case of groups, [3], and associative algebras, [4], the study of 
L-kernels M leads to 3-dimensional cohomology. The procedure analogous to 
that of [3] and [4] leads to 3-cocycles of the Lie type for LZ in the nucleus 
N of M. In order to extend the technique used above, it would be necessary 
to establish a direct connection between L-kernels and 3-cocycles of the 
associative type. There are a number of unsolved problems in this direction. 
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NORMAL COMPLETIONS OF INCIDENCE MATRICES.* 


By and H. J. Ryser. 


1. Introduction. In this paper we consider the problem of arranging v 
elements into v sets, such that each set contains exactly & distinct elements 
and such that every pair of sets has exactly X= k(k —1)/(v—1) elements 
incommon (0<A<k<v). The significance of the v,k, configuration 
is described in detail in [3]. In particular for v = N?+N+1,k=N-+1, 
NV => 2, and A = 1, the problem is equivalent to determining a projective plane 
with V + 1 points ona line. For v= 4N —1,k=2N —1,andA—= WN —1, 
the problem reduces to a determination of a Hadamard matrix of order 4N. 
Finally, we remark that the general v,’,A configuration arises in a natural 
way in statistics, where the arrangements are referred to as symmetrical 
balanced incomplete block designs. 

Let the elements 2,,- ~:~, 2, be listed in a row and let the setss,,- - -, S» 
be listed in a column. We form the incidence matrix A of the configuration 
in the usual way, by inserting 1 in row 7 and column 7 if 2; belongs to 5; 
and 0 otherwise. The matrix A is of order v and is composed entirely of 0’s 
and 1’s. Clearly A satisfies the matric equation AAT B. Here A™ denotes 
the transpose of A, and B is the matrix with & in the main diagonal and A 
in all other positions. For incidence matrices, one readily proves that 
ATA = B, so that the incidence matrix is normal. 

It is evident that the matrix B is congruent to the identity, relative to 
the field of rational numbers. Bruck and Ryser have utilized this fact for 
the establishment of the theorem on the nonexistence of finite projective 
planes [2]. Later this result was extended by Chowla and Ryser to the 
general v, hk, A configuration, and the proof of the nonexistence theorem was 
made independent of the Minkowski-Hasse theory of the invariants of a 
quadratic form [3]. Shrikhande, basing his investigations directly on [2], 
independently derived the result on the nonexistence of v,k, A configurations 
[8]. Further results, with emphasis on the cyclic case, may be found in 
Hall and Ryser [4]. In summary, a necessary and sufficient condition in 
order that the matrix B of even order v be congruent to the identity is that 
k—} be a square. For v odd, the condition is that the norm residue symbols 
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satisfy (kK —A,—1),"/* (kK —A,v)p = + 1 for every odd prime p. The 
latter condition is equivalent to asserting that the Diophantine equation 
(k—dA)y? + (—1 possesses a nonzero solution in integers. 

For the case of the non-symmetrie block design, however, the condition 
for rational solvability is usually vacuous. But Connor [5] has given certain 
conditions which must be satisfied if a given initial set of blocks can be 
completed to a design. Connor’s conditions may be interpreted as the require- 
ment that a certain quadratic form may not be indefinite. Connor's con- 
ditions do not yield results for symmetric designs. 

In an effort to obtain results similar to Connor’s for symmetric designs, 
the authors of this paper found entirely contrary results, which are given here. 
The rational conditions for solvability being satisfied, there exists (a) a 
normal and rational matrix satisfying the incidence equation and (b) any 
set of initial rows satisfying the trivially necessary conditions can be rationally 
completed to a normal solution of the incidence equation. Of these results, 
the first generalizes a theorem of Albert [1] and the second is in strong 
contrast to the results of Connor. 

2. Matrix preliminaries. [Let 1. 3. C be symmetric matrices with 
elements in a field F of characteristic 42. Suppose that C®BAcC OB, 
where the notation © denotes direct sum and ¢ denotes congruence in the 
sense that there exists a nonsingular X with elements in F’ such that 
AT(C@A)X=CO@B. Then Witt’s theorem [9] asserts that AcB. 
Theorem 2.1 which follows is closely related to a theorem of G. Pall [6], 
and will be derived here by the use of Witt’s theorem. 


THEOREM 2.1. Suppose that AAT=D,@Dz,. Here the matrix A ts 
of order n and nonsingular. The matrix D, is of order r and D, is of order 8, 
wherer+s=n. Let X be an arbitrary r X n matrix such that Y¥XT = Dy 
Then there exists an n X n matrix Z having X as tts first r rows such that 
ZZ* =D, @ Dz». This result holds for all fields F of characteristic + 2. 


The matrix X is of rank r since D, is of rank r. Thus we may take an 
s Xn matrix W of rank s whose rows are a basis for the space annihilated 


by X. Then letting 
4 


we have YYT=D, @D;. We assert that Y is nonsingular. There exists 
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a nonsingular ( such that CD,C™’ =D, where D = diag[d,,- - -,d,], with 
d,~0. Thus 


OF Vistar wa 


Let the row vectors of 


W 
be denoted by @,°--,¢,81,°°*,;Bs We assert that 7 is nonsingular. 


For if a,a,+:--+a,c,+0,8,+--:--+0;8,;—0, then since the inner 
product ;) = with dj; 40 and the Kronecker delta, it follows 
that a, =: --=—a,—0. But since the f’s are linearly independent, we 
have b; =: --==b,—0. Thus the determinant | D; | 40 so that Y is non- 
singular. By Witt’s theorem, there is a nonsingular # such that ED,H™T = D,. 


But then with 
2—( 


0 ED,ET 0 Dot 


we have 


THEOREM 2.2. Let r= and let y= de 
vectors such that 


Then there exists an orthogonal matrix O such that cO=y. This result 
holds for all fields F of characteristic $ 2. 


There exists an (2 —1) X n matrix W of rank n—1 such that 


X satisfies XX 


Since c ~ 0, it follows as in the proof of Theorem 2. 1, that X is nonsingular. 
By Theorem 2.1, there then exists a matrix 


Evidently the matrix 0 = X-'Y is orthogonal and we may write 
Thus it follows that cO = y. 


| 
1 
y 
y 
| 
y 
> 
| 
| 
> 
| 
| 
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Theorem 2. 2 has to do with the preservation of length by rotations. This 
is well known for the field of reals, but, in particular for the rational field 
to which we wish to apply this result, it is not obvious (without Theorem 2. 2) 
that we can find a rational rotation carrying one vector into another of the 


same length. 


3. Incidence matrices. We return now to the topic of incidence 
matrices, and in the following, B always denotes the v X v matrix with & 
in the main diagonal and A= k(k—1)/(v—1) im all other positions. 


THEOREM 3.1. Suppose that there is a rational C such that C™C = B. 
Suppose, moreover, that CS =kS, where 8 is the matrix of all ones. Then 
CCT = B. 

For by hypothesis, 

(3. 1) CTO = B= (k—A)I+A8 

so that 

(3. 2) CTOS = (k —A)S + 

Thus 

(3.3) CTS = [(k —A) + Av] 
and, transposing, we have 

(3. 4) 

Hence also 

(3. 5) SC-! == k-8. 

From (3.1) and (3.5), we have 


(3. 6) (k — = CT — 


and multiplying (3.6) on the left by C gives 


(3. 7) (k — aA)I = CCT — 
whence 
(3. 8) CCT = B. 


The following is a generalization of a theorem of Albert [1]. 


THEOREM 3.2. Suppose that the matria B is congruent to the identity, 
relative to the field of rational numbers. Then there exists a rational C 
such that CC’ =C™C =B. 


B 


584 
I 
( 
( 
( 
( 
( 
th 
T 
su 
the 
C01 
(3 
of 
(3. 


NORMAL COMPLETIONS OF INCIDENCE MATRICES. 


By hypothesis, there exists a rational A such that 
(3. 9) ATA =B. 


Let S again denote the matrix of all ones, and let 7; denote the sum of the 
ith row of A. Then 


(3. 10) SATAS = SBS = k?vS8 
so that, calculating the element in the (1,1) position on the right, we have 
(3. 11) rete 


Hence by Theorem 2. 2, there exists an orthogonal O such that 


k 
(3. 12) of 
k 


Consider the matrix 


(3. 13) C = OA. 
Evidently, 
(3. 14) CTC == ATOTOA = B. 


Moreover, by (3.12), we have 


(3. 15) : 


By Theorem 3.1, this proves normality. 


We come now to our main result. 


THEOREM 3.3. Suppose that the matrix B is rationally congruent to 
the identity. Let A, be an r X v matrix of 0’s and 1’s such that AA, = B,, 
where B, is the r X r matrix with k in the main diagonal and x elsewhere. 
Then there exists a v Xv rational matrix A having A, as tts first r rows 
such that AAT = ATA = B. 


Concerning Theorem 3.3, we remark that if we are given r rows of a 
v,k,X configuration, then a trivial condition necessary for completion is that 
their pairwise intersections equal A and their sums equal /. This is the 
condition 
(3. 16) A,A,'=B, 
of the theorem. 

The rational matrix C of Theorem 3.2 satisfies 


(3.17. 1) ct? 


585 
| 


586 MARSHALL HALL AND H. J. RYSER. 
and 
(3.17. 2) CS = SC =kS. 
Consider the r X v matrix A,C“. This matrix satisfies 
(A,C-*) (A,C-1)T = A,C7C1TA,T — A,BA,T 
(3. 18) 
= A,[(k — — Ak?*8)] Ai? = (k — — A(k —A)"8,, 


where S, is the r X r matrix of all ones. Here we have used not only (3. 16), 
but also the fact that the rows of A, sum to k. Thus 


(3. 19) (A,C-1) (A,0-2)T = 1, 


where J is the r < r identity. Hence by Theorem 2. 1, there exists a rational 
Q such that 


(3. 20) ee 


is orthogonal. But this implies that 


(3. 21) [C-TA,T, QT] ce 
or 
(3. 22) [A,T, OTQT] —C™C=—B. 


Let 7; denote the sum of the i-th row of QC. We assert that 
(3. 23) r?+---+r,? = k’s, 


where s=v—vr. For we have 


(3. 24) A,TA, + (QC)™QC = B, 
whence 
(3. 25) SA,7A,S + 8(QC)TQCS = 


and as A, has r rows summing to k, 


(3. 26) S(QC)T(QC)S = k?(v —1r)8, 
proving (3. 23). 
By Theorem 2. 2, there exists an s X s orthogonal matrix O such that 


(3. 27) O | ; | = ke, 
Ts 
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where «, is the column vector of s components all equal to one. Moreover, 


Ts 


where « is the column vector of v components all equal to one. Hence it 
follows that 
(3. 29) (OQC = ke. 


We now define 


(3.30) 
Evidently, 

(3. 31) ATA = B. 
But also 

(3. 32) Ae = ke, 


and the normality of A follows by Theorem 3. 1. 


4, Integral solutions. In this section we consider the solutions of the 
matric equation AAT = B, where A is integral. We remark in passing that 
it is easy to show that if elso ATA = B, then A is composed entirely of 0’s 
and 1’s or entirely of 0’s and-—1’s [7]. 

Suppose now that AA™ = B, where A is integral. In this section we 
suppose further that & and k — are always relatively prime. Then if k—x2 
is odd, it has been shown that upon suitable multiplication of the columns 
of A by +1, A is transformed into an incidence matrix composed entirely 
of 0’s and 1’s [7%]. The case of k—X even, however, presents additional 
difficulties. We confine ourselves for the most part to the case of the pro- 
jective plane, with —A—WN and v=N?+N-+1. 

Let s; denote the sum of the i-th column of A and let ¢;; denote the 
inner product of the i-th and j-th columns of A. We agree to write A so 
that we always have s;=0. In [7] it was shown that s;==0 (modk) and 
if we write s;— uk, it was also shown that u,?+---+u,2?—v and 
== Auju; + (k —A)8;;, where 8 is the Kronecker delta. 

For the projective plane case, the integral solutions of AAT = B are of 
two types. A solution of type J is defined by the conditions that u;— 1 for 
each i. A solution of type IJ is defined by the condition that uj, 1 for 
some 1. For type J, we have ;—=N-+1 and tj; —N-+41 for all It 
follows easily that solutions of type I are incidence matrices. 
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For solutions of type JJ, we cannot have u; > N because u,’ + u,’ 
=N?4+N+4+1. Since and u?—(N+1)u+N=0, 
we must have some u;, say U,, such that u;— N. But then there are N + 1 u; 
such that u;—1 and N?—14, such that u;—0. Then s, = N?-+ WN and 
t:, = N?+N. The first column of A thus has a single 0, and all other 
entries +1. For convenience, the 0 may be placed in the (1,1) position 
of A. In addition, there are N 4-1 columns with s;=%—N-+1. These 
columns have N+ 1 ones, and all other entries 0. For these columns 
t:;— N-1—WN, whence one of the 1’s is in the first row. Thus the first 
row consists of N + 1 1’s and other entries 0. 

Upon deleting the first row and column of A, we obtain a matrix C of 
order N? + N such that CC’ =CTC = NI. There are N + 1 columns of ( 
composed entirely of 0’s and 1’s. Conversely, if these reguirements are ful- 
filled, then it is easy to construct the integral solution A of type IJ. By the 
results of § 2, it is clear that it is always possible to find rational matrices 
fulfilling these requirements, provided, of course, that NJ is rationally 
congruent to the identity. | 

It is much more difficult, however, to determine those values of N for 
which there are integral solutions of type JJ. We conjecture that such solu- 
tions exist whenever WN is even, provided only that B is rationally congruent 
to the identity. To substantiate this conjecture, integral solutions are known 
to exist whenever WN is the order of a Hadamard matrix [7]. This, however, 
restricts NV to the value of 2 or to N==0 (mod 4). In eonclusion, we exhibit 
an integral solution for VN 10. Let 


— —1 —! l 1) 
l 1 —2 l l l 1 0 0 0 
1 0 1 0 0 
l —1 0 0 1 0 
l 1 1—l —1 2 0 0 0 l 
1 l 0 0 0 —1 —1 
1 —1 0 ] 0 —2 1 
1 —1 0 0 i —2 l 
0 0 0 1 —1 1 


The matrix XY is such that YX¥T—10J7. The matrix C may be formed from 
X by taking a direct sum of Y with itself eleven times. From C the desired 
A of order 111 is obtained by adjoining the suitable row and column. 
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ON THE 1-DIMENSION AND THE 4-DIMENSION 
OF PARTIALLY ORDERED SETS.* 


By SreyMour GINSBURG. 


Let S be an abstract set and K a collection of linear orders defined on 8. 
Define a partial order P on S as follows. For any two elements p and q in § 
put p< q in P if and only if p < q for each linear order in K. The partial 
order P so obtained is said to be realized by the linear orders in K. By the 
dimension of a partial order P on a set S is meant the smallest cardinal 
number m such that P is realized by m linear orders. 

The above definition of ‘dimension of a partially ordered set’ was first 
stated by Dushnik and Miller in [2]. They showed that each partial order 
has a dimension. Other papers pertaining to this definition of dimension are 
[1], [8], [6], and [8]. In [6] Komm defined the A-dimension of a partial 
order P on a set S as the minimum number of linear extension of P? each 
of which maps S into the real numbers. He then calculated the A-dimension 
of several partial orders. The purpose of this note is twofold: 


(1) To make some remarks on A-dimension. In particular, the )- 
dimension of the partially ordered set consisting of sequences of real numbers, 
ordered as follows, is shown to be No: (41.° S 
if and only if z, S y» for all n. 


(2) To discuss a new kind of dimension, called A-dimension. A non- 
trivial example is given to show that A-dimension and A-dimension need not 
be equal, even when both dimensions exist. 


1. d-dimension. For each positive integer n denote by E,, the set of 
n-tuples By, = {(21,° a real number}. Denote by the 
set = {(2:,- a real number}. Let P(£,) be the partial 
order on which is defined as follows: (21,- - < (y:,° Yn) if and 
only if 2; < y; for each 7. Let P(E) be the partial order on EF which is 


* Received July 24, 1953. 

1 The linear order ZL on a set § is a linear extension of a partial order P if, for any 
two elements p and gq such that p<q in P, then p<q in L. If L is similar to a subset 
of the real numbers, then Z is called a d-extension of P. 
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defined as follows: if and only if 
vi < yi for each 7. Let P’(#,) and P’(#,) be the partial orders on EF, and 
E, respectively which are defined as P(F,) and P(E.) except that “aj< y; 
for each 1” is replaced by “a; y; for each 1, and for some 7,2; < yj.” 
Let M be a subset of HL, or H,. By P(M) is meant the partial order P(£,) 
or P(E.) acting on M, and by P’(M) is meant P’(F,) or P’(£.) on M, 
according as M is a subset of F, or Ey. 

In [6] Komm demonstrated that the A-dimension of P’(En) is No. 
Komm’s method however cannot be extended to calculate the A-dimension of 
P’(E.~). In Theorem 1.1 we shall show, by a procedure which is also 
capable of calculating the A-dimension of P’(/,), that the A-dimension of 
is No. 

Denote by J the open interval (0,1) of the real numbers. Denote by 
Je the set Ja == *)| each 2, €J}. 


LemMMA 1.1. There exists a mapping y of J into P’(E.) such that 


(a) 2) = 21, where p(x) = the being 
positive integers, 
ifx<yin the real number system, then (x) < P’(Ee), 


(y) wis one to one. 


Proof. Kach element of J can be written uniquely as « = > u;/2*, where 
i=1 


each uj == 0 or 1, and the wu; are not ultimately all 1. Let 2 =u, + 1 and, 
in general, let 2) == 22 + u,,, +1. Let p(x) = -,2,---). 
There is no difficulty in verifying that the function y satisfies the conclusions 
of the lemma. 


THEOREM 1.1. The A-dimension of is 


Proof. Since J is similar to the real numbers it follows that the two 
partially ordered sets, P’(J,) and P’(#,) are isomorphic. Now two 
isomorphic partially ordered sets evidently have the same A-dimension. 
Consequently, in order to demonstrate the theorem, it is sufficient to show 
that the A-dimension of P’(J,) is No. 

(a) We shall now construct the auxiliary functions, f, g, h, and for 1 
and 7 any positive integers such that i < 7, the function pj. In (b) these 
functions will play an important role in calculating the A-dimension of P’(Jq). 


To define f we first decompose the set of positive integers into a family 


] 

and 
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{W,/1 Sn <} of pairwise disjoint sets, each set W, being infinite. For 
each n denote by f, the similarity transformation which maps the positive 
integers onto the set W,, where W, is ordered according to the natural order 


of the integers in it. To each element x = -,2n,° +) of J» we now 
associate an element f(z) = *,@n,° of in the following manner. 
For each number 2,» let = (@%n,° >, +) where y is the fune- 


tion defined in Lemma 1.1. Define ay;,(;) to be the element 2,. The sets 
W, and the functions f, were constructed in such a manner that for each 
positive integer k, there exists one and only one pair of integers (n,%) for 
which k =f,(1). Therefore the element f(x) = *,@n.° is well 
defined. From (a), (8), and (y) of Lemma 1. 1, and from the fact that each 
fn is a similarity transformation onto W,, we see that the mapping f possesses 
the following four properties: 


(a) For each z in where f(z) = (@1,° °°). 
(8) If*<y in P’(Ja), then f(x) < f(y) in 
(y) f is a one to one function. : 


(5) P’(J,) and P’(f(Jo)) are isomorphic partially ordered sets under 
the correspondence which takes x into f(z). 


We are now going to associate with each element y = (@,° 
of the set f(J.) a complex g(y) = *) of 0’s and 4’s. Each 
2 shall be referred to as a coordinate. The first four coordinates of g(y) 


shall be denoted as the first set of coordinates. Fori>1,the S} 24+? + 1-th 
to the S 2/*!-th, inclusive, coordinates of g(y) shall be denoted as the 1-th 


1SjSi 
set of coordinates. The coordinates of the i-th set of coordinates, 1 = 1, are 
defined as follows. Each of the first a; of the coordinates is to be a 4. Each 
of the remaining coordinates is to be a 0. Note that there are 2**? coordinates 
in the i-th set of coordinates, and that a; 2***. The function g has the 


following properties: 


(B’) P’(J.), then gf(x) < gf(y) in P’(Eq). 
(y’) g isa one to one function. 


(8°) P’(f(Jx)) and P’(gf(J«)) are isomorphic partially ordered sets 
under the correspondence which takes f(x) into gf(z). 


We now define the mapping h. For each complex z = (41,° 


of 0’s and 4’s let h(z) be the real number h(z) =} 2n/10". The mapping 
h has the following properties: — 
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(B’) Ifiz<2 in then h(z) < h(2’) in the real number system. 
and z+ 2’, then if i is the first integer for which 77, and 
2, then h(z) < h(2). 

(y”) his a one to one function. 


Now let i and j, i < j, be any two positive integers. To each complex 
a= °°) of 0’s and 4’s, let piy(a) = b = +, 
b, being defined as follows. Let s be an index for a coordinate which is in 
the m-th set of coordinates of a. If m<i or m>j let b, =a. The 
remaining coordinates of b are to consist of a permutation of the i-th through 
the j-th set of coordinates of a, the coordinates in each set of coordinates 
retaining their order in that set of coordinates, under the permutation. The 
permutation is defined in the following manner. If a=8,S2.---Snr°--, 
where S, is the n-th set of coordinates of a, then 


(b) We now evaluate the A-dimension of P’(J.). In [6] Komm showed 
that the A-dimension of P’(L,) is No. Since #, contains a subset which is 
isomorphic to #,, the A-dimension of P’(J.) is at least 8). To complete the 
proof we shall exhibit S, A-extensions of P’(J.) which realize P’(J). 

Consider the function k —hgf which maps J, into the real numbers. 
In view of (y), (y’), and (y’), & is one to one. Let r< a’ in P*(JQ). 
From (f”’) and (8”) it follows that k(x) < k(2’) in the real number system. 
Consequently the function & is a A-extension of P’(J.). In a similar manner 
we see that each of the functions ki; = hpigf is a A-extension of P’(J.). In 
order to show that the functions & and kj; realize P’(J.), it is sufficient to 
show that for each pair of incomparable elements x and y in P’(Jq), there 
exist two of the functions, say o and +, such that o(z) < o(y) and r(y) < 7(@). 

To this end suppose that « and y are two incomparable elements of 
By (8) f(z) — - -) and f(y) - *) 
are two incomparable elements of P’(/,,). Let 7 be the first integer for 
which a; b;, say a; < 5; Since f(a) and f(y) are incomparable elements 
in P’(E,), there exists an integer j such that b; << a;. For s <i the s-th 
set of coordinates of gf (a) is identical with the s-th set of coordinates of gf (y). 
Since a; < 6;, the first b; coordinates in the i-th set of coordinates of gf(y) 
are 4’s, whereas the first 0; coordinates in the i-th set of coordinates of gf (2) 
are a; 4’s and (b;—a;) 0’s. This, in conjunction with (8”) implies that 
k(x) =hgf(x) < hgf(y) =k(y). Now consider the two elements k(x) 
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and kij(y). The first a; coordinates of pigf(x) which follow the } 2**-th 


coordinates are all 4’s. The first a; coordinates of pygf(y) which follow 


the > 2¢+?-th coordinate are, since b; < aj, (a;—0;) 4’s and 6; 0’s. For 
1St<i 
any positive integer s< > 2, if c, and dy are the s-th coordinates of 
1St<i 


pugf(x) and pigf(y) respectively, then c,;—d,;. Consequently 
kis(y) = hpugl(y) < hpugf(x) = ky (2). 
Thus the set of A-extensions of P’(J~), {k} U {kij/t < j} realize P’(Jq). 
q. d. 
Remark. The proof of Theorem 1.1 depended on the construction of &, 
A-extensions of P’(J,). In Ruziewicz defined a function $(z) and a 
sequence of constants {¢n}n<. such that the mapping o which takes an element 


*,%n,° *) of into the real number > is a A-extension of 
4=1 


P’(J.). There does not appear to be any way of using Ruziewicz’s result 
to compute the A-dimension of P’(Jq). 

Consider the A-dimension of P(£,,). Evidently we may confine ourselves 
to P(J.). Nowifa<yin P(J,), then x < yin P’(J~). This implies that 
the function &, which was defined in the proof of Theorem 1,1, is a A-exten- 
sion of P(J,). In [6] it was shown that if a partial order P on a set £, 
of power m, has a A-extension, then the A-dimension of P exists and is = 2m. 
Consequently the A-dimension of P(J,) exists and is S 2%. Now P(E,) is 
of A-dimension 2% by 5.31 of [6]. Since P(/,) contains a set which is 
isomorphic to theA-dimension of P(#,) is 2 2%.” As a consequence 
we obtain 


THEOREM 1.2. The A-dimension of ts 2%, 


We conclude this section with some observations in relation to [6]. 
On p. 517 of [6] there appears the following: 


(*) “Let M be any subset of F, which, except for a denumerable 
number of verticals? has at most one point in common with any vertical. 
Then the A-dimension of P(M) and P’(M) is at most n.” 


For n > 2 this result is not true as is now shown. Let M be the set of triplets 
M = { (41, 3) = 1, = 1} U { (21, Le, 2g) | 3/2, = 3/2}. 


“2A set V in EH, is a vertical, parallel to the a, axis, if, for any two elements 
for j 41. 
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Clearly M satisfies the hypotheses of (*). For each real number & let 
I, = (1,1,%) and u, = (3/2, 3/2,h). The arguments given in 6. 2, 6.3, 
and 6.4 of [6] can easily be applied in the present situation to prove that 
the A-dimension of P(J/) and the A-dimension of P’(M) are both infinite. 

An inspection of the proof of 6.1 reveals that the argument breaks down 
at that point when, after projecting M into 2; axis, the claim is made that 
N;, is similar to a subset of the real numbers. This is not necessarily true. 
Now 6.1 can be amended slightly to obtain a correct result. By a vertical 
plane in F,, perpendicular to the a; axis, shall be meant the set of those 
points (2,,° whose coordinate is k, where & is a constant. 
Then 6.1 becomes true if the word ‘ vertical” is replaced by the words 
‘vertical planes.’ In other words 


“Let M be a subset of Z, which, except for a denumerable set of vertical 
planes, has at most one point in common with any vertical plane. Then the 
\-dimension of P’(M) and P(M) are both at most n.” 


The proof given for 6.1 then suffices with the following changes. Let 


For P’(M), Gx is to be mapped into the interval v associated with the point 
k by the function fj, where fix is a A-extension of Gj, considered in P’(Gix). 
For P(M), Giz is to be mapped into v by the function gixfix, Where gix is an 
order reversing mapping of v into itself, i. e., if ¢ << y in v, then gix(y) < gix(£) 
in v. 
In connection with the preceding the following generalization of the 
statement made on lines 20 and 21 of p. 518 of [6] is easily shown. 


“Let M be any subset of #, which has on each of a non-denumerable set 
of vertical planes at least two comparable points (comparable when considered 
in the vertical plane). Then the A-dimension of P(M) and of P’(J/) are 
hoth infinite.” 


If ‘comparable’ is changed to ‘incomparable ” then the result is no longer 
true. For example consider the set M—FU G, where 


F { (21, Xs) = 1, 22 = 1} and G { (21, £2, = 3/2, = 1/2}. 


Let L, be the linear ordering of M which occurs by having each element of 
F precede each element of G, and let LZ, be the linear ordering of M which 
occurs by having each element of G precede each element of F. Now ZL, and 
L, together realize both P(M) and P’(M). Thus the dimension of each is two. 
Nevertheless, each vertical plane 23 = intersects M in the two elements. 
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The following two problems are still open: 


(1) Characterize those subsets M of #, for which the A-dimension of 
P(M) and the A-dimension of P’(J/) do not exceed n. 


(2) Can there exist subsets of P(/,) and of P’(/,) which have a finite 


A-dimension > n? 


2. A-dimension. A subset HL, of power 2%, of the real numbers is said 
to have property A if no disjoint subsets of /, of power 2*° each, are similar. 

The linear order Z on S shall be called an A-extension of the partial 
order P on S if LZ is an extension of P which is similar to a set having 
property A. 

By the A-dimension of a partial order P on S§ is meant the minimum 
number of A-extensions of P which realize P. 

There exist partial orders of power 2% which do not have A-dimension. 
A trivial example is the order on the set of real numbers. ‘To obtain a 
condition on P which ensures that it have A-dimension the following lemma 
is needed. 


LemMa 2.1. Let G be an ordered sum G=—=G.+:-:-+ Gn having 
property A. Let H be a subset of the real numbers which is an ordered sum 
H=H,+--:-:+AH),, where the H; satisfy the following condition: There 
exists a permutation f, of the set of integers {i]t Sn} such that G*si3) = H%*i, 
where E* is the order type of E. Then the set H has property A. 


Proof. Suppose the contrary. Let M and WN be two disjoint, similar 
subsets of H of power 2% each. Then for some integer s, H,M M is of 
power 2%, Let Q be a subset of NV which is similar to H,M M. For some 
integer t, H; N Q is of power 2%. Let U be a subset of Gy(s) which is similar to 
H,N@Q and V a subset of Gyz) which is similar to H;N Q. U and V are 
disjoint since H, M and H,/ Q are disjoint. Thus U and V are two dis- 
joint, similar subsets, of power 2* each, of G. But this contradicts the fact 
that G has property A. Consequently no such sets M and N ean exist, i.e., 
H must have property A. q. e. d. 

In 4.2 of [6] it was shown that a necessary and sufficient condition for 
a partial order P on a set H to have A-dimension was for P to have a d- 
extension. Repeating that proof in conjunction with Lemma 2.1 we obtain 


THEOREM 2.1. A necessary and sufficient condition for a partial order P 
defined on a set S to have A-dimension is for P to have an A-extenston. 
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In conclusion we present an example which shows that the A-dimension 
need not equal the A-dimension even when both exist. First, however, we 
state a result which is an immediate consequence of Lemma 3.1 of [5] and 
Lemma 3.1 of [4]. 


(*) The open interval (0,1) of the real numbers is not the union of a 
finite number of sets, each of which either has property A or is of power < 2%, 


A non-empty subset / of the real numbers is said to have property C if, 
for each element p of / and each positive number 4, each of the two open 
intervals (p— p) and (p,p-+ 8), of the real number system contains 
elements of £. 

Let S be a set which has properties A and C. Such sets are known to exist 
(Theorem 4.4 of [5]). For some element 2 of S let M = {u|u < %, we 8} 
and N = {u|u = %,ueS}. The set M has properties A and C and thus 
is similar to a dense subset G of the open interval H = (0,1) of the real 
numbers. Let H = H—G. By Lemma 3.1 of [4] H is a dense subset, 
of power 2%, of #. Define go to be the identity transformation on J£. 
Define g, to be a function which is a similarity transformation of G onto M, 
and which also maps H one to one onto N. The function g, is well defined 
since H and N both are of power 2%. The two functions go and g, realize a 
partial order ? on #. Since g,(#) =S has property A, by Theorem 2. 1, 
P has A-dimension. The A-dimension of P is 2. We shall show that the 
A-dimension of P is infinite. 


Suppose that the A-dimension of P is n + 1, where n is a positive integer. 
Let fo, fn, be m+ 1 real valued functions of such that f;(#) has 
property A, which realize P. We now shall proceed to arrive at a contra- 
diction. Note that for any two elements u and v, if u<v in G@ then 
w<v in P. For each element « in H let S.—{yly<2,yeG} and 
T.={y|y >2,yeG}. Since S, and 7, are subsets of G, fi(S2) < fi(Ta) 
for each in. Two alternatives arise: 


(a) There exists an element x in H so that for no integer 1 is 
fi(S2) < fi(z) < fi(Tc). Now in P, 8S, < 2, and z is incomparable with 
each element of T,. By hypothesis, for each integer i, an element y in T, 
can be found so that fi(yi) < fi(x). The set J = {y;|i <n}, being a subset 
of T,, is simply ordered in P. Denote by w the smallest element in J. 
Clearly f;(w) < fi(x) for each i. Thus w<z in P. But this contradicts 
the fact that 2 is incomparable with each element in T,. 
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(b) For each element x in H, there exists an integer 1(z) such that 
fiz) (Sz) < fiz) (x) fitz) For each i= n let 


i= fi(S2) < < fi(Te)}. 


Notice that B; is similar to f;(B;). Hence B; either has property A or is of 
power < 2%, We conclude that F is the union of a finite family of sets, 
{G} U {B;|i Sn}, each of which has property A or is of power < 2%, This 
contradicts (*). 

Since (a) and (b) are the only possibilities, and each of these cases 
cannot occur, we are forced to conclude that the functions f; cannot exist, 
i. e., the A-dimension of P is infinite. 


THE UNIVERSITY OF MIAMI. 
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INTERPRETATION OF VECTOR COHOMOLOGY GROUPS.* 


By W. H. Cockcrort. 


1. Introduction. In a recent note, J. H. C. Whitehead (see [4]; the 
relevant definitions are repeated in section 2 below) has defined vector 
cohomology groups 9,"(X, ¥;G@) of groups X and Y over an Abelian group 
G. The definition generalises the cohomology groups H"(Y,G) of S. Hilen- 
berg and Saunders Maclane (see [2]), by introducing the group X of operators 
on Y and G. In his note, J. H. C. Whitehead interpreted the groups §,° 
and §,° in terms of group extensions with operators. His results generalise 
those of Kilenberg and Maclane which interpret the ordinary cohomology 
groups H*(¥,G@) and H*(Y,G) in terms of the group of similarity classes 
of Y-kernels (with centre G) and the group of (non-operator) group exten- 
sions of Y by G@ respectively (see [2], paras. 4, 5, 6). 

The object of this note is to show that the other algebraic interpretations 
of ordinary cohomology groups can also be generalised. In part I below 
I will interpret the group §o* in terms of operator automorphisms of group 
extensions with operatovs. The result generalises the well known theorem 
of Baer (see [1], also [2], para. 4) which relates the group H'(Y, @) to the 
group of automorphisms of a group extension of Y by G. 

In part II, I define loops with operators in a group and show that the 
Kilenberg and Maclane theory of loop prolongations which is developed in 
[3] can be generalised to a theory of loop prolongations with operators by 
using the vector cohomology groups. This gives an interpretation of the 
groups §,." for all n = 3, just as the Hilenberg and Maclane theory gives 
an interpretation of the groups H® for all n= 3. 

The result in part I is proved directly. In part II, I use the technique 
of [4], para. 6 to obtain the results indirectly. This technique is outlined 
in section 5 below. 


Part. I. 


2. Vector cohomology groups. For convenience J shall reproduce here 
the definition of the vector cohomology groups of [4]. Let @ be an additive 
Abelian group, which admits a multiplicative group Y as a group of operators. 


* Received October 22, 1952. 
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Let the operators of ye Y and geG give the element y-geG. Further, let 
both Y and G admit a multiplicative group X as a group of operators. Let 
the symbols xg and xy denote the images of ge G and ye Y respectively under 
the operator re X. Let zy: g denote (ry) -g and let x(y-g) = (xy)-(29), 
for all re XY, ye Y and geG. 

The cochain group C"(Y,G@) of Y over G, n= 0, admits X as a group 
of operators from both left and right by the rules: 


where ce C"(Y,G). The group Y operates on C™(X, G), m = 0, according 
to the rule 

where ce C”™(X,G@). It is easily verified that 1c = c and that y(y’c) = (yy)c 
as required. 
I recall that the co-boundary operator 5: C"(W, A) > C"™(W, A), of the 
group of cochains of a multiplicative group W over an additive Abelian group 
A, on which W operates from both left and right, is defined by 


(2.1) (8c)(w1,° War) = Waar) 
i=1 
(If W operates only from the left, then we set cw = c). Coboundary operators 
5,: C"(X, C*(Y, G)) C™ (X, C"(Y, G)), 
C™(X, G)) (Y, C™(X, G)), 
are now defined in exactly this way with respect to the cochain groups 
Om X, C"(Y, G)) and C"(Y, C™(X, G)). Since any element of either of these 
groups can be regarded as a function of the form c(a,- 5 Yn)s 
we identify them and denote them by C™"(X,Y;G) or simply C™". It is 
trivial that 3,6, = 0, 5d, = 0 and that 6,5, = 8,82. 
Let C" = C"(X, Y; G@) be the group, with vector addition, in which the 
elements are all ordered sets (Co, * Where c,e C*"*(X,Y;G). 


The elements (¢o,- - -, Cn) of this group are vector cochains. A homomorphism 
A: C"->C™" is defined by 


A(Co, Cn) (c’o, Cn = (— 1) 


with c,—Cn1—0. It follows from the fact that 8,8,—0, 5,8, —0 and 
that 8,8, = 8,5, that AA 
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Let OS pn, and let ©," C C” denote the subgroup of C" which 
consists of the vector cochains (¢o,° such that = 0,° +, = 0. 
These are denoted by When p—-—1, the group G_,° con- 
sists of the cochains (g)°, for all elements ge @ such that rg =g for all 
teX. Note that since rg = g, we have §,g =0 and therefore A(g)° = (8yg)’. 
The vector cohomology group §,” is then defined by = 3 
where 85" = ©," M A-*(0) is the group of vector cocycles. 


3. Statement of Theorem. Let (H,¢) be an X-extension of an Abelian 
group G@ by a group Y, i.e. H and Y admit X as a group of operators and 
there is an operator homomorphism ¢: H > Y, onto Y, with kernel G. Since 
E admits X as a group of operators, and ¢ is an operator homomorphism, 
( also admits Y as a group of operators, with x: G = G the automorphism 
induced by = LE, for all re X. 

Let (E’,¢’) be similarly defined in terms of groups X’, Y’ and G’. 
Both F and EK’ will be written additively though in general neither will be 
Abelian. 

Suppose f: Y—X’ is a given homomorphism. A homomorphism «: HE’ 
is an operator homomorphism associated with f if and only if a(xe) = (fx)ae 
for all ee X, ee HE. An operator homomorphism «: H— EH’ is a proper 
homomorphism if and only if «@ C G. Thus a proper homomorphism 
a: E— E induces an operator homomorphism 6 = «|G: G— G@’ associated 
with f in the sense that 6(2g) = (fa)@g, and an operator homomorphism 
vy: YY’ defined by vp —¢’a, also associated with f, i.e. such that 
= (fa) py. 

Let representatives e(y) eg '(y), e’(y’) be chosen for all 
ye Y, ye Y’, with e(1) = 0, e’(1) =0. Let Y operate on G and Y’ on @’ 
by the usual rules: 


y-g=e(y) +9—e(y) (ye Y3geG@), 
and 
=e'(y’) + 9’ —e'(y’) ; GQ’). 
Finally the operation of Y on G’ is defined by y- 9’ = (Wy) -g’ for all ye Y, 
fe’. 
It is now a trivial matter to verify that, with these definitions of 
operations, the groups X, Y, G@ satisfy all the requirements set out in section 
2 above. Thus we may define $)"(X,Y;G’). TI prove the 


TuHEorEM. The proper homomorphisms — E’, associated with 
X’, all of which induce the same homomorphisms 6: G—> y:Y—>Y’ 
are in a one-one correspondence with the elements of 3)1(X.Y;G@’). In this 
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correspondence the elements of AC_,°(X,Y;G’) correspond to proper homo- 
morphisms «: E — E’ of the type 

+ae—g’ (ec LZ), 
where (g’)°e ©_,°, and where a: EH — E’ is, as above, a proper homomorphism 
associated with f, and inducing 6 and y. 

If (E’, ¢’) = ¢), X’ =X, Y’ = Y and f—=1:X > X, 6G, 
y=1:Y—-Y, where 1:Q—@Q denotes the identity automorphism of any 
group Q onto itself, we may define the group A, of proper automorphisms 
of associated with 1: ¥ X, which induce 1:G—G and1:Y—Y. In 
this case the one-one correspondence of the theorem becomes an isomorphism, 
and if A. denotes the subgroup of A, which consists of automorphisms 
e—>g+e—g, where 1g g, we have the 


CorottaRy. A,/A, ~ Y; GC’). 
If X — 1, this reduces immediately to the theorem of Baer. 


4, Proof of Theorem. I first recall that if we are given the represen- 
tatives e(y), e’(y’), chosen in section 3 above, we have associated with the 
extensions (#,¢) and ¢’) the cochains: 


y) = ve(y) —e(ry) (ce X;yelY), 
Y2) =e(Yr) + e(Y2) — 
and 
Co(y's, y's) =e (y's) + — Y’). 


It is a simple matter to verify that (see [4], p. 224) 8,co—0, 8,c, = 0, 
= 0, 8,07, = 0, = Syc,, and = 8,c’;. 

Now let «,: 4» LE” be any proper homomorphism which is associated 
with a homomorphism f: Y — X’ and which induces 6: G—> @ andy: Y > Y’. 
Any element of # may be put in the form g + e(y) for some ge G and ye Y, 
and similarly for Z’. Suppose a,(g + e(y)) =g’ + e’(y’) for some g’e @, 
Since a, induces 6 and y we have g’ + e’(y’) = 69g ai(y) + (yy) 
where a,(y) C°?(X,Y;G’). Now if and y, Y, then 


(91 + €(Y1) + 92 + (Ys) ) = (91 + G2 + Co(Ys, Y2) + ) 
= 691 + + Y2) + + 
= 0 (Y1Y2) — (Yr) — (Y2) + (Ys; Y2) 

— + O91 + + O92 


it 
a 
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Therefore 


(4.1) + + g2+ e(y2)) 
= — (801) Yo) + OC0(Y1, Yo) — Co (Wy, WY2) 
+ 99: + (Wy) + + a1(Y2) + (Wye). 


But by definition, z, is a homomorphism; thus 


(4.2) (gi + + + e(Y2)) = (91 + €(Y1)) + + ) 
= 69, + (y:) +e’ + O92 + a1 (y2) + 


and therefore, comparing this with (4.1), we have 


(4.3) (Y15 Yo) = (Ys, Yo) — Co WY2)- 


Again, if ge G, re X, ye Y, we have 


a,(rg + re(y)) (1g + y) + e(zy)) 
= Org + + (ty) + 
= a, (ry) — (fx)ai(y) + y) — es (fa; yy) 
+ (fx) (Og + a(y) + 


Therefore 


(4.4) + ve(y)) = — (8201) y) + 0: (23 — (fa; py) 


+ (fa) (%(g + e(y))). 


But by definition, a, is an operator homomorphism; thus 


(4.5) + ve(y)) = + e(y))} = (fe) (a (9 + e(y))), 


and therefore, comparing this with (4.3), we have 


(4.6) (8.01) y) = 8c, y) — (fa; py). 


Conversely, if 6:@—G’ and y:Y—Y’ are operator homomorphisms 
associated with f: X — X’, we may define a mapping @,: i — EL’ by setting 


ai(g + e(y)) =09 +ai(y) + (yy), 


for all ge G and ye Y, where a,eC°(X, Y;G’) and satisfies conditions 
(4.3) and (4.6) above. It is a straightforward matter to prove that a, 
so defined, is a proper homomorphism associated with f: X — X’ and inducing 
$and y. Firstly it is obvious that a, induces 6 and y as required. Secondly, 
it is a homomorphism, since (4.1) and (4.3) imply (4.2). Thirdly, it is 
an operator homomorphism associated with f: X — X’, since (4.4) and (4. 6) 
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imply (4.5). Finally it is obvious that «@ C G’ and «a, is therefore a proper 
homomorphism of the required type. 

Let a,:H— EH’ be another proper homomorphism associated with f 
and inducing 6 and y. Further let + e(y)) = 69 + a2(y) + e’ (yy) 
(geG;yeY). Again we have and (8,42) y2) 
and (8,a,)(x;y) satisfy similar conditions to (4.3) and (4.6). Thus 
(a, is a vector cochain in the group Y ; G’) and it follows from 
(4.3) and (4.6) that A(a,—az)*=0. That is, (a,—a2)* is a vector 
cocycle in 3o'(X, ¥;G’). It is obviously independent of the choice of the 
representatives e(y) and e’(y’). 

Suppose now that a,: H — LE’ is fixed and let a.: HE’ be any proper 
homomorphism associated with f and inducing @ and y. I say that the corre- 
spondence @,—> (a@,— is one-one with 8 .'(X,Y;G’). For suppose 
H —> EH’ are proper homomorphisms associated with f and inducing and 
y. If a. and 2’, have associated with them, as above, the functions a2(y) and 
a’.(y) respectively, and if (a, — a2)? = (a, —a’2)', then a; — =a, —@’2. 
Hence a, =a’, and @—2@,:H— EH’. Thus the correspondence is one-one. 
It is onto 8.'(X, ¥Y;G’) for if (do) is a cocycle in 3o1(X, Y; G’), define 
a: H — by setting 


+ e(y)) = Og + a2(y) + (yy), 
for all ge G, ye Y, where az(y) =ai(y) —do(y). Then 
(8,42) (Ys, Y2) = Yo) — Co 
since A(d,)* implies that (8,do) =0. Similarly 
(S202) (23 y) = Oe, (x;y) — (fas py), 


and it follows that «.:— E’ is a proper homomorphism of the required 
type, which by definition corresponds to the cocycle (d,))?. This establishes 
the first part of the theorem. Finally, suppose (g’)°e €_,°. Define «,: HF 


by setting a.(g + e(y)) + + e(y)) for all ge G and ye Y. 
Thus 


a(g+e(y)) =Og+a(y) +97 +e 
=$g9+a(y)+9—y +e (yy) 
= 69 + a2(y) + e’ (Wy) 


where a2(y) =4a,(y) — (8y9’) (y). Obviously 
(8y42) (41, Yo) = (8yA1) (41, y2e Y), 


and 


(8.2) (3 y) = (8201) y) (ceX;yeY). 
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It follows immediately that a, so defined, is a proper homomorphism asso- 
ciated with f:X — X’ and inducing 6 and y. Moreover a, corresponds to 
the cocycle (a, = (dyg’)1, and (dyg’)'e¢ AC_,°(X, Y;G’). Thus the 
Theorem is proved. 

Suppose now that the conditions of the Corollary hold. It is obvious 
that the proper homomorphisms associated with 1: ¥ — X, which induce the 
identity automorphisms 1: G—>G,1:Y—Y, form a multiplicative group A, 
in which the product of two homomorphisms e, B is the homomorphism 
pa: E— EH. Moreover A; is Abelian. Set —1:H—£ in the proof of 
the Theorem and suppose «;, «;: H — E to be proper automorphisms associated 
with 1:X¥—Y, given by +e(y). (r=14)). 
Thus a and a; correspond to the cocycles (—a;)' and (—a,;)' respectively. 
But ajai(g +e(y)) =g+a(y) +a(y) +e(y). Thus corre- 
sponds to the cocycle (— a;—a;)*e¢ 3o1(X, Y;G@), and the one-one corre- 
spondence of the theorem is in this case an isomorphism onto 3,'(X, Y; @). 
Under this isomorphism AC_,°(X, Y;G@) obviously corresponds to the sub- 
group A, of A, which consists of all automorphisms, e > g + e—g where 
(g)°e ©_,°(X, ¥;G@). Thus the Corollary follows. 


Part II. 


5. Outline of method. For convenience IJ shall first recall the defini- 
tions of loop prolongations, and the classification theorems proved by EHilenberg 
and Maclane (cf. [2], para. 17, and [3]). ; 

A loop L is essentially a non-associative multiplicative group; its ele- 
ments satisfy the following axioms: 


(5.1) for a,be LZ, the element abe LZ is uniquely defined; 
(5.2) there is a unit element 1 eZ such that al ~a—1a for allaeL; 


(5.3) for any a,beL the equations ax = b and ya=b have unique solu- 
tions 2, ye L. 


Given a,beL, the associator A(a,b,c)eL is defined by the equation 
a(bc) = A(a,b,c)[(ab)c]. Higher associators are defined inductively by 


A(@1,° * Gon, @) = A (dy, de, A(@z,° *,@on,@)). 


A loop prolongation of a group Y by an Abelian group G, with given 
operators of Y on G, is a pair (L,¢) satisfying 


(5.4) JZ is a loop containing G as a subgroup; 
9 


| 

f 

2) 

§ 

or 

1e 

se 

d 

d 
1e 

d 


W. H. COCKCROFT. 


(5.5) @ is a homomorphism of Z onto Y with kernel a group K con- 
taining G; 


(5.6) A(k,a,b) =1—A(a,k,b) for L, ke kK; 
(5.7) kA = Ak for any ke K and any associator A = A(a, b,c) ; 
(5.8) ag = (¢(a)g)a for aeL, geG. 


It is usual in the theory of loop prolongations to use the notation a:b 
for the element z which is the unique solution of the equation ab = za. 
I shall do this and will drop the notation y- g for the operation of an element 
y of a group Y on ge G, which was used in part I above. This will be written 
simply yg in future. No ambiguity will arise in what follows because of this. 
Thus formula (5.8) now reads a: g $(a)gQ. 

In [8], Hilenberg and Maclane define two prolongations (Ly, ¢;) 
and (L2,¢2) to be equivalent if and only if there is an isomorphism 
6:L, ~ L, such that ¢.6=¢,:L,—Y. They also define a multiplication 
(Li, $1) @ (L2,¢2) of prolongations by considering the loop P of pairs 
42), @,€L,, Ly such that = ¢.a.. The loop L, L, is defined 
to be the factor loop P/U where U is the set of all pairs (g,g7') for geG. 
These definitions follow those used in group extension theory (see [1], para. 4). 
The prolongations are now divided into four classes defined by the conditions 


A(ai,° a) = 1, A(ai,- Gon, = 1, 
G,/: Qn, @) € G, G,™: A(a,,- - -, Gn, k) G, 


where n> 0, ae L, and ke K. A further class R, is defined 
which consists of those prolongations (Z,¢) of Y by G which contain a 
prolongation (L’, ¢’) of Y by G@ which is in class O,* (i.e. L’ C L, o|L’ = ¢' 
and (L’,¢’)20O,*%). Finally we have a class Go defined by the condition 
K = G. 

With each loop prolongation is associated a characteristic cochain 
gu’ € C*(Y, K) with values in the centre Z of K for r > 2, defined by taking 
representatives u(y) e¢*(y) for all ye Y, with u(1) —1, and by setting 


U(Y1)U(Ys) = Gu? (Ys, Yo) U(Y1Y2); 


Yon Y) =A(U(Yr),° U(Yon), U(Y)) (n= 1), 
Yons y’) = A(u(y:),° U(Yon), Ju" (Y; 
(n= 1), 
for all Yon, These cochains give rise to homomorphisms: 


— 
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forn >0. After defining the products of classes, e. g. On” © G+, as the set 
of prolongations which are products of prolongations in the classes, Hilenberg 
and Maclane prove that these homomorphisms induce the isomorphisms in 
the following theorems. 


THEOREM EM1. O,K G,4/0," © = G) (n> 0), 
THEOREM EM2. On,” 9 H?"*?(Y, G) (n> 0). 


In each case the system on the left is a group of modulo classes. 

These then are the classification theorems of Hilenberg and Maclane 
which are proved in [3]. In section 6 below I define loops with operators 
and in section 7 X-prolongations (L,¢) of a group Y by a group G are 
defined. These are essentially loops Z with operators in X such that ¢: LY 
is an operator homomorphism whose kernel K is a group which contains the 
group G. 

The classification of such prolongations could be done directly, defining 
characteristic vector cochains associated with X-prolongations and following 
the pattern of [3]. However it is simpler to use the indirect technique which 
J. H. C. Whitehead employs in [4]. To this end I first define the group W 
which is a group extension of X by Y. Its elements are all pairs (y, 7) 
with (41, 71) (Yo, = X;y,eY). For convenience y is 
identified with (y,1) and denotes (1,7). Thus (y,7) and w, w; 
are used to denote yz and y;%. The operation of W on G is defined by 
setting = y(xg), for all we W, geG. 

I now associate with each X-prolongation (Z,@) an ordinary (i.e. non- 
operator) prolongation (L*,¢*) of the group W by the group G. The 
correspondence (L,¢) — (L*,¢*) is shown to be one-one to within an 
equivalence. Furthermore, the classes of X-prolongations which are defined, 
O,*”, On**, Gy*4, G,** are carried over into the corresponding classes 
On”, Onk, ete. by the correspondence. Products of X-prolongations and their 
classes are defined and are preserved by the correspondence in a similar way. 
Thus the classification of X-prolongations is referred to the classification of 
prolongations of W by G, which are called associated prolongations. 

In order to use the theory of [3] to classify the associated prolongations 
we require a new cohomology group H,”"(W, G) which is defined as follows. 
Given W, as above, let 4: X¥—> W be the homomorphism defined by setting 
At = Z, and let A*: C"(W, G) + C"(X, G) be the homomorphism induced by 
A. Denote A*-1(0) by C,,", and let 8 be the coboundary operator defined by 
(2.1). Since #g 2g, it follows that A*8 = and hence 8C," C 


— 
= 
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Let 8,: Cy* Cy”*! be the homomorphism induced by and let H,°, H,’,---, 
be the cohomology groups which are defined in terms of C,°, C,?,: - +, and 
the coboundary operator 6,. That these groups are the ones required for our 
classification, follows from the fact that the characteristic cochains of the 
associated prolongations belong to C,”"(W, K) and not simply to C"(W, Kk), 
as do the characteristic cochains of [3]. The results of. [3] can obviously 
be taken over as they stand, setting C,”, B,”, Z,.", H,." for the corresponding 
CO", Bx, Z" and H™” (I use the normal notation for cochain, cocycle, coboundary 
and cohomology groups) except for one minor and obvious alteration: it is 
necessary to prove a new existence theorem, replacing Theorem 12.1 of [3], 
which proves that given a cocycle eZ”, there is a prolongation with that 
cocycle as its characteristic cocycle. Here we require that the constructed 
prolongation should not be just any prolongation but an associated one. The 
proof I give is a modification of Theorem 12.1 of [8], and after certain 
initial differences, which are given in full, it is identical with that in [3]. 
I therefore omit the identical portions and prove that, having constructed a 
prolongation (Z*,¢*) as in [3], (L*,¢*) is an associated prolongation. We 
are then in a position to quote the classification theorems of [3], EM1 
and EM2 above, to classify the associated prolongations in terms of the 
groups H,”. 

Now it is proved in [4], that ~ for each n=0,1,2,- 
Thus the classification of associated prolongations by means of the groups 
H,,", together with this isomorphism and the one-one correspondence of the 
associated prolongations with the X-prolongations yields a classification of 
X-prolongations in terms of the groups §,_1”. 

In a final section I indicate the definition of a characteristic vector 
cochain of an X-prolongation which would yield these results directly. 


6. Loops with operators. Let XY be a multiplicative group and L a 
loop. The loop LZ will be said to have operators in the group X, or to be 
operated on by X, if given re X and ae TL there is a unique element zac L 
such that 71 — 1, laa, where the symbol 1 is of course used for both the 
unit of Z and of X. 

The vector, or operator, associators of L are triples (Ao%, A.°. A.*) of 
elements of LZ which are uniquely defined by the equations: 


(A203) == Ao® (Gh, Ae, As) { dg} (1, @2, a3 € L) ; 
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Higher order vector associators are defined inductively. Thus the vector 


associator (Ao°"*1,- - is defined by 
(dy, dona) = Ay? (a, Qe, A,?""* (ds, ons) ) 
A,?**1( 215 Qon) = (215 Gen)) 5 
where 2=r=2n and °*,%eX. 


A subgroup S of LZ will be said to be an operator subgroup if zS C 8 
for all ee X. If also A,*(s,a,b) =1—A,3(a,s,b), =1, for 
all se 8, a, be L and xe X, then we can define in the usual way (see e. g. [3], 
para. 3, and the references cited there) the factor loop L/S of cosets Sa, of 8 in 
L. Furthermore, this loop will have operators in X defined by z(Sa) = S(xa) 
and the natural homomorphism p: L— L/8 defined by pa = Sa, will be an 
operator homomorphism in the sense that px = zp. The loop L/S is obviously 
associative (and hence a group) if, and only if, Ao*(a:,a2,a3) eS for all 
If also a2) and € for all 2,,2,eX 
and a,,a,¢L, then L/S is obviously a group which admits X as a group of 
operators in the usual sense. 


7. Prolongations with operators. As in part I, section 2, above, let X 
be a multiplicative group acting as a group of operators on a multiplicative 
group Y, and furthermore let both XY and Y operate on an Abelian group G 
subject to the usual condition, now written x(yqg) = (xy) (2g), for all re X, 
ye Y,geG. An X-prolongation of Y by G is a pair (LZ, ¢) satisfying the 
conditions : 


(7.1) JZ is a loop with operators in X and containing G@ as an operator 
subgroup ; 


(7.2) @ is an operator homomorphism of Z onto Y with kernel a subgroup 
K of L which contains G; 


(7.3) a,b) =1—=—A,9(a, k, b) (ke K;a,beL); 
(7. 4) A,*(z;k,a) (ceX;keK;aeL); 
(7.5) kA? = Ask (ke K;i—0,1,2); 
(7. 6) a-g = (¢a)g (aeL;geG); 
( t(a:g) = (2a) (zg) (ceX;aeL;geG). 
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Here, as stated before, the symbol a-g denotes the solution z of the equation 
ag = 2a. 

Two such X-prolongations of Y by G, (Li,¢:), (Le, $2), will be said 
to be equivalent if there is an isomorphism 6: 1,— L, such that 20 = 62, 
¢, = $26. A multiplication (Li, ¢:) © (Le, $2) is defined in the usual way by 
considering the loop P of pairs a2), a, a2 L, such that = 
Multiplication is defined by setting (d:, a2) (b:, b2) = (142, b,b2). The prod- 
uct loop LZ, ® LZ, is then defined as P/U, where U is the set of pairs (g, 9"'), 
geG. The operation of X on L,@ TL, is then defined in the obvious way, 
i.e. = A homomorphism ¢:L,@L,—Y is defined by 
setting ¢[4a1, = ¢1(a1) = ¢2(a2), for all elements [@:, a2] of L, Lz, and 
the product is thus an X-prolongation of Y by G. 


8. The associated prolongation. Let W be the group extension of Y 
by Y which is defined in section 6 above. Let (L,¢) be an X-prolongation 
of Y by G. Consider the set L* of elements (a,x) for all ae L, re X, with 
a multiplication defined by 


(a, 21) L2) (a; (2102), 2,22) ae L; v1, Ve a). 
Lemma 8a. L* is a loop. 
The proof which will be omitted, is trivial and consists merely of showing 
that L* satisfies the conditions (5.1), (5.2), (5.3) above. 
A mapping ¢*:L*-—>W is defined by setting $*(a,7) = (g¢a)# for 
all (a,x) e L*. 


LEMMA 8b. ¢* is a homomorphism. 
The proof is trivial and will be omitted. 


Let us for convenience identify the elements ae L with the elements 
(a, 1) L*, and denote (1,2) by x. 


LemMMA 8c. (L*,¢*) ts a prolongation of W by G. 


We require to prove that (L*, ¢*) satisfies the conditions (5. 4)-(5. 8) 
above. 

With the identification proposed above, L* obviously contains Z and 
therefore it contains G as a subgroup. This is (5.4). Also ¢*:L*—~W 
is obviously onto W and has kernel ¢*-'(1) = ¢-'(1) = K, which contains 
G. This is (5.5). 

Let the (non-operator) associators of L* be written A* (a,x), 
for all a,,a2,a,;¢ L. X. From the fact that 


A* (a,%1, 2X2, Ass) {[ (Asx) } (@,%1) [ (@2%2) (ass) | 


| 
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it follows that 
A* { ( [a, (2a2) | [ (122) ds ) 
{a, [a (a2 ) ) ] 


Manipulation of the product {a,[a,(d2(x2d3))]} brings it to the form 
(a) { where, using the additive notation for the centre 
Z of K, 


= Ay + a, 3 de, Tos) + Le; as). 
Thus 
+ [a,(a1a2)] ds). 
Using (7.3), (7.4) and (7.5) above, it follows that 
A* (k, do%2, = 1 = A* kb, (ke K), 


and A*k = kA*, for any associator A*. This proves (5.6) and (5.7). 
Finally, if ae L, ce X and geG, 


(ax) )g} (ax) = { ($a) (2g) } (ax) 


2g) (ax) (by (7. 6)) 
{(a-2g)a}x (by (5.6)) 
{a(xg) jx 

= {(ax) -g}(ux) (by (5. 6)). 


Thus (o*(ax))g = (ax) -g as required by (5.8), and the proof of the 
lemma is complete. 

This prolongation (L*.¢*) is the prolongation associated with (L, ¢). 
Notice that. by definition of the multiplication of L*, if u*(w) = u(y)x, 
where pu(y) =y, u(1) = 1. then o*u*(w) =w, u*(1) = 1, and 


u* (F,)u* = u* (7,22) (21, 
A* (a,%,, do, (%2)) 1 = A* (ay, U* (F,), 226X301, 
A*(a,x,, u* (#2), = 1 (% % mek). 


It is these properties which characterise an associated prolongation (L*, $*) 
of W by G. More fully, an associated prolongation of W by G with kernel 
K is a loop prolongation (L*, ¢*) of W by G with ¢**(1) = K, satisfying 
the usual conditions (5. 4)-(5.8), and such that there exists a set of repre- 
sentatives u*(w) L*, with ¢*u*(w) =w, u*(1) =1 and 


— 
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(8. 1) u* = u* ; 
(8. 2) A*(b, a, u*(Z.)) 1 = A*(a, u*(Z,), ; 
(8. 3) A*(b, u*(z,), u*(Z.)) = 1; 


for all ae and be L*. 


Lemma 8d. The correspondence (L,) — (L*, $*) is one-one with the 
associated prolongations of W by G, to within an equivalence. 


I require to prove that the correspondence (L,¢) — (L*,¢*) has an 
inverse (L*, ¢*) > (L,¢). Thus, let (L*, 6*) be an associated prolongation 
ot W by G defined as above. Consider the set of elements ¢*-*(¥’) C L*. 
They form a loop Z, A homomorphism ¢, :L,—Y is defined by setting 
¢, = ¢*|LZ,. The group X operates on L, through the representatives u*(z), 

x(a) = u*(Z)-a (te X;acel,). 


I prove that (L,,¢;) satisfies the conditions (7.1)-(7.7) above and thus 
that it is an X-prolongation of Y by G. Hence we have a correspondence 
(L*, 6*) — (L,, 41) of associated prolongations of W by G to X-prolonga- 
tions of Y by G. 

Condition (7.1) is obviously satisfied in virtue of the definition of 
(L,,¢:). Since L, D K DG, G is a subgroup of L,. It is an operator 
subgroup, for 

a(g) = = (by (5. 8)) 
= 27g 
and by definition zg © G. 

The homomorphism ¢, is an operator homomorphism, for if re X, ae Li, 

then 
(2a) — (2) -a)u*(#)} = 


so that ¢,(za) = 2z¢,a. It is a trivial matter to prove that the kernel of ¢: 
is K, which by definition contains G. This proves (7. 2). 
The vector associator (A,*, of LZ, is defined as in section 6 above. 


By definition, if a;, a2, a; ¢Z,, then 


(8. 4) (dy, G2, @3) = A* (dy, Ge, as). 


— 
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If in addition z,¢ X, then 


{a,(dods) = {u*(Z,) (dos) }u*(Z1) = 
= A*(u*(%1), do, ds) {(u*(%1)a2)as} 
= A*(u*(z,), de, ds) {[(u* (41) Jas} 
= A*(u*(E,), de, ds) a3) } 
= A*(u*(Z,), do, ds) a3) }u* (41) 
= A*(u*(%,), de, a3) 
Thus, by definition of A,°, 
(8. 5) 3 G2, M3) = A*(u* (2,1), de, ag). 


Similarly, if X, 


= as) }u* 


= u*(Z,){u*(Z2)as} 


(by (8. 2)) 
(by (8. 2)) 


(by (8. 1)) 
(by (8. 3)) 
(by (8. 2)) 


Thus, by definition of A.°, 
(8. 6) (21, == A*(u*(Z,), (T2), ag). 


(by (8. 1)) 


The conditions (7.3), (7.4) and (7.5) are now obviously satisfied in virtue 


of (8.4), (8.5) and (8.6), and (5.6) and (5.7). 
Finally, a- g = (¢.a)g by definition of (L*,¢*) and 


x(a: g) = u*(z)- g). 


But it is proved in [3] that in an ordinary loop prolongation (L, ¢), defined 
as in section 5, the elements of G@ associate with the elements of LZ and 
a: (b-k) =A(a,b,&)[(ab)-k] for all a,be L, ke K. Thus, in particular 


a:(b-g) = (ab)-g for all ge G. Hence, in (L*, we have 


=w*(é) (a-g) = (u*(#)a) 
= {(u*(#) -a)u*(2)} 9. 


Thus again using the results of [3] quoted immediately above, we have 


a(a-g) = (u*(Z) -a) (u*(x) -g) = (xa) (2g) 


for all ze X, ae L,, and ge G. This proves (7.6) and completes the proof 


that (Z,,¢1) is an X-prolongation as required. 


To complete the proof of the lemma, I require to show that the corre- 
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spondence (L*, 6*) — (Li, ¢:) is the inverse of (L, $) > (L*, $*). Suppose 
therefore that given (L*,¢*), I form (L;,¢:) and then from (1;,¢:) I 
form the associated prolongation (1,*,¢:*) as above. Define a mapping 
6: (L,*, (L*, by setting 6(a, x7) = au*(%), for all ae Ly, re X. 
I prove that @ is an equivalence of the prolongations as defined in section § 
above. It is certainly a homomorphism, for if a, L,, X, then 


0( 22) = (by (8. 2)) 
= {a,[(u*(4:) }u*(@2) = {ai ] (a) 
= [ai (4:42) (by (8. 1), (8.2) & (8.3)) 


= = O{ (ai, 21) L2)}. 


It is onto L*, for if ae L* and ¢*a* = yz, we may solve the equation 
a= zu*(Z) for a unique solution z say, in which case a= 6(a, 
It is an isomorphism, for if 6(a,7)=1, then auw*(z)=—1. Thus 
(au*(z)) = (¢*a)@—1 and and u*(z)—1. Hence a—1 and 
(a,x) =1. Finally, ¢*6 = ¢,*, and 6 is an equivalence. This completes 
the proof of the lemma. 


9. The classification of the associated prolongations. In this section 
I use the results of [3] in order to classify the associated prolongations. 
First, using the representatives u*(w) of the associated prolongations (L*, 4*) 
defined above, we define the characteristic g-dimensional cochain gy+? of 
(L*, which is in the group C,27(W,Z) C C2(W,Z), q > 2, where Z is 
the centre of K. We have, as in section 5 above. 


= u*(w,)u*(we), 
Won, W) == U*(Won), U*(w)) (n = 1), 


for all Won, w,w’e W. By definition of A* and by (8. 1)-(8.3), 
we have gus"(Z1,° %g) =1 for all Thus C,%(W, Z), 
for r > 2. 

Thus we may quote the results of [3], i.e. Theorems EM1 and EM? 
of section 5 above, with H,"(W, G) for H"(W, @), to classify the associated 
prolongations, provided we can prove the existence 


THEOREM 9a. Given a cochain g,,4¢C,%(W, G@) there is an associated 


pr 
sa 


an 


Yu 


re 


e 
al 
om 
(1 
fo 
(J 
4 


INTERPRETATION OF VECTOR COHOMOLOGY GROUPS. 615 


prolongation (L*,o*) of W by G and a system of representaties u*(w), 
satisfying (8.1)-(8.3) above, such that 


Jur? = 9x4 
and 
(L*, (q = 2n,n > 0) 


(L*, Ga (q=2n+1,n>0), 


where the classes Gy, O,”, etc. are as defined in section 5. If further 
guste Zy44(W, G) then also 


(L*, $*) © OnE N (q == 2n,n > 0), 
(L*, 6*) © OnE (q=2n+1,n>0). 
The prolongations (L*,¢*) can be chosen so as to have an Abelian kernel. 


This is, of course, Theorem 12.1 of [8] with ordinary prolongations 
replaced by associated prolongations. 

We first construct the kernel K of the desired prolongation. It is the 
direct sum K.+ K; +---+ Koi-+ G, where Ky, is a free Abelian group gen- 
erated by the symbols [w,, y2, Ws, Ws," **, Wm] for all we W and ye Y C W; 
and for m=q. Next we define cochains hy”™eC,"(W, K), for 
m = 2 by setting 
Yo) Wm] if wy 1 (lsSism) 


+"(W1, We, » Wm) 0 if some wj—1 


for 2=m<q, and (m>q+1) 
(w; = yt; throughout). Further, for each w, e« W we define an endomorphism 
a(w,) of K by specifying: 


a(wi)g = wig 


—> (-— 1) *hy™(w1, Wms )— (— Wm); 
i=1 


for 2<m <q. 

From here the proof is identical with that of Theorem 12.1 in [3]. 
I shall therefore not complete it, but go on to prove that the prolongation 
(L*,6*) constructed from K by the methods of [3] is indeed an associated 
prolongation. 

L* is defined exactly as in [8] to be the set of all pairs (k,w), ke K, 
weW with (ky, w,) (ke, ws) = (ki (wi, Wiw2). The homo- 
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morphism ¢*: Z* — W is defined by setting ¢*(4,w) = w, and the repre. 
sentatives u*(w) are chosen by setting u*(w) = (1,w). If we now define 
the characteristic cochain gy? of L* in the usual way, we have g,+7 = 9,4 
(again exactly as in [3], Theorem 12.1). We require to show that (L*, ¢*) 
satisfies (8.1)-(8.3). Certainly (8.1) holds, for if w.,w.e W, then, as in 
[3], hy? (wi, We) u* (wwe) = u*(w,)u* (we). But Ay? =1, for all 
Thus = which is (8.1). Also if ky, e K, 
y,e Y, W, X, then 


{(ki, y:)(1, (ho, = (hi, yr: (Keo, We) 
_ and 
(ha, W2)} = Chi, yr) we), Ewe) 
= (hi {a(y:) We) (yr, 


But a(y,)%(%) =«a(y,%), and by definition of hy™, (8,7) (y1, Z, we) =1. 
Thus A*((k,,y:), (1, Z), (ke, w2)) =1, and since any element of 
is of the form (h,,4,), we have A*(a,u*(Z),b) =1 for all ae ¢*"*(Y), 
xe X,and be L*. This proves the second part of (8.2). Again, if k,, k. eK, 
W, yee Y, X, 


{(k,, w,)(ke, y2)} 2) (ey (a(wy Y2)s wiy2)(1, #2) 
= (ky Yo), 


since hy?(wiY2, T2) = 1. Also, since hy?(Yy2, = 1, 


(ky, { (ke, Yo) (1, = (hi, (hho, YoE2) = (1 eo) YoEo), WiYots) 
= Yo), WiY2ts) 
= wi) (ko, (1, 
from above. Thus A*((k,, w1), (ke, yz), (1, Z) = 1, and since any element 
of ¢*-1(Y) if of the form (kz, y2) we have A*(b, a, u*(Z)) = 1 for all be L*. 
aeg**(¥), and zeX. This is the first part of (8. 2). 
Finally, if ke K, we W, 2, 72€ X, 


(k, w) { (1, (1, Z2)} (k, w) (1, £12) (k, 
and 
{(k, w) (1, ,)}(1, (k, W (1, (k, WE,Z2), 


so that A*(b,u*(Z,), u*(Z))—1, for all be L* and 2,2.eX, which | 


proves (8.3). 
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This completes the proof of the modified existence theorem, as required 
for a quotation of Theorems EM1 and EM2. Thus, we have 


THEOREM EM1*. 
Of 0) G,.F = Hy" G) 


where the classes OnK, Gy”, etc. are classes of associated prolongations of W 
by G, 


and a similar Theorem EM2* replaces Theorem EM2. 
10. The classification of X-prolongations. In order to carry out the 


classification of the Y-prolongations of Y by G, I require four types of X- 
prolongations (LZ, They are O,*”, O,**, and G,**. If fone X, 


0," * *,@ons,€ L, ke K, they are defined by the conditions 
where in each case r= 0,- 2n. 


I prove that if (L*,4*) is the associated prolongation of (L,@), then 
the one-one correspondence (L,¢) — (L*,¢*) preserves these classes. We 
have the 


Lemma 10a. Jf (L,¢) belongs to one of the types On*",- -, 
then (L*,o*) belongs to the corresponding type +, Gn¥. and vice 
versa, 1.e. tf (L,¢) On*” then (L*, $*) © On”, ete. 


This is an immediate consequence of the 
LemMa 10b. 
for all age X, aje L and OSr < 2n. 


We have already seen that this is true when n = 1, in Lemma 8d, with 
*=u*(Z). Suppose that the lemma is true for n</k, then by definition, 
ifr>2 
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= A*(x,, %2, A* (ae, ° * 


Similar inductive arguments prove the lemma then r = 0, 1. 


In addition to the classes O,*4,- - -, G,**, we require a fifth class, R,", 
of n-reducible X-prolongations. These are X-prolongations which contain 
(in the obvious sense) an X-prolongation (L’,¢’) of the class O,**. It 
(L*,¢*) is the associated prolongation of (L,¢) and (L,¢) contains 
(L’, ¢’) © O,**, then from its definition it follows that (L*,¢*) contains 
the prolongation associated with (L’,¢’). Moreover it will follow that this 
associated prolongation is in the class O,%. Conversely, given (L*, ¢*) 
containing (L’*, ¢’*) ¢ O,*, it is easily seen that the X-prolongation (J, 9), 
constructed from (L*, ¢*) as in section 8 above, contains an X-prolongation 
(L’, 6’) ¢ O,** whose associated prolongation is (L’*, ¢’*). Thus the one- 
one correspondence (L, ¢) (L*, ¢*) carries R,»* onto Ry. 

Finally we require the products of the classes O,*”, ete. The product 
A @ B of two classes A, B is defined to be the set {Z, ® L.} of X-prolongations 
of Y by G, for all L,e A, L,eB, where L,®L, is defined as in section 7 
above. It is a trivial matter to prove that the prolongation associated with 
a product of X-prolongations is the product of the associated prolongations. 
Thus products of the classes O,*4,- - -,G,** are preserved. Hence using 
Theorems EM1* and EM2*, together with the fact that 


H,"(W, G) = GSni"(X, Y; @), 
we have the 


THEOREM A. 


0, ** ® Gy_1** =~ (X, Y; G) (n > 0) 
and the 
THEOREM B. 
‘a G,,*% ~ G) (n > 0) 


where in each case the system on the left is a group. 


11. The direct approach. Theorems A and B can be obtained directly 
by methods entirely analogous to those of [3], by using vector characteristic 
cochains (¢.° Cg1)%e Y;K), which are associated with an X- 
prolongation (L,¢). These cochains are defined by 
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Cg (215 ° Yon Y) 


— Lq3 u(y U(Yon), ko(y, y’)) (n 1) 
Const (2, y) — Len 5 ky (Lons1y y)) 
where g=0,-- -,2n, gu(yi) = yi, u(1) = 1, ae X, ye Y, for all 1,7 as 
required, and 
holy, y’ucyy’) = = Tony). 


These cochains *,¢g-1)% are then homomorphisms which induce the 
isomorphism of Theorems A and B in the same way as the characteristic 
cochains g,? induce the isomorphisms of Theorems EM1 and EM2. 


Kinq’s COLLEGE, ABERDEEN, SCOTLAND. 
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DOUBLY STOCHASTIC MATRICES AND THE DIAGONAL OF 
A ROTATION MATRIX.* ? 


By ALFRED Horn. 


An elegant theorem of Hardy, Littlewood and Polya (Theorem 1 below) 
has been generalized and applied in various ways in recent literature. In 
this paper we shall present an extension of their theorem (Theorm 4). As 
an application we prove that the set of all diagonals of rotation matrices of 
order n is equal to the convex hull of those points (+ 1,- --,+1) of which 
an even number of coordinates are —1. From this, we determine the set 
of all diagonals of orthogonal or of unitary matrices. It would be interesting 
to prove without Theorem 4 that the set of diagonals of rotation matrices 
is convex. 


1. Definitions and notation. If A is a matrix, then Aj; will denote its 
i,j component and we write A = (Aj,;). A similar convention will be used 
for vectors. An n-vector is a vector with » components. A unit vector is a 
vector of length one. The diagonal of a matrix A is the vector (Au1,°- +, Ann). 
A diagonal matrix is one whose non-diagonal elements vanish. The identity 
matrix is the diagonal matrix with diagonal (1,- - -,1). 

A permutation matrix is a matrix with elements all 0 or 1 such that 
every row and every column contains exactly one element equal to 1. A doubly 
stochastic (d.s.) matrix is a matrix P such that Pj; 20, Py = > Pi 

v J 


for all 1 and j. We use the symbols A’, A and A* for transpose, conjugate 
and conjugate-transpose respectively. By an orthogonal matrix, we mean a 
real matrix A such that A’—A-, A rotation matrix, or rotation, is an 
orthogonal matrix with determinant +1. A unitary matrix is a matrix A 
such that A* = A-}. 

If x is a real n-vector, then H(z) is the convex hull of all the points 
(Tay* Varying over all permutations of (1,:--,n). By 
we mean the set of all & termed sequences o of integers for which 1 = 01 <°"' 
<o;n. The only member of 8", is the empty sequence. 


* Received April 16, 1953. 
1 This paper was written while the author received partial support from the Office 
of Naval Research. 
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If z is a complex number, 0x denotes its real part. Occasionally we 
shall use 1 both as a summation index and to denote the imaginary unit 
(—1)%. But we shall do this in such a way as to avoid confusion. 


2. Doubly stochastic and ortho-stochastic matrices. 


THEOREM 1. Let x,y be real n-vectors. Then the following statements 
are equivalent : 


(1a) y = Px, where P is a d.s. matriz. 


(1b) ye H (2x). 


k n n 
(1c) max > yo, max Sao, 1SkSn and 
oe i=1 i=1 4=1 i=1 
n 
(1d) Sf(yi) [DS f(x) for any convex function f whose domain con- 


tains all the numbers xi, yi, 1SStSn. 


Proof. The equivalence of (1a), (1c) and (1d) is proved in [1], pp. 49 
and 89. Also (1b) is equivalent to (1a) by virtue of the following theorem 
of G. Birkhoff [2]. 


THEOREM 2. A matrix is d.s. if and only tf it lies in the convex hull 
of the set of all permutation matrices. 


In case «4, -=2,, the conditions (1c) are equivalent. to 
n k 
t=n-k+1 4=1 


We take this opportunity to point out a companion to Theorem 1, various 
parts of which are scattered through the literature. 


THEOREM 3. Let x,y be n-vectors for which 1 
Then the following statements are equivalent: 


i=1 


(2a) y = Px, where P is a matriz such that Py 20, > S 1, Py 1, 
g=1 


k k 
(2b) max > yo, max 1SkEn. 
oeS"% i=1 i=1 
n 
(2c) f(yi) S Sf (xi) for any convex non-decreasing function f whose 
i=1 


domain contains all the numbers yi, 1 


) 
n 
h 
g 
d | 
). 
y 
y 
e 
a 
A n n 
| 
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Proof. The proof given in [1] for the implication (1d) -> (1c) also 
proves (2c) > (2b). The fact that (2b) — (2c) is proved in [3]. Ky Fan 
[4] proves (2a) > (2b). He also states (2b) — (2a) and quotes a lemma 
used in his proof. Using the fact that (1c) — (1a), a simple proof may be 
obtained. It is also possible to derive (2a)—> (2b) from (la) — (1b) by 
using the fact that any matrix of the kind described in (2a) may be aug- 
mented to a d.s. matrix (of order S 2n). 

A special class of d.s. matrices consists of those matrices Q such that 
Qi; = U;;? where U is an orthogonal matrix. Let us call such matrices ortho- 
stochastic (o.s.) matrices. Not every d.s. matrix is an o.s. matrix. In 
fact it is not hard to see that the matrix 


${1 0 1 
cannot even be expressed as a product of o.s. matrices. (This example is 


due to A. J. Hoffman.) Nevertheless we have the following theorem. 


THEOREM 4. If x,y are n-vectors satisfying (1c), then there exists an 
0.8. matrix Q such that y= Qz. 


The proof of Theorem 4 will be simplified if we first prove a lemma. 


DeFIniton. If let T™(2,,- - be 
n k k 

the set of points (2:,---,2m) such that 20, = > whenever 
i=n-k+1 i=1 i=1 


ce 8", l=ksm. Also let T°(2,,- --,2,) be the empty set. 


Lemma. If 7, and (41,° T""(41,° +, and 
n = 2, then there exists a set of n —1 real ortho-normal n-vectors u3,---,u"™ 
such that 


(3) y= (wi), 1SjSn—1. 
i=1 
Proof. The statement is obvious for nm 2. We proceed by induction. 
Suppose the lemma is true for 2n< N. Clearly T'-1(2,,- - -,2y) con- 
sists of the points (y:,---, Yw-1) such that (y:,---, yw-2) +, ty) 
and such that 6S yy.a, where 0 is the largest of the numbers 


N k k+1 k 
bo = 2% — yo,, and a is the smallest of the numbers aco = — Yn, 
i=N-k i=1 i= 


as o ranges over S‘-*, and k& varies from 0 to N —2. (When o is the empty 
sequence, bo = ry and dg = 
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For each oe S*-*,,0 S N — 2, let To be the subset of ry) 
on which a= ado, and let T’o be the subset on which b—bdo. It is not 
hard to verify that Tc consists of the points (y1,- --,¥Yy-2) such that 


(Yor, Yo,) € T* (2, ° and (Yor, Yoryox) € 


TN-*-* @y), Where o’ is the sequence consisting of those integers 
from 1 to N—2 which are not among the terms of o. Verify also that 
N-2 


To=T'o. It follows that T%-1(a,,---,av)=—lWU WU So, where So is 


k=0 oc 
the set of points (41,:°°,Yw-1) such that (4:,---,yw2)eTo and 
bor S Yn-1 S Ao. 
Now let Yn-1) be a fixed point of Then 
Yn-1) € So for some fixed oe and some fixed k= N—2. We 


must find an ortho-normal sequence wu’,- - -,u-' for which (3) holds. By 

the induction hypothesis, there exist & real ortho-normal k& + 1-vectors 

- +, oe and N —2—k real ortho-normal N — 1—k-vectors - -, 
Kk+1 N-1-k 


yo ¥-2-k such that Yom = >» (v%™,) 1 m k, Yor m = 
i=1 4=1 


lim=N—2—k. We define N — 2 ortho-normal N-vectors u},-- -, 
as follows: yor (v%™,, 0, 0), 1 = m <= &; yom (0, 0, 


N 
i=1 
17S N—2. It wu ranges over the circle consisting of all real N-vectors 
orthogonal to u',- - -, uwN-*, then the values of the sum 
N 
(4) (Ui) 
i=1 
fill out some interval. If we take w= where 
(1. + +, Ue) is a real unit & + 1-vector orthogonal to v%,- - -,v%, then 
k+1 k 
(4) takes on the value yc, Also if we take u= (0,-- -,0, 
s=1 j=1 
Where Uy-1-x) is a real unit vector orthogonal to 
N N-2-k 
v71,- + +, then (4) takes on the value 2;— yo, =bo. There- 
i=k+2 
fore if bo S yy-1 S ao, there exists a real unit N-vector wN-? orthogonal to 
uN-2 such that yy. = > 2a. 
i=1 


Proof of Theorem 4. There is no loss of generality if we assume 
%=-+--2>2,. As mentioned above, condition (1c) is equivalent with (1e). 


0 
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1a 
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Furthermore it is easily verified that (le) is equivalent to the conditions 
n-1 


lemma, there exists an ortho-normal system ms -+,u™1 such that (3) holds. 
If wu” is a real unit vector orthogonal to u',---,wu", it follows that 


n-1 


= Sa;— yj;— yn. Thus if we set (ués)2, then Q = (Qy) 
i=1 i=1 j=l 
is the desired o.s. matrix. 

An immediate consequence of Theorem 4 is the following. 


THEOREM 5. A vector y can be the diagonal of a Hermitian matriz 


with eigenvalues tf and only if (1e) holds. 
Proof. A Hermitian matrix A has eigenvalues 2%,- - -,2@, if and only 
if A = UBU*, where B is the diagonal matrix with diagonal (21,- - -,2n), 


and U is a unitary matrix. Therefore y is the diagonal of such a matrix A 

if and only if y;= > | Ui; |?a;. Theorems 1 and 4 now yield the conclusion. 
j=1 

The neecssity of (1c) in Theorem 5 was pointed out by Schur [5], 


who showed that this leads to a simple proof of Hadamard’s inequality for 


determinants. 
It is interesting to compare Theorem 5 with the following theorem, 


proved in [6]. 
TuHroremM 6. Let 7,>0, 1Sisn. Then there exists a Hermitian 
k 
matric A with eigenvalues such that JI 1SkS2, 


where M;, is the determinant formed Jags the first k rows and columns of A, 


if and only if max = max and — Il 
i=1 


aeS", i=1 oeS", 


3. Remarks on the complex case. If y= Px, where z,y are complex 
n-vectors, and P is a d.s. matrix, then there is an —— to the implication 


(la) + (le). We can show that for each oe 8", Ey. lies in the convex 


hull of all the points 3 Zr, TES". More sini we have: 


THEOREM 7. If y= Pz, where x,y are complex n-vectors, and P 1s 4 


n 
d.s. matrix, and tf are any complex numbers, then > les m 
i=1 


t 

a 

tl 
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the convex hull of all the points > cixg,, «¢ R", where R" ts the set of all 


permutations of (1,---,). 
Proof. By Theorem 2, we may write where P%;—1 
a 
when and P*%;;—0 otherwise, and 4,20, Sag—1. Therefore 
a 


n n n 
i=1 


4,j=1 a 4,j=1 
n n n 


nv 


We do not know whether the converse of Theorem 7 is true. However, 

there cannot be any simple analogue of (le) — (1a). In fact there exists 
4 

a pair x,y of 4-vectors such that > ciy; lies in the convex hull of the points 


i=1 


4 
Dd cia, for all real c, - - -,c, and yet there exists no d.s. matrix P such 
i=l 


that y= Px. For example take y = (1/4 + 20/3, — 1/4 + 271/3,0,0) and 
t= (1,—1,1,1/3). Theorem 4 also breaks down in the complex case. In 
fact if y= (0, 1/2 + 1/2, —1/2 + 1/2), x= (1,— 1,7), then there is one 
and only one d.s. matrix P for which y = Pr, namely 


0 1). 
01 1 
However the equivalence (1d) <— (1a) still holds for complex vectors [7]. 


4. The diagonal of a rotation. 


THEOREM 8. A wotor (d;,: -,dn) is the diagonal of a rotation of 
order n if and only if it lies in the convex hull of those points (+ 1,---,+1) 
of which an even number (possibly 0) of coordinates are —1. 


Proof. If R is a rotation, then its eigenvalues are all of modulus 1. The 
complex eigenvalues occur in conjugate pairs and an even number of the real 
eigenvalues are —1. Therefore we may say that if n is odd, say n = 2m + 1, 
then the eigenvalues are of the form 1, exp(+ 76,),° - -,exp(= 10m), where 
056;< 27. If n—2m, the eigenvalues have the form exp (+ 
exp(=+ 1m). In what follows, the case n even is practically identical with the 
case n odd. Accordingly we will only treat the case n= 2m-+1. It is well 
known that there exists an orthogonal matrix U such that R = U’AU, where 


= 
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A= , and B= 
sin@ 


0 
Therefore 
Rj = + Uj? cos 0, + cos 0; 
+--+ + + + Ujom? cos Om Ujoms1 COS Om, 


By Theorem 4, it follows that (d,,- - -,dn) is the diagonal of a rotation 
with eigenvalues 1, exp (+ 16:),- - -,exp (+ @m) if and only if it lies in 
H (1, cos 6,, cos 6:,° *, COS Am, COS Om). Since is unchanged if we 
permute the coordinates of 2, we find that the set S of all diagonals of n-th 
order rotations has the form 


(5) S U 1=q,=:- ai, a; Am; Om). 


Let 7 be the convex hull of those points (+ 1,---,+1) of which an 
even number of coordinates are —1. To prove S C T, we need only show 
follows immediately from the formula 


= 
II (1 + x;a;)(1, 5 Um—-1s Um-1; Un; Am) 
j=l 


To show that T C S, we first note that each of the vertices of T is 
obviously in 8S, by (5). Therefore to complete the proof, we need only 
show that is convex. Suppose 2, y are distinct points of S, and 0 <A <1. 
By (5), there exist sequences {a;}, {6;} such that lZ2a,=---LDa,=—1, 


ve H (1, a1, Qy,° Su), and ye b;,° bm). 


By using the equivalence of (1b) and (1c) it is easy to show that 


At + (1—A)ye A(1, Aa, + (1—A)d,, 
Aa, + (1—A)di,* Adm + (1 —A)Bm, + (1—A)bm) CS. 


Th 
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THEOREM 9. Let x be a real n-vector, n=2. The statements 
are equivalent 


(6b) x lies in the convex hull of B, where B is the set of those points 
with coordinates all 0 or 1 for which the number of 0 coordinates is different 
from one. 
(6c) x is the diagonal of a rotation, and 420,117. 
(6d) x ts the diagonal of a d.s. matriz. 
P Proof. Given any set A of k& integers between 1 and n, there exists a 
1 permutation which leaves invariant the members of A, and only these, as 


long as kn—1. Therefore the set of diagonals of permutation matrices 
of order n is exactly B. Theorem 2 now shows that (6d) and (6b) are 
equivalent. We conclude the proof by deriving the implications (6b) — (6c) 
— (6a) — (6d). 


J (6b) — (6c): Since the set S of diagonals of rotations is convex, we 
S need only prove B C 8. If xe B, then some permutation of x has the form 
Yn), in which = 0, 1 andkF1. 
If k is even, then ye 8 by (5). (When n= 2m, (5) remains valid if we 
drop the 1 in A(1, a:, 41,°--,4m;m)). If & is odd, then & = 3, and using the 
equivalence of (1b) and (1c), it is easy to prove that ye H(z), where 2; = 1, 
+1, = — 4, and 2 otherwise. Again 


H(z) C § by (5). 


(6c) — (6a): Let a be the diagonal of a rotation of order n. If 
= 2m + 1, then by (5), xe H(1, a, 4m, Gm) for some sequence {a;} 
with Therefore by (le), 
2a, + + Gm Sn — 2+ 


The case n even is analogous. 


(6a) — (6d): Suppose z satisfies (6a). We may exclude the obvious 
case 4; = 1 foralli. Let y,—1—2, ThenOSy,;=1 and 
Let B = Dr and set 2;—2y:/8. Then 02,1 and Sane, By the 
OF. 


of (1b) and (1c), we have (2,,: 
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Therefore 2n) = where u‘/ is the n-vector whose i-th 
and j-th components are 1, and whose other components are 0, and a; = 0, 
> a;—1. Therefore = > where yij = %;/2. 
i<j i<j 

If we define Pj, and for 1\SiSjSn, we obtain a 
symmetric d.s. matrix P = (Pi;) with the desired property. 


THEOREM 10. For n= 2, a real vector (d,,---+,dn) ts the diagonal 
of an orthogonal matrix if and only if the vector x= (|d,|,---,| dn]) 
satisfies any one of the conditions of Theorem 9. 


Proof. Suppose @ satisfies (6c). If z is the diagonal of a rotation, 
then d is the diagonal of an orthogonal matrix, since an orthogonal matrix 
remains orthogonal when we multiply any row by —1. 


Now suppose d is the diagonal of an orthogonal matrix. Then at least 
one of the two vectors (|di|,---,| dia|,+ di, | dia |,---,|dn|) is the 
diagonal of a rotation. The proof of (6c) — (6a) did not use the hypothesis 
a;=0. Therefore we have which 
proves (6a). 


THEOREM 11. For n= 2, a complex vector (dy,: -,dn) is the diagonal 
of a unitary matrix if and only if the vector « = (| d,|,-- -.| dn |) satisfies 
any one of the conditions of Theorem 9. 


Proof. Suppose z satisfies (6c). Then z is the diagonal of a rotation R. 
If we multiply the 1-th row of R by arg d;, whenever d; 0, then R becomes 
a unitary matrix with diagonal (d,,- - -, dn). 


To prove the converse (the idea of this proof is due to J. von Neumann), 
let a= sup (| as d varies over all diagonals 
of unitary matrices of order n. In order to prove (6a), we need only show 
%—=n—2. Clearly a= n— 2, as is seen by considering the identity matrix. 
Now let U be a unitary matrix for which | U1. | + | Unin+|—| Um! 
=a. By multiplying each row by a proper factor, we may assume that 
U4 20,1StsSn. By the maximal property of U, we have Uy, = Unn, since 

n-1 


otherwise an interchange of rows of U would increase the value of >} 04 — Un 
é=1 


Thus it involves no loss of generality to assume U,,;=- - - = Un, = 0. 


n-2 
If then a= > Uy Sn—2 and (6a) is proved. Hence- 


forth assume Unin1>0. For 1S p<qSn, let V™ be the matrix of 


wne 
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order n for which Vp) = cos 0, V%pg = — = sin exp (iB), 
where 6, B are arbitrary real numbers, and V%,; = 8,; for 1 p, g, and 7 ¥ p, q. 


n-1 n-1 
If W=UV™, we have | Wi | —| SS Uiui— Unn. Therefore 
i=1 


(7) | Ui, cos — U2 sin exp (—iB)| + | Us: sin 6 exp (iB) + U2, cos 6 | 
S U1, + O22. 


The expansion of the left side of (7) in powers of 6 begins with 
(Ui; + Us2) + (exp (iB) (Us — 


Consequently the coefficient of 6 must vanish for all 8. This can occur only 
if Uz —U,.. Similarly Uj; for 1 SiSjSn—1. From this it 


follows that | Uin| =| Uni |, 1Sisn. 

If Unn > 0, then by multiplying U on the right by V’" and using the 
same reasoning as before, we find U,y,j—=—— Uin, 1 Sisn—1. Therefore, 
if we multiply the last row of U by —1, we obtain a symmetric unitary 


n 
matrix B for which B,,, << 0 and such that «= Sd By. 


i=1 

In case Uy, = 0, let & be an integer for which U;, 0, and let B be the 
matrix obtained by multiplying the n-th column of U by arg (Ui) and the 
n-th row of U by arg (Unx). Then By, = By, > 0. If Sjsn—tl, 
we have 


n-1 n-1 . 
BinBin = ByBui — BujBax. 
4=1 é=1 


Therefore B;, = B,;. Thus in both cases, we have obtained a symmetric 
n 

unitary matrix B for which = B;, and B,, = 0. But the eigenvalues of 
t=1 

a symmetric unitary matrix are all + 1. Also & is equal to the sum of the 

eigenvalues of B. Since B is not the identity matrix, B,, being non-positive, 

the trace of B can be at most n—2. The proof is complete. 

By using Theorem 11, we can prove many curious inequalities in the 
elements of a unitary matrix. For example, if we apply Theorem 11 to the 
product 


1 1 1 1 
© tf Phe 

1 


) 

l, 
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t 
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h 
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where U is an arbitrary unitary matrix of order 3, we find 


| — + + Uis | + | Vor — U22 + | + | + Usi— | S5. 


The bound is attained for 
1—2 —2 
U =4{ —2 1 2 
—2 —2 1 
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NATURALLY TOTALLY ORDERED COMMUTATIVE 
SEMIGROUPS.* 


By A. H. CLirrorp. 


By a semigroup we mean a set S endowed with an associative binary 
operation: a(bc) = (ab)c. The cancellation law is not assumed. The present 
paper deals exclusively with commutative semigroups, and the modifier “ com- 
mutative ” will be omitted except in the statements of the theorems. 

If a and b are elements of S, we say that a divides b, and write aS, 
if either a = b or there exists c in S such that ac=b. We write a< 6 if 
ab but ba. We shall call S naturally totaliy ordered (n.t.o.) if the 
division relation is a total (i.e. “linear,” or “simple”) ordering of S: for 
every pair of elements a,b of S exactly one of the relations a< 6, a=b, 
b<a holds. If § has an identity element, it is a linear holoid as defined 
by Klein-Barmen [4]. The class of n.t.o. semigroups is, of course, far 
more restricted than that of linearly ordered semigroups, as defined by O. 
Nakada [6]. 

An n.t. 0. semigroup S without identity element, and in which the can- 
cellation law holds, is the same thing as a system satisfying O. Hélder’s axioms 
I, I], IV, V, VI ([3], pp. 4-7). Axiom II requires that § be non-discrete 
in the sense of containing no least element. In [3], pp. 22-27, Holder shows 
that if such an S be archimedean, it can be embedded in the additive semi- 
group P of all positive real numbers. The chief object of the present paper 
is to establish the analogous result for an archimedean n. t. 0. semigroup in 
which the cancellation law does not hold. 

Before describing this result, let us return to the general (non-archi- 
medean) case. In [4], Klein-Barmen determined the structure of all linear 
holoids S such that either S is finite or else the order type of S is that of the 
natural numbers. In section 1 below, we give a reduction theorem for the 
general case, from which his results follow. Suppose we have a set of n. t. 0. 
semigroups S,, where the index 7 ranges over a totally ordered (“t.0.”) set I. 
The class sum S of all the S, becomes a semigroup if we define acb;=b,ac=b, 
for all age So, bee S;, when o <7 in I. It is readily seen that § is also 
n.t.o. We call S the ordinal sum of the t. 0. set of semigroups §;. Ann. t. 0. 


* Received September 16, 1953. 
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semigroup S which is not the ordinal sum of two or more subsemigroups will 
be called ordinally irreducible. In section 1 we show that every n. t. 0. semi- 


group is uniquely expressible as the ordinal sum of ordinally irreducible 
n. t. 0. semigroups. 

This theorem reduces the study of n. t. 0. semigroups to that of irreducible 
ones. The general case appears to be difficult, but one can treat the case in 
which every element a of S has finite index (i.e. a"** =a" for some positive 
integer n). We shall use the term segment to mean an ordinally irreducible, 
n. t.0., (commutative) semigroup, every element of which has finite index. 
A segment may also be characterized as an archimedean n.t. 0. semigroup 
in which the cancellation law does not hold (Remark 4 of section 5). In 
section 2 we show that every segment S has a greatest element, which we 
call the ultimate element of S, and denote by oo. If we prefer to think of 
as the zero element of S, then a segment may be characterized as an n. t. 0. 
semigroup with zero, in which every element is nilpotent. Every discrete 
segment is a finite cyclic semigroup. Every non-discrete segment S is either 
dense-in-itself, or is dense-in-itself except for the existence of an immediate 
predecessor oo* of oo, which we call the penultimate element of S. 


In section 3 we show that any segment S can be extended in a unique 
manner to an archimedean n. t. 0. semigroup 7 satisfying the cancellation law. 
More precisely, 7 contains an ideal 7, such that S is isomorphic with the 
factor semigroup 7/7’, in the sense of Rees, [8], p. 389. Using the embedding 
theorem of Holder mentioned above, we show in section 4 that S can be 
embedded in one of the two semigroups P, = P/[1], P*, — P/(1), where [1] 
and (1) denote the sets of real numbers 2 satisfying 21 and z>1, 
respectively.t These two semigroups are, to within isomorphism, the only 
complete non-discrete segments. P, is dense-in-itself, while P*, has the pen- 
ultimate element o* 1. 


~ 


The concluding section 5 contains half a dozen remarks, of which we 
mention one. In Remark 5 it is briefly indicated how a segment S can be 
completed directly (i.e. without the above extension to 7’) using Dedekind 
cuts. Besides being somewhat longer, this treatment suffers from one’s having 
constantly to guard against an interesting but annoying difficulty. If S isa 
non-discrete segment having a penultimate element o*, the function z = zy 
fails to be continuous along the curve z= o*. It is, however, lower semi- 
continuous, and the calculus of cuts goes through all right provided we reckon 
with (closed) lower classes. Thus we must reverse Hélder’s definition of a 


1 P, is an instance of what Klein-Barmen in [5] calls a “geschlossene Zahlbezirk.” 
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cut, [3], p. 20, and require that the upper class have no least element. The 
upper classes of S are just the v-ideals of S, as defined in [1], p. 600, and 
these are therefore not suitable for the completion of 8. 

Notation. A CB means A is a proper subset of B. 0 denotes the empty 
set. If A CB, B—A denotes the complement of A in B. If {A,;reTI} is 


a set of classes, |) A; denotes their class sum, and {) A, their intersection. 
rel tel 


1. Reduction of an n.t. 0. semigroup into the ordinal sum of ordinally 
irreducible n. t.0. semigroups. Let S be an n.t.o. semigroup. A subset a 
of S is an ideal of S if aca and be S imply abea. Thus a is an ideal of 8 
if and only if it is an upper class of S in the usual sense (aea anda< 2 
imply zea). The complement S—a of an upper class a is a lower class, 
and the decomposition of S into a and S —a will be called a cut of 8S. An 
ideal p of 8 is prime if S —p is a subsemigroup of S. If a and 6 are elements 
of S such that ab =a, we shall say that a absorbs 6. An ideal a of 8 will be 
called absorbent if aca and b¢a imply ab —a. 


Lemma 1.1. Let p be a non-empty, proper, absorbent prime ideal of an 
n.t.o. semigroup S. Then p and S —p are non-empty, n. t. 0. subsemigroups 
of 8, and S is the ordinal sum of S—p and p in that order. 


Proof. The subsemigroups p and S —p are t.o. by the division relation 
in 8. We must show that this ordering is natural in both of them. Let 
aand 6 be elements of » such that a <b relative to S. Then ac=—b for 
some ce S. Were cf) we would have } = ac =a, contrary to a<b. Hence 
cep, and a < b relative to p. Similarly, let a and b be elements of S—p 
such that ac = b for some ce 8S. Were ce p we would have b == ac —€¢, con- 
trary to bf p. Thus p and S —p are n.t.o. semigroups in their own right. 
Since they are disjoint, and § is their class sum, and every element of p 
absorbs every element of S — p, it is clear from the definition that S is their 
ordinal sum in the order S — }, p. 


LemMA 1.2. An n.t.o. semigroup is ordinally reducible if and only if 
tt contains a proper, non-empty, absorbent prime ideal. 


Proof. If an n.t.o. semigroup § is the ordinal sum of a set of n.t.o. 
semigroups S, (reZ), where J contains more than one element, and if R is 
any proper, non-empty, upper class of J, then p= lJ) S; is a proper, non- 


TeR 


empty, absorbent prime ideal of S. The converse follows from Lemma 1. 1. 


A prime ideal p of an n. t. 0. semigroup S will be called special if (1) p 
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is absorbent, and (2) there exists an absorbent prime ideal q such that q 2 p, 
and such that no absorbent prime ideal r exists such that qD rd p. q is 
evidently uniquely determined by p, and will be called the absorbent prime 
ideal associated with p. q itself need not be special. Condition (2) requires 
p =< 8, but we do not exclude the possibility 0. The null set is evidently 
a special prime ideal of S if and only if S contains a minimal non-empty 


absorbent prime ideal. 


Lemma 1.3. Let p be a special prime ideal of an n.t.o. semigroup 8, 
and let q be its associated absorbent prime ideal. Then q—p 1s a non-empty, 
ordinally irreducible, n. t.0. subsemigroup of S. 


Proof. Applying Lemma 1.1 first to S — p, and then to the absorbent 
prime ideal q—p of the n.t.o. semigroup S—p, we see that q—p is a 
(non-empty) n. t. 0. subsemigroup of S. Were it ordinally reducible, it would, 
by Lemma 1. 2, contain a proper, non-empty, absorbent prime ideal P. It is 
readily seen that PU p would be an absorbent prime ideal of S between p 
and q, which is impossible. 


THEOREM 1. Hvery naturally totally ordered, commutative semigroup 
is uniquely expressible as the ordinal sum of a totally ordered set of ordinally 
irreducible such semigroups. 


Proof. Let S be an n.t.o. semigroup. Let J be a set in one-to-one 
correspondence with the set of special prime ideals of S. Let p, be the special 
prime ideal of S corresponding to the element 7 of J. For o,7eJ, define 
o<7if po - p-; this is evidently a t.o. of J. Let q, be the absorbent prime 
ideal of S associated with p,, and let S;=q;—p,;. By Lemma 1. 3, each 
S; is a non-empty, ordinally irreducible, n. t. 0. subsemigroup of 8. Ifo <1, 
then po qr so that S; C S while S;Cq,;. Hence 8S, =0, 
and, since q; is absorbent, every element of S,; absorbs every element of So. 
The theorem will follow when we show that every element a of S belongs to 
some S,. 

Let p. be the class sum of all the absorbent prime ideals of S not con- 
taining a, including the null set. (pa —0 if and only if § contains a minimal 
non-empty absorbent prime ideal qo, and a@eqo.) _ is evidently a prime 
ideal. It is, moreover, absorbent. For suppose te pa, yf Pa. Then rep 
for some absorbent prime ideal p not containing a. Since y # p, we conclude 
Ty = 

Let qa be the intersection of all the absorbent prime ideals of S containing 
a. Qq is evidently a prime ideal. To see that it is also absorbent, let 2 a; 
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y¢Qa Then y 7 q for some absorbent prime ideal q of S containing a. Since 
aeqg, we conclude zy = 2. 

Evidently qa — Pa. There cannot exist an absorbent prime ideal r of S 
between p, and qa; for if acer then r> qa, and ifafr then ppr. Hence 
p, is a special prime ideal of S, qa is its associated absorbent prime ideal, and 
ae Ja— Pa: 

To prove the uniqueness, suppose that S is the ordinal sum of a t. 0. set 
of ordinally irreducible n.t.o. semigroups S’,(#eJ). For each let 
y= U S’g and q’,—p’.U S’a. Then p’q is a special prime ideal of 8, 

a 


and q’, is its associated absorbent prime ideal. For p’, and q’q are evidently 
absorbent prime ideals of S, and the irreducibility of S’, prevents the occur- 
rence of an absorbent prime ideal between them. Thus p’, =p, for some 
rel, and 8S’,—<S;. The mapping «—>7 of J into J must cover J since 


U 8, and is clearly one-to-one and order-preserving. 
aed 


2. Ordinally irreducible n. t. 0. semigroups; segments. 
The following lemma is essentially Klein-Barmen’s Satz 9 [4], p. 727. 


LeMMA 2.1. Let e be an idempotent element of an n. t. 0. semigroup S. 
Then the set p of all elements «x of S satisfying x > e ts an absorbent prime 
ideal of 8. 


Proof. is evidently an ideal. If a and 6b are elements of S —p, i.e. 
and be, then ab —e, abe S—h, and p is prime. Let aep, 
beS—p,i.eea>e, bSe. Then for some ce From Se we 
have eS eb e? e, whence eb Hence ab —ceb = ce —a, and p is 
absorbent. 


THEOREM 2. An ordinally irreducible, naturally totally ordered, commu- 
tative semigroup S can contain at most one idempotent element. If it does, 
then this lone idempotent is the greatest element of S, and absorbs every 
element of 


Proof. Suppose e is an idempotent element of S. By Lemma 1.2, 8 
cannot contain a non-empty, proper, absorbent prime ideal. The absorbent 
prime ideal p of Lemma 2. 1 is certainly proper, since e # p, and hence it must 


* Except for the (inconsequential) absence of an identity element in 8, Theorem 2 
asserts that S is a “fundamental holoid” in the terminology of Klein-Barmen [5]; 
8 is of the “first kind” if it contains the lone idempotent, and otherwise it is of the 
“second kind.” 


| 
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be empty. Hence ¢ is the greatest element of S. But then, if ae 8, ea=e, 
hence ea =e, and e absorbs every element a of S. 


If an n.t.o. semigroup S contains a greatest element, we shall call it 
the ultimate element of S, and denote it by o. Clearly o absorbs every 
element of S. Conversely, if an element of § absorbs every element of S (i.e. 
if it is a “ zero element ” of S) this element exceeds all other elements of 8. 

Let S be an n.t.o. semigroup,.and a any element of S. The subsemi- 
group C, of S consisting of the powers of a is called the cyclic subsemigroup 
of S generated by a. Either a" < a”*! for all positive integers n, or else 
there is an n such that a<a?<---<a"=a™'=—q"’?=---. In the 
latter case, as in [1], p. 597, we call n the index of a. C, is itself n.t.o., 
and if a has index n, a” is the ultimate element of Cy. C, is clearly ordinally 
irreducible. 

By the term segment we shall mean an ordinally irreducible, n. t. 0. semi- 
group, every element of which has finite index. Let S be a segment. Since 
S must have at least one idempotent, it has by Theorem 2 exactly one, and 
this is the ultimate element o of S. If ae, and n is the index of a, then 
a"== 0. If we prefer to regard o as the zero element of S, then every 
element is nilpotent. Conversely, if S is an n.t.o. semigroup with ultimate 
element oo, and if some finite power of each element of S is equal to », 
then § is a segment. 


Lemma 2.2. Jf aand b are elements of a segment S such that ab =}, 
then b = o. 


Proof. ab —b implies a") = b for every n. Taking n to be the index 
of a, o ao-b—b. 


LEMMA 2.3. If a, b, c are elements of a segment S such that ab = ac 
o, then b—c. 


Proof. If, say, b < ¢, then bd =c for some de 8. From ab = ac = abd 
and Lemma 2.2 we would conclude ab = o. 


LeMMA 2.4. Jf a and b are elements of a segment S such that 
a<b<oo, there exists a uniquely determined element b/a of S such that 
a(b/a) = b. 


Proof. Since a < b, there exists ce S such that ac—b. If there were 
another such element d, then from ac = b = ad, b 4, and Lemma 2. 3, we 
would conclude c = d. 
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An n.t.o. semigroup 8S is called discrete if it contains a least element. 


Lemma 2.5. A discrete segment is a finite cyclic semigroup. 


Proof. Let S be a discrete segment, and p its least element. Let a be 
an arbitrary element ~ oo of S. Since pa and a< p"= o, where n is 
the index of p, there is a unique positive integer r such that p’ [a < p™. 
Ifa then a= p’b for some be 8. Since b= p, a= p'b = p™, contrary 
toa Hence a= p’, and is the cyclic semigroup generated by p. 


If the ultimate element o of an n.t.o. semigroup S has an immediate 
predecessor, the latter will be called the penultimate element of S, and will 
be denoted by oo*. 


LemMMaA 2.6. Let a and b be elements of a non-discrete segment S such 
thata<b. Then either there is an element of S between a and b, or else S 
has a penultimate element o*, and a= 0*, b= o. 


Proof. Assume that no element of § exists between a and b. Since 
a<b, b= ac for someceS. Since is non-discrete, there exists d < c in 8. 
Now aSad=ac=b. aad since otherwise a= o by Lemma 2. 2, con- 
trary toa <b. Hence ad=ac=b. Were b ~ this would imply d—c 
by Lemma 2.3. Hence }) = , and a is its immediate predecessor. 


Lemma 2.6 shows that a non-discrete segment § is dense-in-itself except 
possibly at oo. We shall call S dense or dense-except-at-« -according to 
whether S does not or does have a penultimate element. 


3. Extension of a segment. Let S be a segment. Let & be the set of 


finite sequences « = {a;,° -,@m} of elements 0 of S. We define an 
evidently associative product in ® by juxtaposition: if = {a1,° - Gm} 
and = {b,,-- -,bn}, then = +, -,bn}. The mapping 
%—> is evidently a homomorphism of ® onto 8: II(«B) 


= (IIx) (IIB). We say that B is a refinement of « if there exist elements 
> +, Bm of ® such that B = Bm and (t=—1,- - -,m). 
The relation “is a refinement of ” is transitive and reflexive. 


Lemma 3.1. If are elements of a segment 8, and if 4 
and j are positive integers Sr such that 
i= j. 


Proof. Immediate from Lemma 2. 2. 


11 
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Lemma 3.2. Let « = dm}, &= {%1,- Zp}, and n= {91,° Yq} 
be elements of &. Then & and &y cannot both be refinements of «. 


Proof. If is a refinement of «, there exists such that a, = % 
If & is also a refinement of a, then either there exists j such that 
a, = or else & such that a, Since 
a, ~ co, Lemma 3. 1 implies that the first case must hold, andi—j. Letting 
a’ == +, dm} and & = +,2%p} we then see that and are 


both refinements of «’, and the result follows by induction on the length of a. 


Lemma 3.3. There exists a single-valued function associating with each 
pair of elements a, 8B of & an element y of ® such that exactly one of the 
following propositions holds: 


(1) y is a common refinement of a and B; 


(2) y is a refinement of «, and there exists 8e ® such that y8 1s a 
refinement of B; 


(3) y ts a refinement of B, and there exists Se@ such that yd is a 
refinement of a. 


Remark. y will be called the greatest common part (GCP) of « and 8B. 


Proof. Let «= {a,,---,@m}, B = bn}. Define c, = min(q, 
Recalling Lemma 2.4, we define elements «,, 6:, of ® as follows: 


(1) if a,—6,, then - -,am}, Bi =~ {bo,- Dn}; 
(3) if a, > then a, = {a,/c,, Gm}, Bi = da}. 


In each case, {c,}a, and {c,}8, are refinements of « and B, respectively. We 
now do exactly the same thing to the pair a, 8,: define c, to be the lesser of 
the initial terms of a, and £,, and define a2(f2) to be the sequence ob- 
tained by lopping c. off the front end of @,(8:), as described above. Since 
is a refinement @,, Co}a2 = is a refinement of {c,}a, 
hence of «; similarly {c,, c2}B. is a refinement of 8. Continuing this process, 
we construct a sequence of pairs of elements «;, 8; of &, together with a sequence 
of elements c; of S, such that (for each i—1,2,---) +, and 
+, are refinements of « and £ respectively. 

To avoid circumlocution, we adjoin to ® the empty sequence, for the 
present proof only. The above process must terminate in a finite number ? 
of steps, for at each step at least one of the pair a, B; is shortened. Then 4, 
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or Bp, or both, is the empty sequence. Let y= Cp}. y%p and 
are refinements of « and £, respectively. If both « and B, are empty, we are 
in Case (1). If just a, is empty, we are in Case (2) with =>. If just 
Bp is empty, we are in Case (3) with = @. 

We define «~f (a,Be®) if a and B have a common refinement 
(“e.r.”) in @. Clearly ~ is reflexive and symmetric. 


Lemma 3.4. The relation ~ is transitive. 


Proof. Let a, B, y be elements of ® such that 7 ~B and B~y. Let € 
be ac.r. of @ and B, and let » be ac.r. of B and y. Let & be the GCP 
(Lemma 3.3) of € and 7. If Case (1) holds, ie. £ is a c.r. of € and 1, 
then, since € and » are respective refinements of # and y, we conclude that 
tisac.r. of « and y, whence «~y. We proceed to show that the other 
cases are impossible. Suppose e. g. that Case (2) held, i.e. £ is a refinement 
of é, and £8 is a refinement of » for some Se. Since é and 7 are both 
refinements of 8, we conclude that £ and £8 are both refinements of 8, con- 
trary to Lemma 3.2. Case (3) is, of course, similar. 

Thus ~ is an equivalence relation in ®. If «~ then clearly ay ~ By 
and ya~ yB for every ye ®, i.e. ~ is a congruence relation, and the equi- 
valence classes under ~ constitute a semigroup 7 when multiplied in the usual 
way. It will be convenient, however, to regard the elements a, 8,- - - of ® as 
themselves elements of 7, with ~ as the equality relation. 


Lemma 3.5. TJ is commutative. 


Proof. If a,be S— {oo}, then {a,b} = {b,a}. This is trivial if a=), 
and otherwise we may suppose a>b. Then abe for some ce 8. Since 
bc =cbh =a, we see that {b,c,b} is a c.r. of {a,b} and {b,a}. We may 
express this as {a}{b} — {b}{a}, and the lemma follows from the observation 
that the one-term sequences generate T’. 


Lemma 3.6. The cancellation law holds in T. 


Proof. Let «= {a,,- -,a@m}, B= {bi,- +, bn}, y= +, Cp} be 


elements of 7 such that aB—~ay. Let be ac.r. of 
and ay. Then we have correspondingly a, = 2,72: and a, = 
with 1Si,jSr. By Lemma 3.1, But then  — {2,1,° - -,2%,} is 
ac.r. of a8 and a’y, where a = {ds,- - -, am}, and we may use induction 


on the length of «. 


LemMA 3.7. T does not contain elements a, 8 such that aB ~ a. 


¥ 
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Proof. By induction on the length of «, we can show, by an argument 
similar to that used in the proof of Lemma 3. 2, that «8 and @ cannot have 
ac.r. 


Corotuary. If a and B are associate elements of T («= B and B= a) 
then a~B. 


Lemma 3.8. T ts naturally totally ordered. 


Proof. Let «, Be T, and let y be their GCP (Lemma 3.3). In Case (1), 
a—~B. In Case (2), y~a and yi ~ 8, whence a ~ i.e. in T. 
In Case (3), Ba. The lemma then follows from the corollary to 
Lemma 3. 7. 


Lemma 3.9. (1) If then Ilo = If. 
(2) If then a~f®. 
(3) If a~ By then Ma = (Ily). 


Proof. (1) If éisac.r. of @ and then (2) Let 
c=—TIla—TIIf. Then a and £ are refinements of {c}, whence «~ {c} ~8. 
(3) Evidently (By) = (IIB) (My), and Ila —II(By) by (1). 


An n. t.o. semigroup S is called archimedean if, given any two elements 
a, be 8, there exists a positive integer r such that a’ = bD. 


Lemma 3.10. T ts archimedean. 


Proof. First let a and b be elements ~ oo of 8. Let r be the index 
of a, and let § = {a,a,- - -,a} with r terms each equal to a. Let B = {b}. 
Then §< Bin T. For otherwise ~ for some y T, and = (ITy) (18) 
by Lemma 3.9 (3). This is impossible, since IB = while 
=o. Since §= {a}", we conclude that {a}" = {b}. 

Now let @= {b,,---,0n}. From the foregoing, there exist n posi- 
tive integers such that {a} = {bi}, (t—1,---,n). Letting 
we have {a}? =. Finally, if a= {a,,- -,dm} 
and B= {b,,---.bn}, we have {a,}"=£B for some r. But «= {a}, 
whence a” = £. 


TuHroreM 3. Let S be an ordinally irreducible, naturally totally ordered, 
commutative semigroup, every element of which has finite index. Then there 
exists an archimedean, naturally totally ordered, commutative semigroup T, 
without identity element, and in which the cancellation law holds, such that: 
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(A) T contains an ideal T,, such that the factor semigroup T/T, in 
the sense of Rees, is isomorphic with 8S; and 


(B) T is generated by the complement T —T. of Tx in T. 


T is uniquely determined to within an isomorphism leaving fixed the elements 
of S— {co} tf we identify S— {wo} with T—T.z. 


Proof. We take for T the semigroup constructed above. We have already 
seen that 7’ is an archimedean, n. t.0., commutative semigroup in which the 
cancellation law holds (Lemmas 3.5, 3.6, 3.8, and 3.10). The set T, of 
all ae T such that Ila — o is clearly an ideal of 7. Lemma 3.9 shows 
that the mapping «— Ila, T,,—, is an isomorphism between 7/7’, and 8S. 
For we may regard 7/7, as composed of the elements of 7 —T,, together 
with the set T,, regarded as a single all-absorbing element. From {a:,---, dm} 
= {a,}- it is clear that T—T, generates T. 


To prove the uniqueness, let 7’ be another semigroup with all the 
properties stated in the theorem for T, and identify T’ — T’, with S — {0}. 
We shall establish an isomorphism of 7 onto 7’ whereby each one-term 
sequence {a} in 7’ maps into the element a4 o of S. We denote the product 
of two elements a’, of by Then if a,be 8 we have aob —ab 
provided ab oo. 

Each sequence « = *,@m} of ® determines an element a’ = II’a 
=0,00,°9:--Od» of T’. If € is a refinement of a, we easily see that 
= II’a, and consequently a~ implies — Hence is a 
single-valued mapping of 7 into 7’, in fact onto 7’ by (B), and is evidently 
ahomomorphism. If «8 in T, then Ia’. For if, say, a > B, then 
4= By for some ye 7, and I’« = (Tl’B) (Il’y). Since cancellation holds in 
T’, and T’ has no identity element, this implies I’«=41I’B. Hence «— Il’« 
is an isomorphism of 7’ onto 7’, and evidently {a} —> II’{a} =a for every 
ae — {oo}. 


4. Completion of a segment. Let S be an n.t.o. subsemigroup of an 
n.t.0. semigroup =. We shall say that § is everywhere dense (“e.d.”) in & 
if (1) § contains the greatest and the least element of 3, if such exist, and 
(2) for every pair a, 8 of distinct elements of 3, either there is an element 
of § between them, or else there is no element of S between them,-and in 
the latter event both a and B belong to 8. (The reason for this last condition 
is given in Remark 6 of section 5.) 

Let = be a non-discrete segment. Then & has no least element, and any 
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subsemigroup of = contains the ultimate element o of 3. From Lemma 2. 6 


we conclude: 


Lemma 4.1. An n.t.o. subsemigroup S of a non-discrete segment & is 
everywhere dense in & if and only if there is an element of S between every 
pair of distinct elements of except the pair where is the pen- 
ultimate element of & (if such exists), and in this event, 0* (as well as o) 


belongs to 8. 


If = is a discrete segment (hence a finite cyclic semigroup), then an 
n.t.o. subsemigroup of & is e.d. in = if and only if it coincides with 3. 

An n. t. 0. semigroup & is called (conditionally) complete if every subset 
of S bounded from above has a least upper bound (LUB). This is equi- 
valent to the dual property, that every subset of = bounded from below has 
a greatest lower bound (GLB). Of course, every subset of a segment is 
bounded from above (by o). 

In the following theorem, P, and P*, denote the semigroups defined in 


the introduction. 


TueEorEM 4. Let S be a non-discrete, ordinally irreducible, naturally 
totally ordered, commutative semigroup, every element of which has finite 


index. Then there exists a uniquely determined isomorphism of S onto an 
everywhere dense subsemigroup of either P, or P*, (but not both), the former 
if S is dense, and the latter if S is dense-except-at-o. 


Proof. Let T be the semigroup described in Theorem 3. By Holder’s 
theorem, mentioned in the introduction, there exists an isomorphism y of T 
onto an e.d. subsemigroup yZ of the additive semigroup P of all positive 
real numbers. According to Theorem 3, S is isomorphic with the factor 
semigroup 7/7, in the sense of Rees. For convenience, we shall identify 
S’ = S—{o} with T—T,. Let »A be the GLB in P of yT,. Setting 
6, where @ is the automorphism of P (Ee P), ¢ is an iso- 
morphism of 7 onto an e.d. subsemigroup $7 of P such that the GLB in P 
of #7. is the real number 1. ¢8’ is an e. d. subset of the open interval (0, 1), 
and contains 1 if and only if 8’ has a greatest element, and the latter must then 
be the penultimate element o* of 8. By Lemma 4.1, $8 is an e. d. subsemi- 
group of P, or P*,, according to whether S does not or does have a penultimate 
element. (Of course we extend ¢ so that it maps the ultimate element of 8 
onto that of P, or P*,.) 

Since an e.d. subsemigroup of a segment = has a penultimate element 
if and only if = has one, it is clear that S cannot be isomorphic with e. d. 
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subsemigroups of both P, and P*,. Suppose ¢, and ¢, are two isomorphisms 
of S onto e.d. subsemigroups S,; = ¢,8, of (or P*,). Then 
$e: is an isomorphism of 8, onto S.. As a continuous function defined 
on an e. d. subset of the interval (0,1), ¢o¢:7? can be extended in the usual 
way to a continuous function y on P, (or P*,) to itself, and y is evidently 
an automorphism. But the only such is the identity mapping. This can be 
seen, for example, by noting that each element of the e.d. subsemigroup of 
P, (or P*,) generated by the numbers 2-# (t= 1, 2,- + -) must be left fixed. 


Hence = ¢2. 


Coronary. P, and P*, are, to within isomorphism, the only complete 
non-discrete segments. 


Proof. If S is a complete, non-discrete segment, and ¢ is an isomorphism 
of § onto an e.d. subsemigroup #8 of P, or P*,, then ¢8 is also complete, 
and so must coincide with P, or P*, (as the case may be). 


5. Concluding remarks. 


Remark 1. Theorem 1 still holds if we delete the word “ naturally.” 
The proof becomes almost trivial when freed of the necessity of proving that 
such and such a t. o. is natural. The notions of “ upper class ” and “ ideal ” 
are no longer synonymous, and we must use the former. The result is of 
limited value because the corresponding condition of ordinal irreducibility 
is far weaker than before. It is satisfied by Examples 3, 5, 8, 10, 12, and 14 
in [7]. 


Remark 2. We may obtain a partial reduction of an n. t. 0. semigroup 
8 by means of the set # of idempotent elements of 8S. JZ is an n.t.o. sub- 
semigroup of S with ef = max(e,f) for all e,fe#. Let J be the set of all 
cuts of the t.o. set #, as defined in section 1. For each aeJ, let LZ, and R, 
be the lower and upper classes, respectively, of a Define S, to be the set of 
all ze § satisfying (i) ee Lg implies e <2, and (ii) fe R, implies x Sf. 
Then it is readily seen from Lemma 2.1 that each S, is a (possibly empty) 
n. t.0. subsemigroup of S, and that § is their ordinal sum. In fact, Lemma 
2.1 states that the set p, of all ee § satisfying z > e is an absorbent prime 
ideal. So also are Pea= U py and qa= f) pe; and Sa—qa—Yq. Each 


8, contains at most one idempotent element e,, and if so, e, is the ultimate 


element of S, (cf. Theorem 2). If every element of S has finite index, then 
the S, are ordinally irreducible. 
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Remark 3. Let S be an n.t.o. semigroup in which the cancellation 
law holds. If ab = 6 for some a,beS, we see at once that a is the identity 
element of S. Hence 8 is ordinally irreducible unless it has an identity 
element i, and in the latter event it is the ordinal sum of the irreducible 
n.t.o. semigroups {1} and S— {i}. If we order the quotient group G of 
S in the usual way (cf. [6], Theorem 5, p. 182), G becomes a t. 0. commu- 
tative group, and § consists of all elements =1 (or >1), where 17 is the 
identity element of G. Conversely, the set of all non-negative (or of all 
positive) elements of a t.o. abelian group is an n.t.o. semigroup with 
cancellation. 


Remark 4. We show here that an archimedean n.t.o. semigroup S, in 
which the cancellation law does not hold, 1s a segment. For then a, 8, ¢ 
exist in S such that ab =acandb<c. bd—=—c for some de S, and ab = abd. 
Writing we have Since for some n, 
we have f —d"f=f?, whence f?—=f. For any aeS we have f*=a for 
some n, hence f =a, i.e. f is the ultimate element o of S. For any ac8 
we also have a” = o for some n, i. e. a" = o0, and we have observed in section 
2 that such a semigroup is a segment. 


If the cancellation law holds, then a complete n. t. 0. semigroup is archi- 
medean. This is not so if the cancellation law does not hold. For example, 
let S be the ordinal sum of a conditionally complete t.o. set of complete 
segments 8, (reJZ), such that 8, is discrete if + has no immediate predecessor 
in I. In fact this is the most general complete n. t. 0. semigroup if we allow 
the cancellation law to hold in S,, where » is the greatest element in J (if 
such exists). 


Remark 5. The completion of a segment S can be carried out by means 
of cuts in the t.o. set S. The closure a’ of a lower class a of § is defined as 
follows: if a has a LUB a, in S, then a’ =a VU {ap}, and otherwise a’ =a. 
a is closed if a’ =a. Let © be the set of closed lower classes of §. Define 
aobh= (ab)’, where a, be G, and ab denotes the set of all ab with aea, bed. 
S becomes thereby a complete n.t.o. semigroup. For aeS, the set (a) of 
all <a is called the principal closed lower class defined by a. The set G 
of all such is an e.d. subsemigroup of © isomorphic with 8. © is unique 
in the following sense: if } is any complete segment containing § as an e. d. 
subsegment, there exists an isomorphism between % and G whereby a (a) 
for allaeS. The corollary to Theorem 4 is shown by constructing an e. d. 
subsemigroup Q of the given complete segment S which is isomorphic with 
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the e. d. subsemigroup Q’ of P, or P*, (depending on the type of 8) generated 
by the numbers (¢—1,2,-- -), and then using the above uniqueness 
theorem. 

All this sounds very simple and quite the usual thing, but the dual 
procedure breaks down. The binary operation o defined dually for closed 
upper classes is no longer associative. For example, take the subsemigroup @’ 
of P*, mentioned above. Let (a) denote the set of all ze Q’ satisfying r= a 
(even if ag Q’). Take a= (1/6), b= (1/3), c= (1/2). Then 


acb—= (1/2), oc— w}; 
boc= (5/6), ao (hoc) = {oo}. 


Proof of the associativity of o depends upon the following lemma: a segment 
S is lower semi-continuous in the sense that if A and B are subsets of S with 
LUB’s ay and bo, respectively, then abo is the LUB of AB. The dual of this 
lemma breaks down when S has a penultimate element o*, and dob) = *: 
let A be the set of all a >a), and B that of all b > bo; then ab = o for 
every ae A, be B, so that the LUB of AB is 2(aobo). 

On the other hand, multiplication of v-ideals is associative ([1], p. 601). 
The difficulty here is that every upper class of a segment S is a v-ideal, and 
consequently the semigroup of all v-ideals is not n.t.o. For if a’ = (a), 
and a = a’ — {a}, there cannot exist a v-ideal b such that a’ = ab. 


Remark 6. Let S be an n.t.o. subsemigroup of a complete n. t.o. 
semigroup &. We shall say that = is a normal completion of 8 if every 
element of & is the GLB of a subset of S. This is concordant with the 
definition of “normal ideal arithmetic” given in [1], p. 605. Condition 
(N,), that a divides 6 relative to S (a,beS8) if and only if a divides b 
relative to &, is satisfied because § is n.t.o. Condition (Nz) requires that 
every element of % be the GCD (= GLB) of a finite subset of 8. But since 
the divisor chain condition was assumed in [1] to hold in 3%, the qualification 
“finite” could have been omitted. 

If } is a normal completion of a segment S, then & is easily seen to be 
a segment also. S is discrete if and only if = is discrete, and then S —=3; 
otherwise S and & are either both dense, or both dense-except-at-oo. If S§ is 
a subsemigroup of a complete segment %, then = is a normal completion of J 
if and only if § is e.d. in 3. Theorem 4 implies that every segment possesses 
a unique normal completion. 

If 3 is a normal completion of a segment S, then every element of & is 
also the LUB of a subset of S. The following example shows that the converse 
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is not true. Let 3 = P*,, and let S be the subsemigroup of P*, consisting of 
all rational multiples of 24, together with o. Every element of P*, is the 
LUB of a subset of S, but 1 (= o*) is not the GLB of a subset of 8S. 
S would be e.d. in P*, were it not that 1¢8. This example explains why 
we required o*e8 in the definition of “everywhere dense,” in section 4. 
If we regarded S as e. d. in P*,, then (since S itself is dense) the uniqueness 
clause in Theorem 4 would not hold. 


Remark 7 (added January 7, 1954). In a recent conversation, Professor 
A. M. Gleason told me that the results of the present paper are closely related 
to certain work of his own, namely the developments of section 2 of reference 
[2], especially Lemma 3. In fact, although he did not publish it, Professor 
Gleason was aware of both Theorem 1 and the Corollary to Theorem 4 in the 
case § is compact and connected in its order topology. 
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SIMPLE BASIC SETS OF POLYNOMIALS.* 


By M. N. Mixwait. 


1. Introduction. ‘The subject of basic sets of polynomials was introduced 
by J. M. Whittaker. For such a set {p,(z)} any polynomial and in particular 
the polynomial 2" admits a unique finite representation of the form: 


2" == = 


i.e. {wiz}, {pi} are the basic sets of operators and coefficients, respectively, 
associated with the set {p,(z) }. 

Thus any set of polynomials {p,(z)}, where p»(z) is of degree n, is basic 
and is said to be simple. 

Associated with any function f(z) regular about the origin, there is a 
basic series S,IInf(0)p,(z), where the operator II, is given by: 


The series is said to be effective on | z| =, if it represents in and on 
|z |= R every function which is regular in and on | z | = R. 

The reciprocal set of polynomials of {pn(z)} is denoted by {pn(z)}* and 
is defined by: = Sipnidjrij2). 

The product set {w,(z)} = {pn(z)}{qn(z)} is defined by the relations: 
= (Z) = where wij = Sapingnj. = 

The quotient set {tn(2z)} = {pn(z)}{qn(z)}* is defined accordingly by: 

According to Theorem 1 [1], the order w of the basic set of polynomials 
is defined as follows: = lim sup,(n log log maxy; | | | pi; |. 


2. In this paper, I investgate the order and the effectiveness of the 
reciprocal, product, and quotient sets of some class of basic sets of polynomials. 
My treatment deals with non-normalised simple sets, so that my results imply 
the results in [2, 3, 4, 5 and 6] for normalised simple sets. This investigation 
has been made possible by applying some of my results [1]. 


* Received April 15, 1953. 
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3. I prove in this paper the following theorems: 


THEOREM 1. Let {p,(z)} be a simple basic set of polynomials of order w 
and such that | pun| O(n") (h, finite) ; then its reciprocal set is of order 
such that do S o* S 2e. 


THEOREM 2. Let {pn(z)}, {Qn(z)} be two simple-basie sets of poly- 
nomials such that | dun | = O(n") (h, finite), and of orders wy, wg respectively 
and wp > 2wg=0. Then the product set {un(z)} = {pn(z)}{qn(z)} ts of 
order wy, such that: wp + 2og = wy = wp — 2ag. 


THEOREM 3. Let {pn(z)}, {Qn(z)} be two simple basic sets of poly- 
nomials such that | pPun| =| Qnn|—=O(n") (h, finite) and of orders wp, w, 
respectively, and wg >2w,20. Then the order of the product set 


{un(z)} = {pn(z) }{qn(Z)} ts oy such that: wp + 20g = wy = 40g — oy. 


THEOREM 4. Let {p,(z)} be a simple basic set of polynomials such that 
| Pun | = O(n") (h, finite) and CS1. Then both the set and tts reciprocal 
set are effective in the same region. 


THEOREM 5. Let {pn(z)}, {qn(z)} be two simple basic sets of poly- 
nomials such that | dun |= O(n") (h, finite). Then both the product set 


{pa(z)}{Gn(z)} and the quotient set sisiieimaiatidel are effective on 
|2|—R, where R= max(bp; bg). 


Proof of Theorem 1. We have 
o* = lim sup, (n log n)-* log maxi; | pri | | i; |, by Theorem 1 of [1] 


= lim supp (n log n)~* log | | Bi(n) j(n)). 


= lim supa{(n log n)~ log | pnicny| + log n)~ log | 
Since | Pan | == O(n"), we have: 
lim (n log n)-* log | an | == 0 and lim (n log n)-* log | py 
Therefore, we can write 


o* = lim sup, (n log n)-* log | aan | | Pnscny) | 
+ lim sup, (n log n)~* log | | 


Then o* 2. In a similar way, we get oS o* + = So 
do S o* S 
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Proof of Theorem 2. (a) Since both sets are simple, dg —=adg—=1. By 
Theorem 3 of [1], we get 


pq lim sup» (n log n)-* log maXm | Anm | 


+ wp + | lim sup, (n log n)-* log max; | qn; | , 
but 
lim sup, (n log n)-* log maxm | Anm | = lim sup, (nm log n)-* log | Anmin) | 


= lim sup, (n log n)-* log | Anminy| | Yminyminy| » 
because | gun | == O(n") for all n. So, by Theorem 1 of [1], 


lim sup, (7 log n)-* log maxm | Anm | S og. 


Also 
| lim sup, (n log n)-* log max; | gn; | | = | lim sup, (1 log n)-* log| gnj(n)| 
= | lim sup, (n log n)-* log | Ann | | Qnsiny| | 
because, since the set is simple, | Ann | =| nn |. Then 


| lim sup, (n log n)-? log max; | gn; | | S og, 
by Theorem 1 of [1]. Hence wpg S wp + 2ag. 
(b) Since 
{un(z)} = {Pn(2)} {qn(z)}, it is seen that {pn(z)} = {un(z)}{gn(z)}*. 
Since dy = a, = 1, by Theorem 4 of [1], we get 
Wp Slim supn (n log n)-* log maXm | nm | 
++ wy + | lim sup, (n log n)-* log max; | An; | | - 
But 
lim sup, (n log n)-* log maXm | Yam | = lim sup, (n log n)~ log | dnmin)| 
— lim (n log log | Ann | | @nminy | S og, 
by Théeorem-1 of [1]. And 
lim sup, (n log n)~* log max; | Anj | = lim supp, (n log n)~* log | Ansiny| 
= lim supn (n log n)-* log | Angim| | S og 
by Theorem 1 of [1]. Hence 
Op Sq + oy og = 20q Ou, 


1.€., wy = wp — 20g. Thus the theorem is proved. 
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Proof of Theorem 3. (a) It has been proved in the last theorem that 


(b) Since {tn(z)} = {pn(z)}{qn(z)}, then {qn(z)} = 
Hence from (a), we obtain wg S w,* + 20, But from Theorem 1, we know 
that wp* = so that wg S + i. €., wy = 40g— wy. Thus 


Proof of Theorem 4. By Theorem 7 of [1], {pn(z)} is effective on 
|2|—R, where R = b, and {p,(z)}* is effective on | z| = R where R = b* 
if C*=1. Thus, the proof of Theorem 4 will be given in two parts: 
(a) C*=1, (b) =D. 

(a) C* =supam [lim supp{sup; | | | 


= supa(n) [lim supna{| Pnicny! | 
Since | pan | = O(n") and the set is simple, then 


C* = supainy [lim supn{{| aan | | 
Xx {{| Ti(n)a(n) | | Pacnyacny | 1, 


But, as n 00, lim i(n)/n = lim d(n)/n = 1. Hence C* = C-C S 1 (because 
C=1). 


(b) b* = supsn [lim sup, {supi; | pni | | | 
= SuPs(ny [lim supn{| Pnicny| | | 
Since, as n—>0o, lim i(n)/n = 1, there are two sub-cases: 
(1) lim i(n)/n = 1, (2) lim i(n)/n < 1. 
(1) If lim «(n)/n =1, we have 
b* sups(n) [lim supn{{| aan | | 
X lim supn{{| aicnyscny| | Pecnyscny | 
so that b* = 1 X b (because C = 1). 
(2) If lim i(n)/n <1, we get 
b* supsin [lim supa{{| aan | | | }/ 
X lim supn{{] | | }2/ } 


( 
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so that b* does not exceed 
SUPs(n) [lim supn{b(s(n)) } /(n-a(n)) {b(s(n)) } 


Then 6. Similarly it can be proved that b= Hence b* — and 
the result of the theorem follows. 


Proof of Theorem 5. The proof of this theorem is in two parts: 
(a) Coq S1, (b) = max (bp; dq). 
(a) = supa(n [lim supp {supmiz | Xnm | | Tmi | | Pie | | Qad(n) [}3/"] 
= supa(n [lim supn{| Anminy| | tmenyicny| | Pecmyacny | 
Since | gan | == O(n") and the set is simple, then 
Coq = SUPa(ny [lim supp { {| An | | | 
X {{] wminyicny| | Picny | EL] | |} ] 
(b) = sups(n) [lim supn{| Anmeny| | || | | 
Then 
(5.1) Bpg = SUps(ny [lim supn{| Anmeny| | | 
X lim supn{] wmcnyicny | | Pecnys (ny | 
X lim supn{] | | 
since m(n) = = a(n) = s(n) and, as 
1 = lim m(n)/n lim t(n)/n = lim a2(n)/n. 


There are many cases for the value of by¢ due to the possible values of m(n), 
i(n), a(n) and s(n). I shall be satisfied to treat here two cases because the 
other cases can be treated in a similar way. 


(1) limm(n)/n=1, 1>lima(m)/n > lim s(n)/n, 
then (5. 1) gives 
= SUPs(n) [Lim supa{{| Anmcny| | | 
X lim supn{{| | | } (em 
X lim supp {{| Avinyecny| | | 
= sups(n) [lim supa{1 {bp(s(n)) 
x {bq(s(n)) } 
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(because C, = 1). Hence by, = max(b,; bg). 
(2) 1> lim m(n)/n = lim z(n)/n > lim s(n)/n, then (5.1) gives 
bog = SUPs(ny [lim supn{{{| Anmcny| | | 
X {{| | | 
supscny [lim supn{ {bqg(s(n)) } 
X 1X 


(because C,=1). Hence b,, = max(b,;b,). In a similar way, we find 
that in every case 
(5. 2) Dog S max (bp; Dg). 


Since always (i.e. in every possible case) 
= SUPs(ny [lim supa{| Anm | | | | Pie | | Geacn | 
for all values of m,1 and x, consider the two cases (i) bp = bg or (ii) bg > dy. 
(i) If bp = bg, then by is not less than 
SUPs(n) [lim supa{supi{| Ann | | ani | | | | 
Then = dp. 
(ii) If b, > bp, then by, is not less than 
suPs(n) [lim supn{sups{| Ans | | | | | | 
Then = dg. 
Combining the results of (i) and (ii), we get 
(5. 3) Bog = max (bp; bg). 


Hence in view of (5.2) and (5.3), we have bpg = max(bpy;b,). Then by 
Theorem 9 of [1], the product set {pn»(z)}{qn(z)} is effective on | z| =A, 
where R = max (bp; bg). 

In a similar way, it can be proved that the quotient set {pn(z)}{qn(z)}* 
is effective on | z|—R, where R > max (by; bg). 
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LENGTHS OF PRIME IDEAL CHAINS.* 


By I. 8S. CoHEN. 


Introduction. Given a commutative ring R and a prime ideal p in R, 
we may consider the maximal lengths of chains of prime ideals contained in p 
and containing p. These give rise to the height and depth of p (usually called 
the rank and dimension) ; see section 1 for the precise definitions. If 8 is a 
ring containing R, and P a prime ideal in S contracting to p, then we are 
concerned with the relation between the heights and depths of P and p. For 
Noetherian rings, with which this paper is principally concerned, partial 
answers are given in Theorems 1 and 4, the latter generalizing some of the 
results on dimension in Krull [8]. The results on height are motivated by 
some well known theorems on valuations of algebraic function fields; these 
are stated as corollaries to Theorems 1 and 3. These results will moreover be 
applied in a subsequent paper to the theory of valuations of local rings. 


1. Elementary properties. The term ring will always mean a commu- 
tative ring with identity. A proper ideal in a ring R is an ideal different 
from (0) and R. The term prime ideal shall always refer to one which is 
+~ R, though not necessarily + (0). 


Definition. Let R be a ring. A prime ideal chain (or simply a chain) 
in R of length k is a collection of k + 1 prime ideals in R, ordered by inclusion. 


In other words, a chain of length & is a collection of prime ideals 
Pos Pis* * *+ Pe, Which, when suitably numbered, satisfy 


(1) 


The chain is said to begin with po and to end with p,. The chain is said to 
be maximal if no further prime ideal can be inserted between p;_, and p; for 
t=1,2,:--,k. 


* Received August 9, 1953. 

11f A and B are sets, we write A CB to denote that A is a subset of B and A<B 
to denote that A is a proper subset of B. We point out explicitly that in the definition 
of chain the possibility p, = (0) is not excluded. 
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Definition. Let p be a prime ideal in a ring R. The depth* (height) 
of p is the maximum length over all chains in R beginning (ending) with p. 


The depth and height of p will be denoted by d(p) and h(p) respectively ; 
either may be infinite. Clearly d(p) —0 if and only if p is maximal; 
h(p) =0 if and only if p contains no other prime ideal of #. It is known 
that in a Noetherian ring h(p) <0, but d(p) may =o [7, section 8].° 
If R is a local ring, then also d(p) <<. 

If p is of height &, then there exists a chain (1) with p = px; this chain 
is necessarily maximal, and pp, can contain no other prime ideal of R, so that 
if R is an integral domain, then po = (0). Not every maximal chain ending 
with p need be of length &, as can be shown by simple examples of rings with 
zero divisors. Whether this unpleasant phenomenon may occur even when R& 
is an integral domain is an unsolved problem, which we shall refer to as 
“the maximal chain problem.” 

Still assuming that p is of height & and that p= p,; in (1), we see 
clearly that p; is of height 7, i—0,1,---,k. Also p/p; is of height k —1 
in the ring R/p;. If p and q are prime ideals in R such that q C p, then 
clearly 


(2) h(q) +h(p/q) Sh(p). 


One might hope for equality when F is an integral domain, but this question 
is equivalent to the maximal chain problem mentioned above. 

Let M be a subset of R closed under multiplication and containing no 
zero divisor of R, and let Ry denote, as usual, the quotient ring. If p is a 
prime ideal not meeting J, then elementary properties of the quotient ring 
show that h(p) =h(Ryp). 

Observe that from (2) we may conclude that if g < p, then h(q) < h(p). 

Entirely similar statements can be made about d(p). 


Definition. Let R and S be rings, RC 8S, P a prime ideal in S, 
p=POR. Then the R-dimension of P, to be denoted by dim P/R, is 
dt(S/P)/(R/p), where “dt” means degree of transcendence. 


If R is a field, then dim P/R =dt(S/P)/R; this definition is well 
known for prime ideals in a polynomial ring over a field. 


* With regard to terminology, see the remarks following Lemma 1. 

* Numbers in brackets refer to the bibliography at the end of the paper. 

*When we write RCS, we always assume that R and S have the same identity 
element. 
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If Q is a prime ideal in S such that Q C P, and if we place g=Q NR, 
then 


(3) dim P/R = dim (P/Q)/(R/q). 


For inasmuch as P/Q 9 R/q = p/q, the right side is the degree of trans- 
cendence of (S/Q)/(P/Q) over (R/q)/(p/q). Since these two rings are 
isomorphic respectively to S/P and R/p, the right side equals dim P/R. 
To see the behavior of the dimension under quotient ring formation, 
let us assume S is an integral domain. Let M and WN be multiplicatively 


Sy 
Ry 8 
R 


closed subsets of R and S respectively, not containing 0, such that M C N; 
then Ry C Sy. If, now, P is a prime ideal in S not meeting N, then 


For inasmuch as SyP N Ry = Ryp, the left side is the transcendence degree 
of Sy/SyP over Ry/Ryp. Now Sy/SyP can be regarded in a natural way 
as containing S/P, and indeed it is then a quotient ring of S/P, namely with 
respect to the set of the residue classes of the elements of N. Similarly, 
Ry/Rup can be regarded as a quotient ring of R/p. Hence the degree of 
transcendence just referred to equals dt(S/P)/(R/p) = dim P/R, and (4) 
is proved. 

The dimension is additive—i.e., if R, is a subring of R, then dim P/R, 
= dim P/R + dim p/R,. 

The elementary properties of the dimension and of the height here 
referred to, as well as the analogous properties of the depth, will in the 
sequel be freely used without further comment. 

In an important elementary case, the dimension and height are closely 
related : 


Lemma 1. Let Rand S be integral domains, R C 8, S finitely generated 
over R, Q a prime ideal in S such that QN R= (0). Thenh(Q) + dim Q/R 
= dt S/R. 


Proof. When R is a field, this is a classical result [4, p. 43]. In the 
general case, let M denote the set of nonzero elements of R, so that Ry C Su, 
Ry is the quotient field of R, and Sy is finitely generated over Ry. Then by 
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the classical result, h(SuQ) + dim SyQ/Ry = dt Sy/Ry. Since dt Su/Ru 
= dt S/R, the lemma follows. 

If R is a field, then it is also well known [loc. cit.| that d(Q) = dim Q/R. 
For this reason the term dimension has generally been used for what we have 
called the depth. However the two are not in general equal (e. g., the maximal 
ideal of the local ring of a subvariety of positive dimension of an algebraic 
variety), and we have therefore preferred to use different terms. What we 
have called the height has been generally called the rank or the dimension 
defect, the latter term finding some justification in the formula proved in 
the lemma. 


2. The defect. We now introduce some notations which will be held 
fived for the. remainder of the paper. R and S are integral domains such 
that R C S, P is prime in 8, 


p= POR, R = R/p, = §/P, 


K, L, K, L are the respective quotient fields of R, S, R, 8. The capital letter 
‘\ will always denote an indeterminate. 

We are concerned here with the proof, under certain conditions, of the 
following relation : 


(5) h(P) + dt S/R Sh(p) + dt S/R. 


We assume the right side to be finite since otherwise there is nothing to 
prove. It is then useful to study systematically the difference’ of the two 
sides of (5). 


Definition. If h(p) and dt S/R are finite, then the defect of P over R, 
to be denoted by def P/R, is given by 


def P/R = dt S/R + [h(p) —h(P)]—dim P/R. 


The defect of P over FR is thus an integer or —-oo, and indeed any such 
value is possible for suitable R, S and. P. We are to prove that under certain 
conditions it is non-negative. 


LemMa 2. (a) The defect is additive. I.¢., suppose Ro is an integral 
domain contained in R, and let po=pOR; then if h(po), h(p), and 
dt 8/R, are finite, then def P/Ry = def P/R + def p/R). 


(b) Let M and N be multiplicatively closed subsets of R and S 
respectively such that MCN and N does not meet P. If either one of 
def P/R, def SyP/Ry is defined, then so is the other, and they are equal. 
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(c) Let Q be a prime ideal of S contained in P, let QN R= (0), 
and let S be finitely generated over R. If h(p) ts finite, then def P/R 
< def (P/Q)/R, with equality holding if and only if h(P) = h(Q) + (P/Q). 


(d) If S is integrally dependent on R and h(p) is finite, then 
def P/R = 0; and if R is integrally closed, then def P/R = 0. 


Proof. (a) It follows from the hypothesis of (a) that all three defects 
are defined. The additivity then follows from the fact that all three terms 
of which the defect is a sum are additive. 


(b) This follows from facts previously stated about dimension and 
height. 


(c) Note that R can be imbedded in natural fashion into S/Q. When 
this is done, then (P/Q)N R=p. Now dim P/R = dim (P/Q)/R. From 
Lemma 1, 


dt S/R —h(P) =h(Q) + dim Q/R — h(P) 
= dt(S/Q)/R + [h(Q) —h(P)] S at(8/Q)/R —h(P/Q), 


the last inequality because h(P) =h(Q) +h(P/Q). Thus (c) is proved. 


(d) In this case, dtS/R=dtS/R=0. That h(P) Sh(p) follows 
from the fact that the contraction to R of any chain ending in P yields a 
chain ending in p [5, Theorem 2], [1, Theorem 4]. If R is integrally closed, 
then any chain of, say, m prime ideals ending in p gives rise to a chain of 
length m ending in P. Namely, take P,P. By the “going down” 
theorem [5, Theorem 6], [1, Theorem 5], P» contains a prime ideal P+ 
lying over pm-1; and we may obviously continue in this fashion. Both parts 
of (d) were proved by Krull [5, Theorems 7,8]. It seems to be still an 
unsolved question whether inequality may actually hold when PF is not 
integrally closed. 


Before proceeding to the proof of (5), we prove two well known lemmas 
on polynomial rings. 


Lemma 3. Let R’=R[X], let p’, and p’. be two prime ideals in PR’ 
having the same contraction p in R, with p’, < p's, Then p’; = R’p. 


Proof. We note that R’/R’p is a polynomial ring over R/p. In R’/F’p, 
p':/R’p and p’./R’p are prime ideals contracting in R/p to (0). Since we are 
trying to prove that p’,/R’p = (0), it follows that it is sufficient to prove 
the lemma under the additional hypothesis that p= (0). If K is the 
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quotient field of R, then K[X] is the quotient ring of R[X] with respect 
to the multiplicatively closed set of nonzero elements of R. Since neither 
p’, nor p’, meets this set, their extensions to K[X] are prime ideals, wit: 
K[X]p’: < K[X]p’2 < K[X]. Since K isa field, this requires K[X |p’, = (6}. 


Jf < < p’s are prime ideals in R[X], they cannot 
all have the same contraction in R. 


Lemma 4. Let R be a Noetherian integral domain, R’ = R[X]. If p 
is a minimal prime ideal in R (i. e., the only prime ideal properly contained 
in pis (0)), then R’p is minimal in R’. 


Proof. If ye p, y+ 0, then p is an isolated prime ideal belonging to Ry, 
hence R’p is an isolated prime ideal belonging to R’y. Since R’ is Noetherian, 
the principal ideal theorem |7%, Theorem 7*] implies that R’p is minimal in R’. 


3. The main theorems on height. We are now in a position to prove 
the basic inequality (5) for the case where R is Noetherian. In order better 
to see just where the Noetherian character comes in, we state our theorem 
for a (potentially) wider class of rings. Whether this class is actually larger 
will not be discussed in the present paper. 


TuEoreM 1. Let D be a collection of integral domains having the 
following properties: 


(a) If AeD and if ¢ is a prime ideal in R, then A/qe QD. 
(b) If Ae QD, then A[X]eD. 
(c) If Ae D and if q is a minimal prime ideal in A, then A[X]q is 


a minimal prime in ALY]. 
Under these conditions, if Re D, then the inequality (5) holds.* 


Remarks. The collection of all Noetherian integral domains satisfies 
(a), (b), (ce), the last by Lemma 4. If Ae QD then (b) implies that any 
polynomial ring over A in a finite number of indeterminates is also in D, 
and hence by (a) any integral domain finitely generated over A is in 9D. 
Observe that it is not required that S be in D. 


Proof. We assume that h{p) and dt S/R are finite and shall show that 
def P/R > 0. 


* We point out once more that the notation introduced at the beginning of section 2 
is to be in force for the remainder of the paper. 
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Let m be any positive integer = h(P) (which may a priori be infinite), 
Then there exists in S a chain 


(6) < Pm =P. 
For t= 1,2,:--,m, let Pi, Pi. If n is any non-negative integer 
= dt S/R, then there exist in S elements 9,,- - -, J, algebraically independent 


over R. Let y; be a representative of 7; in S, and place 


Thus 
h(p’) = m, dt n. 


Assuming for the moment that we have proved that def p’/R= 0, we have 
h(p’) + dt R/R=h(p) + dt R’/R, hence m+ nS h(p) + dt S/R for all 
m Sh(P) and alln = dt S/R. And this proves the theorem. So it is enough 
to prove that def p’/R = 0, and since RF’ is finitely generated over R we may 
henceforth confine ourselves to proving the theorem in the case where S is 
finitely generated over R. 

Consider first the special case where S is a polynomial ring in one 
variable, S = R[X]. We proceed by induction on h(p). If h(p) = 0, then 
p= (0), and we must show that A(P) + dtS/RS1. If P = (0), this is 
trivial; and if P 4 (0) then S is algebraic over R, and by Lemma 3, Cor., | 
P is minimal, so that h(P) = 1. 

Now suppose h=h(p) >0. If m is any positive integer = h(P), 
then there exists in S a chain of the type (6) above. It is enough to show 
that m+ dtS/R=h+1. If m =1, this is clear, so assume m= 2. Again 
by the corollary to Lemma 3, P, M FR is a nonzero ideal, necessarily of finite 
height. So it contains a minimal prime ideal po, and obviously py C p. By 
hypothesis (c), Spo is minimal in S, hence Spo P». Passing to S/Sp, and 
denoting images in this ring by an asterisk, we have h(P*) =m—l. 
The ideal p* = p/p, is prime in R* and clearly h(p*) Sh —1, 
P* M R* = p*. Moreover S* is a polynomial ring over R* and R* e D by 
hypothesis (a). Hence the induction hypothesis gives h(P*) + dim P*/R* 
= h(p*) +1. But dim P*/R* = dim P/R, so 


m—1-+ dim P/R = (h—1) +1, 


and this completes the proof for the case S = R[X]. 
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By induction it now follows for S a polynomial ring in any finite number 
of variables in view of the additivity of the defect. Induction is permitted 
by hypothesis (b). 

Finally, let S be any integral domain finitely generated over R. Then S 
is the homomorphic image in a natural way of a polynomial ring 8, over R. 
Let Q, be the kernel and let P, be the complete inverse image of P. Then 


By the polynomial case, def P;/R20. By Lemma 2(c), we have def P/R 
> def P,/R. The proof is thus complete. 

Lemma 2(c), which we have just used, depends on Lemma 1, which in 
turn depends on the dimension theory of polynomial rings over a field. We 
could avoid the use of this dimension theory as follows. To prove Theorem 1 
for 8 finitely generated over R, we may assume S = R[ a], since additivity 
of the defect permits induction. If x is transcendental we proceed as in the 
proof above. If « is algebraic, then S is the homomorphic image of a poly- 
nomial ring S,-=R| |. in one variable. With the notation of the pre- 
ceding paragraph, we have that h(P,) +dimP,/RSh(p)+1. But 
h(P,;) 2h(P) +1 and dim P;/R = dim P/R, whence the desired result. 
We have, in effect, used the fact that Lemma 1 is trivial if S = R[X] and 


() is minimal. 


CoroLLtary 1. Let K be an algebraic function field of n variables over 
a field F. If v is a valuation of K over F of rank m and dimension d, then 
m+t+d=n. 


Proof. If S is the valuation ring of v and P is the maximal ideal of 8, 
then it is well known that h(P) =m. Taking R= F, we get the stated 
result, since by definition, d = dt S/F. 


Corottary 2. Jf Re D and S is a polynomial ring over R in a finite 


number of indeterminates, then equality holds in (5). 


Proof. We may assume one indeterminate, S = R[X], dtS/R=1. If 
P = Sp, then dt S/R = 1, so the theorem reads h(P) = h(p). The opposite 
inequality is obtained by extending to S any chain in R ending in p. If 
Sp < P, then dt S/R = 0, so the theorem reads h(P) Sh(p) +1. But also 
h(P) =h(Sp) +1—h(p) +1. 


One might attempt to extend Cor. 2 to the case where § is any domain 
finitely generated over R. If we proceed as in the last paragraph of the 
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proof of the theorem, success would require that we have def P/R = det P,/R. 
Referring to the proof of Lemma 2(c), this equality is equivalent to 
h(P) =h(Q) +A(P/Q) (in the notation of that lemma). So this exten- 
sion of Cor. 2 would be equivalent to a special case of the maximal chain 
problem. Even this special case appears to be unsolved. 

But it is easy to see that the conclusion of Cor. 2 may fail if S is not 
finitely generated over R. For let R be a polynomial ring over a field F in 
n variables X,,---,Xn, let p= Let S be the valuation 
ring of a rank 1, 0-dimensional valuation of F(X,,:--.Xn) whose center © 
in R is at p. If P is the maximal ideal of S, then dt S/R = dt S/R =0, 
h(P) =1, h(p) =n, def P/R =n—1. This shows, by the way, that the 
defect may be any non-negative integer. 

Condition (c) of Theorem 1 asserts, in effect, that the conclusion of fF 
the theorem is true in a special case, namely S a polynomial ring over R 
in one variable and h(p) =1. So we are able to conclude inequality (5) 
in general from its validity in a special case. Along the same lines we have 


THEOREM 2. Let D be a collection of integral domains satisfying con- 
ditions (a) and (b) of Theorem 1, as well as } 


(c’) If Re QD, if p is minimal in R, and if S has the same quotient 
field as R, then def P/R = 0. 


Under these conditions, inequality (5) holds generally, provided only 
that Re D. 


Proof. It is necessary only to prove that (c) of Theorem 1 holds. So 
let Ae D, g be minimal in A, and suppose A[X] q is not minimal, so that 
there exists a prime ideal Q such that (0) <Q < A[X]q. Let S=A[X]/Q, 
and let P denote the image of A[X]q in this ring. By virtue of Lemma 3, 
there is no prime ideal between Q and A[X]g, hence h(P) —1. Since 
QOA=(0), S contains A and is algebraic over it. Also dim P/A 
= dim A[X]q/A =—1, so def P/A =—1. But this easily leads to a contra- 
diction. For let xz be the residue class of X in S. Let a be the leading 
coefficient of a nonzero polynomial in Q, so az is integrally dependent on A. 
Let R=A[ar], so ReD, and S=R[x] has same quotient field as R#. 
Let p =P NR, so pM A and hence p is minimal [1, Theorem 4]. By 
(c’), def P/R=0. Since RP is integrally dependent on A, def p/A 20 by 
Lemma 2(d). Hence def P/A = 0, contradiction. 


4, Additional results on height. We now give an example to show 
that the conclusion of Theorem 1 is not true for an arbitrary domain A. 
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For suppose it were and suppose # had a single proper prime ideal p. Then 
every valuation ring of the quotient field of R which contains R would have 
to be of rank 1. So if R were integrally closed, so that it was the inter- 
section of valuation rings S, then all of these would be of rank 1, hence & 
would be fully integrally closed (i.e, if ae K and if be R, b0, and 
ba®e R for all n, then ae R). So we need an integrally closed ring with a 
single proper prime ideal and which is not fully integrally closed. A simple 
example of such a ring has been given by Krull [6, p. 670]. 

The construction of this ring is as follows. Let F bea field, K = F(X, Y) 
a rational function field in two variables. Let FR consist of all rational 
functions f(X, Y) such that f(0, Y) is finite and in F. The set of all fe R 
such that f(0, 1°) = 0 is clearly a maximal ideal p containing all non-units, 
and R/p=F. Moreover p is minimal—indeed every proper ideal A is 
primary for p. For let g(X,1’) be a nonzero element of A and write 
g(X, ¥) = (X,Y), with g,(0,¥) 40; if f(x, ¥) ep, then f(X, Y) 
=Af,(X,¥) with f,(0,¥Y) finite, so f''/g =Xf,""/giep CR, hence 
f"eRg CA. If f(X,¥)eK and is integrally dependent on R, then 
{(0,¥) is finite; moreover f(0,Y) would be integrally dependent on F, 
hence in F, so f(X,Y)eR. Thus PR is integrally closed. But not fully, 
for Y ¢ R, whereas YY" ec R for all n. 

An explicit case of a P with def P/R < 0 is obtained by taking S to be 
the set of all f(X, Y) with f(0, Y) =0, or else f(0, Y) finite and 40, and 
f(0,0) finite. This is the valuation ring for the rank 2 valuation of K 
obtained by composing the valuation of F(X, ¥Y) defined by the Y-axis 
with the valuation of #'(1°) defined by the origin. If P is the maximal ideal 
of S, then S/P =F, so dt S/R =0; since h(P) = 2, h(p) =1, we have 
def P/R = —1. 


THEOREM 3. Let R be Noetherian and let L be finitely generated 
over K. If def P/R =—0, then L is finitely generated over K. Furthermore, 
if m=h(P) and if 


(6) (0) =P, 


is any chain of length m ending in P, then the ascending chain condition 
holds for those ideals between P,, and P, which are primary for P; 
(t—=1,2,---,m). 


Proof. Observe that h(p) is finite since R is Noetherian; also dt S/R 
=dtZ/K. Hence def P/RP is indeed defined. 
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Let ane Pi, Pin, t=1,2,---,m (ef. proof of Theorem 1). If 
n= dt S/R, let 91,° - -, Jn be elements of S algebraically independent over R. 
let y; be a representative of y; in 8. Since L is finitely generated over K, 
we may write L = K(%,° Now place 


=POR, pi =Pi NP, = K’ = quotient field of Now R’ is 
clearly finitely generated over R, hence likewise K’ over K, so it is enough 
to prove that Z is finitely generated over K’. In other words, it is enough 
to prove the theorem for S and Rf’. These two rings satisfy the hypotheses 
of our theorem: Ff’ is surely Noetherian. From 0 = def P/R = def P/R’ 
+ def p’/R, we get def P/R’ =0. S and RF’ both have L as quotient field 
since 2,¢R’. Also dt S/R —0, since D Gn]. Hence h(p’) 
=h(P)=m. Finally, we have (0) =p. p'm=p’, since 

Writing R for R’, we are reduced to proving our theorem under the 
following additional hypotheses: dt S/R dt S/R=0, the contractions 
P;- R are distinct (the P; being as in (6)). We then have h(p) = h(P) =m, 
and (0) =po< pm=p, where p—=P,N 

We prove the theorem under these additional hypotheses by inductions 
on m. For m=O, the theorem is trivial. Now assume the theorem has 
been proved for height m—1. Let R’ and S’ be the quotient rings of R 
and S with respect to p, and P, respectively. Since p, is a minimal prime 
ideal, R’ has only one proper prime ideal, namely R’p,, hence R’ satisfies the 
restricted minimum condition [2, Theorem 1]. From [2, Theorem 38], it 
now follows that S’/S’P, is finite-dimensional over R’/R’p,, and that the 
ascending chain condition holds in S’.6 The latter implies the conclusion 
of the theorem for the primary ideals belonging to P,. 

Passing to the rings S/P, and R/p,, we have the ideals P/P, and p/p: 
both clearly of height m—1. The quotient fields of these rings are 9’/S’P,; 
and R’/R’p,. Furthermore (S/P,)/(P/P:) is algebraic over (R/p,)/(p/p1); 
since these are isomorphic respectively to S and R. From all this, it follows 
that S/P,, R/p,, and P/P, satisfy not only the hypotheses of our theorem, 
but also the additional hypotheses stated in the second preceding paragraph. 
By induction hypothesis, Z is finitely generated over K. For the same reason 
the ascending chain condition holds for the ideals between P;_,/P, and P;/P: 


*The quoted theorem is proved under the assumption that S is integral over R. 
But a trifling modification in the proof suffices to cover the case where S is merely 
algebraic over R. 


| 
| 


PRIME IDEAL CHAINS. 665 


which are primary for P;/P, (i=2,---,m). This implies the desired 
result for the ideals between P;_, and P;. 
From the theorem we can derive a well known result on valuations. 


CoroLttary. Let K be an algebraic function field of n variables over a 
field F. If v is a valuation of K over F of rank m and dimension d, and tf 
m+d=n, then v is discrete and its residue field is finitely generated over F. 


Proof. (Cf. Theorem 1, Cor. 1.) Take S§ to be the valuation ring of v, 
P its maximal ideal, RF. S contains a unique chain of type (6) above. 
The theorem states that the quotient ring of S/P;. with respect to Pi/Pi. 
is Noetherian, t= 1,2,:--,m. So each of the m rank 1 valuations of 
which v is composed is discrete, so v is discrete. 


5. Depth of prime ideals. We retain in this section, as throughout, 
the notation introduced in section 2. 

We shall obtain here Krull’s results on depth of prime ideals [8, Theorems 
13, 14, 16, 18] in somewhat stronger form, and with rather a simpler proof. 

Lemma 5. Let R be any ring, R’=R[X]. If ¢ is prime in R, then 
d(R’q) = d(q) +1. 

Proof. If m= d(q), then there exists a chain g = qo < qi < Gm 
in R. Then a chain in RF’ is given by 


k’q qo RG R’ dm + R’X, 
so that d(R’q) = m + 1, whence the lemma. 


THerorEM 4. Let D be a collection of integral domains satisfying con- 
ditions (a), (b), (ec) of Theorem 1. Then, if Re D, 


(7) d(P) <d(p) + dim P/R. 
If S is finitely generated over R, and if R is Noetherian, then the right side 


exceeds the left by at most unity. If, in addition, R is a Hilbert’ ring, 
then equality holds. 


Proof. Since the quantities involved in (7) are properties of the residue 
class rings S/P and R/p, we may assume without loss of generality that 
P= (0). So we must prove that 
(8) d(S-0) Sd(R-0) + dt S/R. 


*A Hilbert ring is a ring in which every prime ideal is the intersection of maximal 
ideals [3]. Krull [8] uses the term Jacobson ring. 
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Now if P, is prime in S, then by our main result on. height (Theorem 1), 
A(P,) Sh(PiN Rk) + + dt S/R. 


Since d(S-0) is the least upper bound of all h(P:), (8) follows. It is easy 
to avoid explicit use of the theorem on heights; for this see the remarks 
at the end of the proof. : 

Now let S be finitely generated over R and R be Noetherian (note that 
our reduction to the case P = (0) has not disturbed these conditions). Then 
we must show that d(S-0) =d(R-0) + dtS/R—1. Let t=dt S/R, let 
* *,2 be a transcendence basis for S over R, let R’ = 2]. 
By Lemma 5, d(R’-0) = d(R-0) + # (indeed, equality holds by virtue of 
the first part of the theorem) ; hence we must prove that 


(9) d(S-0) = d(R’-0) —1, 


this with the information that § is finitely generated and algebraic over PR’. 
So let S=R’[y:,- - +, ym]. Since is algebraic over R’, it is a zero of a 
certain polynomial in R’[X]; we may assume that each polynomial has the 
same leading coefficient a, and of course a0. If M is the multiplicative 
system consisting of the non-negative powers of a, then each y; is integral 
over R’y, hence Sy, which equals Ym], is integral over R’y. 
If d is any integer = d(R’-0), then we show that d(S-0) =d—1. 


Sar 
R’y 
R’ 
We have a chain 
(10) R-0< py < pai < pa- 
We claim that the chain may be selected so that a# pj, 1=1,2,- --,d—1. 
This is an old lemma of Krull, but we sketch a proof by induction on d. 
Namely let g:,q2,: -- be the minimal prime divisors of R’-a; they are 


minimal in R’, hence cannot contain p.. So there exists an element b in pz 
not in any of the q;; p. must contain a minimal prime divisor q of R’:), 
q< po, afq. In R’/q, we have a chain (0) < p:/q<:-+< pa/q, and the 
residue class of a is not zero. Applying induction hypothesis here and 
mapping back to R’, we find our claim substantiated. 

Thus in (10), we assume af¢ p;,, i=1,---,d—1. By the elementary 
properties of quotient rings we have in Ry a chain 


(11) 0< < Pups < 
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Since Sy is integrally dependent on R’y, the “ going up ” theorem [1, Theorem 
3] gives us in Sy a chain Sy:0 <P’, <:-:< P’a, lying over the chain 
(11). Contracting to S, we get a chain 


and clearly P; R’ =p; i=1,:--,d—1. Hence d(S-0) =d—1. 

Now if R were originally a Hilbert ring, then it remains so after the 
preliminary passage to the case p= (0) [3, Theorem 2], and then FR’ is like- 
wise a Hilbert ring [8, Theorem 3]. Since 8 is finitely generated over RP’, 
and R’ is Hilbert, every maximal ideal in S contracts to a maximal ideal in 
R’ [8, Theorem 5].* It follows that in the prime ideal chain (12), Pa+ 
is not maximal, hence d(S:0) 2d. If R’-0 has finite depth, we take 
d=d(R’-0) and have d(S-0) 2d(R’-0) 2d(R-0) + #, as was to be 
shown. This completes the proof of Theorem 4. 

If the assumption that # is Hilbert is omitted, then proper inequality 
may well hold in (7), even though R is Noetherian [8, Theorem 4]. For 
there will exist in R[ A] a maximal ideal P whose contraction to R is a non- 
maximal ideal p [3, Theorem 5]. Thus d(P) —0, d(p) >0. 

As indicated above, the proof of inequality (7) can be carried through 
without specific appeal to Theorem 1. First: 


Lemma 6. Let D beas in Theorem 4,ReD, R’=R[X]. If p, and pz 
are two prime ideals in R such that p, < po and there are no intermediate 
prime ideals, then there are likewise none between R’p, and R’po. 


Proof. Since R’/R’p, is a polynomial ring over R/p,, it is sufficient 
to prove the lemma in the case p, = (0). But then it follows from condition 


(c) on 


We now prove (7), first for the case that S is a polynomial ring over R 
in one variable and P = Sp; we may assume d(p) is finite. Then dim P/R = 1 
and we must show that d(P) Sd(p) +1. If d(p) =0, then p is maximal, 
8/Sp is a polynomial ring over the field R/p, so d(Sp) 1. We then 
proceed by induction on d(p). Suppose that we have in § a chain 
Sp<P,<P.<:-++-<P,; we shall prove thatr=d(p) +1. By Lemma 3, 
Cor., P21 R cannot equal p, so it contains some prime ideal p’ such that 
P <p’ and such that there is no prime ideal between p and p’. By Lemma 6, 
there is none between Sp and Sp’, so Sp’ P,. The chain Sp’ < P,<::: 


*The quoted theorem refers to polynomial rings, but the result is then obviously 
valid also for homomorphic images of such rings. 
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< P, shows that d(Sp’) = r—1; and obviously d(p’) = d(p) —1. But the 
induction hypothesis shows that d(Sp’) = d(p’) + 1, whence r S d(p) +1. 

Still continuing with the case S=R[{X], suppose P > Sp. Then 
dim P/R = 0, d(P) < d(Sp) Sd(p) +1, d(P) Sd(p). 

Induction now takes care of the case where S is a polynomial ring over 
R in several variables, and residue class ring formation suffices to handle the 
ease § finitely generated over R. 

To conclude the proof for the most general case, we once more reduce 
to the case P= (0). Let m be any positive integer = d(S-0) (which may 
be infinite). Then there exists a chain S-0=P)< Py < <P». 
Let Pi1, i=1,--+-,m (ef. proof of Theorem 1), and let 
R’ = R[2z.,---,2%m]. Since R’ is finitely generated over R, we have 
d(R’-0) =d(R-0) + dt R’/R. But clearly m S d(R’- 0), hence 


m<d(R-0) =dt S/R 


for all m= d(S-0). Thus (7) is proved for 8-0. 


MASSACHUSETTS INSTITUTE OF TECHNOLOGY. 


BIBLIOGRAPHY. 


[1] I. S. Cohen and A. Seidenberg, “ Prime ideals and integral dependence,” Bulletin 
of the American Mathematical Society, vol. 52 (1946), pp. 252-261. 

[2] I. 8. Cohen, “ Commutative rings with restricted minimum condition,” Duke Mathe- 
matical Journal, vol. 17 (1950), pp. 27-42. 

[3] Oscar Goldman, “Hilbert rings and the Hilbert Nullstellensatz,” Mathematische 
Zeitschrift, vol. 54 (1951), pp. 136-140. 

[4] W. Krull, Idealtheorie, Ergebnisse der Mathematik und Ihrer Grenzgebiete, vol. 4, 
no. 4, Berlin, 1935. 


[5] ————, “Zum Dimensionsbegriff der Idealtheorie,” Mathematische Zeitschrift, 
vol. 42 (1937), pp. 745-766. 

[6] ———, “v-Ideale und vollstindig ganz abgeschlossene Integritatsbereiche,” Mathe- 
matische Zeitschrift, vol. 41 (1936), pp. 665-679. 

[7] ———, “ Dimensionstheorie in Stellenringen,”’ Journal fiir die Reine und Ange- 
wandte Mathematik, vol. 179 (1938), pp. 204-226. 

[8] ————, “Jacobsonsche Ringe, Hilbertscher Nullstellensatz, Dimensionstheorie,” 


Mathematische Zeitschrift, vol. 54 (1951), pp. 354-387. 


| 

| 

| 


ON THE STRUCTURE OF A CERTAIN CLASS OF 
KAEHLER VARIETIES.* * 


By JuN-IcHI IGusA. 


It; is a classical theorem that simply connected Riemann surfaces can be 
divided into three types. In the case of several complex variables the circum- 
stance is not so simple, but there is a generalization in the following sense. 
Let V" be a compact complex variety carrying a Kihlerian metric with constant 
curvature; then its universal covering variety V” has only three complex 
structures. In fact V” is the complex projective space 8”, the interior of a 
unit sphere AK” or the complex vector space C” according as the curvature is 
positive, negative or zero. They are known respectively as elliptic, hyperbolic 
or parabolic cases. We conclude that V™ itself coincides with 8” in the elliptic 
case, Which gives a characterization of the complex projective space in terms 
of global differential geometry. Moreover in the hyperbolic case V" can be 
considered as a non-singular projective model. Finally in the parabolic case 
V" has a finite covering variety of complex torus 7” such that the types of 
extensions are finite for each n. We note also that V” are minimal models 


” seems to be closely 


in the hyperbolic and parabolic cases. The “ minimality 
connected with the “ uniformizability ” of compact complex varieties. Finally 
we shall give an example of parabolic variety with the first irregularity equal 


to zero, which can only happen for n= 3. 


1. Local consideration. Let guge(z,2) (1S2,Bn) be n? power 
series in 2n variables 7g (1S forming a positive definite 
Hermitian matrix in some neighborhood of the origin. In this case we can 
take ds? — 2 Jose (% 2) (dz*dz®) as a metric ground form in local Her- 

a, 


mitian geometry. We shall assume thereby that this metric is without 
torsion, or, in an apparently stronger form ([7], pp. 175-176). that gag+(z, 2) 
can be derived from a local potential U(z,Z) in the form 


Jape (2, Z) = (2, Z) (lsa,B=n). 


* Received May 25, 1953; revised February 11, 1954. 
* This work was partially supported by a research project at Harvard University, 
sponsored by the Office of Ordnance Research, U. S. Army. 
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tensor in our geometry satisfies the relation 
Rapeyse + + = 0 (154, p,y,8 Sn) 


with some constant k. This definition is a natural counterpart of the real 
case (cf. [2], Theorem 5) and the following theorem is proved by Bochner 
({2], Theorem 6). 


Lemma 1. There exists an analytic coordinate transformation after 
which the local potential has the normal form 


(1/k)-log(1++%- 0) 

The metric is called elliptic, hyperbolic or parabolic according as k is 
positive, negative or zero. If we multiply the absolute value of & to our 
metric, we may assume that & is +1 in the case k 540. We shall endow 
elliptic, hyperbolic and parabolic metrics to 8", K" and C” globally and we 
call them normal domains. The normal domain D admits a well-known 
group G of analytic isometries, which is in any case a transitive Lie group 
of n? + 2n real parameters. Thus D is a homogeneous space with the “ strue- 
ture group” G and with the unitary group U(n) as its “ isotropy group.” 

Our later consideration is based on the following remark. 


Lemma 2. Let Q, and Q2 be two points in the normal domain D and 
let o be an analytic local isometry sending Q, to Q., then o can be extended 
uniquely to some global isometry of D belonging to G. 


Proof. Since G is transitive on D, we have only to prove the following 
assertion. Let fa(z) (1S be n analytic functions defining an iso- 
metry between some neighborhoods of the origin, then it holds fa(z) = > dag% 

B 


(l1Sa=n) with (ag) in U(n). Im fact let S be a sufficiently small 
geodesic sphere about the origin, then the image }* of > is also a geodesic 
sphere with the same radius about the origin. However } and * are 
ordinary spheres, hence our lemma follows from a result of Bochner ([3], 
pp. 58-59). 


We conclude from this lemma that G is the full group of analytic iso- 
metries of D. Moreover since D admits only one invariant metric with respect 
to G, the hyperbolic metric is also Bergmann’s metric (cf. [8], pp. 121-122). 
Hence G is the full group of analytic homeomorphisms of D in this case. 


We say that the metric has a constant curvature when Riemann’s curvature 
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2. Global consideration. Let V” be an n-dimensional complex variety 
which is simply connected. A global Hermitian metric on V is called a 
Rahlerian metric when it is everywhere without torsion. We shall assume 
that V has a Kiahlerian metric with constant curvature, then we can prove 


the following 


Lemma 3. There exists a single-valued analytic isometry Y from V 
into the normal domain D. 


Proof. Let P, be any but fixed point of V and let y be a “germ” of 
an analytic isometry from V into D sending P, to the origin of D. The 
existence of Y is assured by Lemma 1. We then take an arbitrary point P, 
of V and connect it to Py by a continuous path P; (0S¢t1). In the 
following we shall show that there exists one and only one family y 
(0t¢=1) of germs of analytic isometries from V into D sending P; to 
v(P:), which is “coherent” in the sense that y; induces yz when Py is 
near enough to P;. Since the uniqueness is evident, we shall prove the 
existence and that we have only to prove the following assertion. Let y% 
(0<t< 1) be a family of coherent isometries which is open on the right 
side, then we can complete it by qy1. 

Let @ be an analytic isometry of some neighborhood U of P, into D 
sending P, to the origin of D. Let P, be a point of the path which is near 
enough to P, such that P; for s = ¢=1 are all contained in U. Also let W 
be a neighborhood of P,, which is contained in U, such that y; maps it into D. 
In this case ys ° 7? gives an analytic isometry from ¢(W) to ¥.(W), which 
can be extended uniquely to a global isometry o of D by Lemma 2. It is a 
simple matter to see that the germ y, determined by o° ¢ at P, satisfies our 
demand. 

Now we define a mapping ¥ from V into D by ¥(P;) =y(P:). Since 
¥1(P:) is independent on the homotopy of paths from P, to P, and since V 
is assumed to be simply connected, © is single-valued completing the proof. 

We note that the changes of P, and yp in the definition of ¥ amount 
to replace it by o°y, where o is an operation of the group G; hence the 
mapping © is essentially unique. 

Now we shall assume that (V",f) is the universal covering variety of 
a compact Kahler variety V" with constant curvature. It is clear that the 
projection f induces a similar metric on V. Therefore we can find a single- 
valued analytic isometry Y from V into the normal domain D according to 
Lemma 3. However in the present case the following holds. 
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TuHeorem 1. The mapping Y gives a homeomorphic analytic isometry 
of V onto D- 


Proof. We shall show that we can define a single-valued analytic iso- 
metry ¥* from D into V such that Vow* —v*oW=—1. Let y*> be the 
inverse of wo in the proof of Lemma 3; let Q; be an arbitrary point of D 
and connect it to the origin of D by a differentiable path Q; (OS¢= 1). 
If y*, (0 St <1) is a family of germs of coherent isometries from D into 
V sending Q; to P;, we can complete it by y*,. In fact since the path f(P;) 
on V has a finite length and since V is compact, lim f(P;) exists and is a 


point of V. Therefore lim P; = P, also exists and is a point of V. It is 


clear that P; (0 ¢=1) is a continuous path on V and we have only to 
take the inverse of ¥, as y*,. Since D is simply connected, it follows from 
this argument that we can define a single-valued analytic isometry ¥* from 
D into V, which “extends” The relation poo = = 1 is 


clearly preserved by this extension. 


It is essential in this theorem that V is the universal covering variety 
of a compact variety. 


3. Structures. A compact Kihler variety V" with constant curvature 
will be called elliptic, hyperbolic or parabolic in an obvious manner. Our 
Theorem 1 asserts that the universal covering variety V” is then 8”, K™ or 
C” respectively. Moreover the Poincaré group T of V is represented as a sub- 


ce 


group of G, which is “ discontinuous ” and has no fix-point in D. Therefore 


THEOREM 2. The elliptic variety V" is the complex projective space 8". 


Proof. This follows at once from the fact that every projective trans- 
formation in 8" has n + 1 fix-points. 


The following lemma will simplify some discussions in the proof of 
Theorem 3. 


Lemma. Let M" be a compact differentiable manifold and let fy 
(p= 1,2,-- +) be a family of differentiable mappings from M into another 
differentiable manifolds N, with the following two properties: a) The family 
of point sets X, in M such that f, is not “ regular” at x in X_ (the Jacobian 


* The author was notified of Hawley’s paper on the same subject (“Constant holo- 
morphic curvature,” Canadian Journal of Mathematics, vol. 5 (1953), pp. 53-56) by 
Professor Sasaki in Princeton. 


672 

C 

| 

| 


KAEHLER VARIETIES. 673 


matrix of fp at « has rank <n) 1s decreasing from a certain p onward and 
has an empty intersection. b) The family of point sets Ap in M X M such 
that fp(x) =fp(y) ts decreasing from a certain p onward and has A as its 
intersection. In this case fy gives an analytic homeomorphism from M into 


N, from a certain p onward. 


Proof. Since M is compact and since X, are closed, a) implies that Xp 
is empty from a certain p onward. However A,—A is closed, when Xp is 
empty. Since M x M is compact, b) implies therefore that A, =A from a 
certain p onward. 


THEOREM 3. In the hyperbolic case we can define an analytic homeo- 
morphism F from V" onto a non-singular variety in a complex projective 
space by a set of automorphic forms of the same weight in D with respect to YL. 


Proof. (ef. [9], chap. 10). Let jo(z) be the Jacobian of the isometry 
o of D at z, then the series } | jo(z)|" (m= 2) is uniformly convergent on 


every compact set in D. Moreover since the factor space D/T = V is compact, 

the set F of points z in D such that | jo(z)| <1 for all o in T is also compact 

and clearly generates D. Let 7) be the number of o in I such that F° inter- 

sects with F; take r= 2r,—1 and put N =C,"* —1, then the set of 

“ Poincaré series ” ®,,(2, 6) = 2 (02) -Jo(z)™ (m = 2) defines an analytic 


mapping ®,, from D/T into the projective space SY when ¢(z) varies in the 
space of polynomials of degree not greater than r. Thereby if z, 2, and 22 
are given points of F such that z, and z, are inequivalent under I, then ®», 
is regular at z and ®,,(z,) ®(z2) from a certain m onward. Hence we 
can apply the previous lemma to fp = ®2 X ®3 X K and we conclude 
that fp gives an analytic homeomorphism from D/TI into p-ply projective 
space SY x SY x: - - x S* from a certain p onward. Therefore if we denote 
by F the composition of one of such mappings and the well-known imbedding 
of multiply projective space into an ordinary projective space, the image 
F(V) of V is an everywhere algebroid variety without multiple point. 

Now we can apply a result of Chow ([5], Theorem 5) to F(V), which 
is thus an algebraic variety. This idea is general and could be applied to a 
simplification of the theory of automorphic functions. 

We shall now consider the parabolic case. The following theorem shows 
that a complex torus is “ universal ” in a certain sense. 


THEOREM 4. When V" is of parabolic type, there exists a uniquely 
determined finite “ normal” covering variety T” of V" which is a complex 
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torus. Moreover this extension has only a finite number of possibilities 


depending on n. 


Proof. As we have remarked before V" is a factor space of C™ by a 
discontinuous group I of “motions” in C”. Thereby since V is compact, T 
is “indecomposable.” Therefore by a result of Frobenius ([6], p. 663) 
translations in T form a normal subgroup T of finite index with 2n indepen- 
dent generators. Moreover the extension I of T does not depend on the choice 
of the coordinates in C” and is divided into a finite number of classes 
depending on n. We have now to put 7" = C"/T. 


It follows from the above proof that the totality of parabolic varieties 
V" forms a finite number of “ analytic families ” in a natural way such that 
two members in the same family have the same real analytic structure. In 
particular those which are algebraic form a finite number of analytic sub- 
families. This is a consequence of the “theorem of Riemann-Weierstrass ” 
and of the following general 


THEOREM 5. Let V" and V" be complex varieties such that one of them 
is a finite covering variety of the other, then they are or they are not 
analytically homeomorphic to non-singular projective models at the same time. 


Proof. It follows from a general theorem of Kodaira ([8], Theorem 3) 
that a finite covering variety of a non-singular projective model is a non- 
singular projective model. Therefore we have only to prove the converse. 
Let V be a non-singular subvariety of a projective space S%, which is at the 
same time a finite covering variety of V with projection f. Let P be a point 
of V and let F(P) be the “ Chow point ” of the 0-cycle f-+(P) in S% which 


is carried by V, then we see readily that F is homeomorphic, analytic and 
regular. 


4, Minimality. In this section we shall show that hyperbolic and 
parabolic varieties are “minimal” in a certain sense. We first put a general 
definition. (This definition is given essentially by Weil in his book on 
Abelian variety [Hermann, 1948] on page 27.) 


Derrnition. Let V" bea projective model and F a meromorphic mapping 
from a complex variety U™ into V", which can be defined invariantly using 
local coordinates on U and affine representatives of V. If every such F 1s 
necessarily analytic for any U, we say that V is a minimal modei. 
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We shall now prove the following remarkable 


THEOREM 6. Any hyperbolic variety is a minimal model. If a para- 
bolic variety is a non-singular projective model, it is also a minimal model. 


Proof. Let F be a meromorphic mapping from U™ into a variety V” 
which is stated in the theorem, then F is analytic. In order to prove this 
assertion we may assume that U™ is a connected neighborhood of the origin 
of C™. Since V" is imbedded in a projective space, the “ fundamental locus ” 
W of F have dimension at most equal to m—2; hence its residual space 
U—W is simply connected. Let z be a point of U—W and let P* be a 
point of the universal covering variety (D,f) of V” lying over F(z), then 
the germ of analytic mapping z— P* can be extended as a single-valued 
analytic mapping F* from U—W into D. Thereby the simply connected- 
ness of U —W is essential. Now any component F*, of F* is a bounded 


holomorphic function on U — W for a=1,2,:--,n. Therefore by a well- 
known theorem of Hartogs (cf. [3], Theorem 5, pp. 173-174) F*, must be 
holomorphic throughout U for «—1,2,---,n. 


As a consequence of this theorem we shali prove a complete generalization 
of the theorem of Schwarz-Klein. 


THroremM 7%. Let K be the field of meromorphic (hence algebraic) 
functions on a hyperbolic variety V"; then the group of automorphisms of K 
over C is finite. 


Proof. We know that every such an automorphism can be represented 
as a birational correspondence between V and itself. However since V is a 
minimal model, the birational correspondence is necessarily everywhere bi- 
regular; hence it is induced by some analytic homeomorphism of D. How- 
ever by a remark at the end of section 1 such operations form a subgroup 
of G, which is discrete by a result of Bochner ([1], Theorem 5). On the 
other hand every discrete subgroup of G is discontinuous on D (ef. [9], 49) ; 
hence our assertion. 


5. Further results. Let g, (1S pn) be the p-th irregularity of a 
compact complex variety V", which is the number of linearly independent 
p-ple differentials of the first kind; we put also g)=1. In analogy with 


the Euler-Poincaré characteristic y(V"), we call y,(V") = > (—1)?gp the 
analytic characteristic of V". 


Now we consider a hyperbolic variety V". If we denote by dv the 


— 
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invariant measure in its normal domain D which coincides with the Euclidean 
“volume element ” at the origin of D, the total measure v(V") of V™ has 
an intrinsic meaning and we have the following 


TueoreM 8. It holds that 
= (m+ 1) = (—1)"((m 1) Yar) 


Proof. If we remember Chern’s generalization of Gauss-Bonnet’s formula 
({4], pp. 114-115) which is valid in general and the so-called “ Todd’s 
formula” which was proved recently by Hirzebruch for non-singular pro- 
jective models, we have only to verify this identity (except for (—1)") in 
the case of the projective space 8". 

We next consider a parabolic variety V" and we shall prove the following 


THEOREM 9. There exists a uniquely determined series of compact com- 
plex varieties V = Vo, -, Vers = W such that V; ts a fibre space over 
its “Albanese variety” T; (complex torus attached to the period matrix of 
simple differentials of the first kind on V;) with fibre Vi,, fori —0,1,---,s 
The final variety W is a parabolic variety with the first irregularity equal to 
zero. 


Proof. We can write T in the form z* = Az-—+a and consider A as a 
representation of T in U(n) with kernel T. The factor group G T/T is 
finite and gives the group of “covering transformations ” of T = C"/T over 
V. It is clear that the projection from T onto V induces a natural isomor- 
phism between the module of differentials of the first kind on V and the 
module of invariant differentials with constant coefficients in dz* (1S an) 
under the operation of G. In particular g, is nothing but the number of 
principal characters in A. Therefore by a suitable coordinate transformation 
in C*, the operation of T can be represented in the form 


I,, 9 a 
0 Ai (1, = unit matrix) 
0 0 1 


and it is easy to see that the vectors a) form a discrete group of rank 2g, in 


C% which determines a torus 7). If we consider the partial matrix | 


corresponding to a 0, we can repeat the same reasoning. 
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The following corollary shows that the “arithmetic genus” of a para- 


bolic variety V" is equal to (—1)”". 
Corotnary. Jt holds that y(V") =ya(V") =0. 


In fact we conclude from the above proof that 


= (1/(G@) ) det (tly + A) 


p=0 

with an indeterminate ¢, where (G) is the order of G. However since every 

transformation in has no fix-point, we have det(A —J,) whence 
2 1)*? = 0. 

On the other hand it would be favorable, if a variety W stated in 

Theorem 9 does not exist actually. However the circumstance is not so 


simple. 


EXAMPLE. Let M be a discrete additive group of rank 2 in C; let 
te: Ya (1S a3) be six complex numbers satisfying the following con- 
ditions: 21, Y2, — ys 0 (mod M), 221, 2y2, — ys) =0 (mod M); put 
= 2, +2, 2%. = — + Lo, 2*, = — + and =— 4+, 
== + Yo, = — 2, + Let T be a group of translations of the form 
ma (Mae M) (143), which generates a discontinuous 
group T of motions in C® with o, and o,. It can be readily verified that 
every member of IT has no fix-point in C* and that the factor group [/T is 
the Klein’s four group. Since the factor space T* = C*/T is a product of 
three elliptic curves corresponding to M, the factor space V* = C%/T is 
also a non-singular projective model by Theorem 5 which is covered four 
times by 7J*%. Moreover the Poincaré polynomial of V* is given by 
1+ + + + 

However in the case of curves and surfaces, we have g, = 1 by the above 
corollary. The case of n = 1 is trivial and gives an elliptic curve. The case 
of n = 2 is either a complex torus or is a fibre space over an elliptic curve 
with another elliptic curve as its fibre by Theorem 9. In the latter case they 
are necessarily non-singular projective models and known as “ hyperelliptic 
surfaces.” 
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LOWER SEMI-CONTINUITY AND AREA FUNCTIONALS, II. 
THE BANACH AREA.* * 


By CaspER GOFFMAN. 


1. Introduction. In part I of this series [1], we showed that the 
Lebesgue area for continuous non-parametric surfaces has a natural extension 
to a lower semi-continuous functional on the space of summable functions 


suitably metrized. Specifically, if 
d(p,q) = j | p(x, y) y)| da dy 
e770 0 


is taken as the distance between polyhedral functions p(a, y) and g(a, y), the 
extension of elementary area to the space of equivalence classes of summable 
functions, obtained by the Fréchet process [2], is a lower semi-continuous 
functional. It agrees with the Lebesgue area for continuous surfaces and, 
in general, retains the character of the Lebesgue area. 

The present paper is an attempt to obtain similar results for the para- 
metric case. We are concerned with the following existence problem: 

Is there a distance function d(p,q) for the parametric polyhedral 


mappings such that 


(a) elementary area is lower semi-continuous on the space ? defined 
by d(p, 


(b) the completion ?’ of ® contains a reasonably wide class of dis- 


continuous mappings along with all the continuous ones? 


We find that the analogue of the metric used in the non-parametric case 
is unsatisfactory in the parametric case since the elementary area H(p) is 
not lower semi-continuous. Nevertheless, we find a distance function for 
which P’ contains all Baire mappings (i.e. mappings f: v), 11, 2,3 
for which 2;(uw, v) are Baire functions) whose images on the coordinate planes 
have finite planar measure. We remark that the extended functional obtained 
here is related to the Banach definition of area rather than to the Lebesgue 


definition. 


* Received January 21, 1954. 
* Research supported by a grant from the National Science Foundation. 
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2. Natural metrics for which E(p) is not lower semi-continuous, 
The analogue of the above metric for parametric polyhedral mappings 
p:pi(u,v) and gq: gi(u,v), += 1,2,3 is the function 


3 
d(p, q) | pi(u, v) —qi(u, v)| du dv. 


This metric cannot be used as a basis for a theory of surface area since the 
elementary area E(p) is not lower semi-continuous on the resulting space.’ 
Indeed, consider the mapping p: p:(u, v) =u, po(u, Vv) =, ps(u, v) =0 on 
the unit square 7. Subdivide 7, for every positive integer n, into a grill 
of squares Ink,» all of whose diagonals are of length 1/2". 
Place in each of these squares, J,;,, a pair of squares Jn, and K,,x, concentric 
with Jy, such that the sum of the areas of the J,; is 1-1/2" and the sum of 
the areas of the K,, is 1/2". Now, let K, be a simply connected polygonal 
region in J which contains all the A,,;, in its interior and whose area is less 
than 1/n. There is a homeomorphism f,: fni(u, v), t= 1, 2, 3, between J and 
K, which takes each J,, into K,,. There is then a polyhedral mapping 
7: Qui(u, v), += 1, 2, 3, such that | pi(u, v) — gui(u, v)| < 1/2” for each 7 on 
a set of measure greater than 1 — 1/2", and such that E (qn) < 1/2". Now, 


1 13 
lim f f | p(u, v) — qu(u, v)| du dv =0. 
0 0 


n> 


But, lim (qn) =0 and = 1, so that the elementary area is not lower 


n> 


semi-continuous. 
It was also noted in [1] that if the distance between continuous non- 
parametric surfaces is taken as 


8(f,9) =m(B), 


where # is the set for which f(z,y) #g(z,y), then the Lebesgue area is 
lower semi-continuous and the Fréchet process leads to a lower semi-continuous 
extension to the set of all measurable functions. We now show that the 
elementary area also fails to be lower semi-continuous for the parametric 
analogue of this metric. Indeed, let (x) be the Cantor function on [0, 1] 
and let f be given by 


(u,v) = 4, Le(u, Vv) Vv) = (uv). 
It is clear that the Lebesgue area L(f) of f exceeds 1. Moreover, it is easily 


* E. Silverman has shown recently [3] that the Lebesgue area is lower semi-con- 
tinuous with respect to point wise convergence. 
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seen that, for every « > 0, there is a continuous f, such that 8(f, f.) < « and 
L(fe) <1. This shows that the functional L(f) is not lower semi-continuous 
on the given space. We remark, however, that the elementary area is lower 
semi-continuous on the set of parametric mappings of class C’ with 8(f,g) 
as metric. This leads to an area functional defined on a wide class of 
mappings which does not agree with the Lebesgue area for certain continuous 
mappings. We shall discuss this case elsewhere. 


3. The space ?. We now introduce a distance function for the poly- 
hedral mappings for which the elementary area /'(p) is lower semi-continuous. 
For convenience, we shall work first with transformations 7: a(u, v), y(u, Vv) 
of the unit square J = |0,1] X [0,1] into the («, y)-plane and shall speak of 
mappings (triples of transformations) later. Our distance function is defined 
in terms of a special sequence of subdivisions of 7. For every n, let 


R,", R.", Reon” 


be the set of squares in J, whose sides are of length 1/2”, obtained by means 
of the lines u, v = 0, 1/2”",- - -,1—1/2",1, where each bounding edge is 
assigned to one of the two squares which it bounds in such a way that each 
Rj" is contained in an F,.". 

Now, let P and Q be polyhedral transformations of J into the (2, y)- 
plane; i.e., J may be decomposed into a finite set of triangles such that P 
is continuous on 7 and is linear on each triangle of the decomposition, and 
similarly for Q. Using TR as notation for the image set of a’set R under 
a transformation 7, and RS as notation for the symmetric difference 
RUS—RQS of sets R and S, we write »;"(P,Q) = m(PR" * QR") for 
the planar measure of PR," * QR;". 

We shall say that a sequence {P,} of polyhedral transformations con- 


verges to a polyhedral transformation P if for every - -,2°"; 
m=1,2,---, we have limp”"(P,P,) =0. The neighborhoods U of P 
n> © 


which correspond to this notion of convergence are given by finite sequences 
(m),t,), (ms, t2),° (Mr, €1, Where the ¢’s are positive, and 
QeU if wij (P,Q) for every =1,2,---,k. It then follows that 
the space ® obtained in this way is such that 


THEOREM 1. P is a uniform space which satisfies the Hausdorff first 
denumerability condition and so it is metrizable. 


It is convenient to work with an equivalent distance function d(P, Q) 
Instead of the given convergence notion. For every n, we let 


— 
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N 
dn(P,Q) =D wi" (P,Q) where N = 2. 
i=1 


The function d,(P,Q) is monotonically non-decreasing in n. We let 
d(P,Q) =1/2" where n is the smallest integer for which d,(P,Q) = 1/2". 


LemMA 1. A sequence {P,} converges to P in P tf and only tf 


lim d(P, = 0. 


n> o 


Proof. Suppose {P,} converges to P in P. For every n, there is an NV 
such that, for every m > N, and i= 1, 2,---, 2°", we have wi"(P, Pm) < (4)*", 
so that da(P, Pm) < 1/2", whereby d(P, Pm) < 1/2". 


Conversely, suppose {P,} does not converge to P in P. There is then 
an and m with lim sup.yj"(P, Pn) > 1/2* > 0. Moreover, we may choose 
n> co 


k>m. Then, for every N, there is an n > N with d,(P, P,) = pi™(P, P,) 
>1/2*. Since k >m, we have d;(P, Pn) 2 dn(P;, Pn) > 1/2*, so that 
ad(P, P,) 1/2*. 

We mention, in passing, that d(P,Q) is not a metric but satisfies 
(i) d(P,Q) >0 if PAQ, (ii) d(P,P) =0, (iti) d(P,Q) =d(Q, P), 
and (iv) d(P,Q) + d(Q,R) 4d(P, 

The elementary area H(p) of a polyhedral transformation P has its 
usual meaning [4]. 


THEOREM 2. The functional E(p) is lower semi-continuous on the 
space P. 


Proof. Observe that for every polyhedral P and positive integer n we 
have H(P) = m(PRi") where VN = 2°", but that for every « > 0 there 
<), with 178° for which < m(PR") 4 6/2. 
Suppose d(P, Q) < 1/2" where n= n(P,¢). Then d,(P, 9) < 1/2", so that 


E(Q) BE m(QRe) = (PR) — 
m(PRe) —d(P,Q)> m (PR) > 


This proves the lower semi-continuity. 


It is also clear that for every P there is a sequence {P,,}, PnP, such 
that lim d(P, P,) =0 and H(P) =lim F(P,,). Thus, by the Fréchet pro- 


n> oo 
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cess, H(P) has a lower semi-continuous extension to the completion ?’ of 
the space P. 

We also wish to remark that H(P) is not continuous on ?; examples 
of this fact are easy to construct and we shall omit them. In this connection 
we make the conjecture that there is no metric for the polyhedral mappings 
for which E(P) is continuous for which the completed space contains all 
continuous mappings. 


4, Measurable point and set transformations. The completion P’ of 
@ involves the following sort of set functions with the rectangles R;", 
t=1,2,- - -,2°; n—1,2,- - - as domain. Associated with every R,” 
there is a measurable set S;°—f(R,") in the (z,y)-plane such that 


4 
LU Si,"**, where the R,,"*', 7 = 1,---,4, are the four squares whose 


union is R;". The set function f(R;,") has an associated point function ¢(z, y) 

defined as follows: For each n, let dn(z,y) be the number of squares ,", 

i=1,2,-- -,2°, for which (2, y) ef(R"), and let y) = lim ¢,(2, y) 
n> 


or co, as the case may be. The function ¢(2,y) is measurable. For, the 
set of points for which ¢,(z,y) =k, for every n and 0=k Sn, is expres- 
sible as a finite number of unions and intersections of the sets f(R,"), 
i= 1,2,---,2?", and so it is measurable. In the context, when we refer 
to set functions we shall always mean those of the above type. 

By the Banach area of a set function f we mean the number 


= ff 


where the integration is over the whole (z,y)-plane. We shall show, in the 
next section, that the completion P’ of P is just the set of all set functions f 
for which the associated 4(z, y) is different from 0 on a set of finite measure. 

We shall call a point transformation T measurable if for every Borel 
set S, the image 7S is measurable. Every measurable point transformation 
has an associated set transformation f defined by f(R,") = TR," which evi- 
dently satisfies the above condition. The Banach area B(T’) is defined as 
B(f) where f is the set function which corresponds to 7. On the other hand, 
there are set functions 8;"—f(R) which cannot be obtained from point 
transformations [5] in the above way. 


N 
For every n, let B,(f) => m(Si"). We have the following fact, essen- 
t=1 
tially proved by Banach [6]. 
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THEOREM 3. B(f) =lim B,(f). 


n> co 


Proof. Bi(f) = f f gn (x, y)dx dy and {¢n(x, y)} is a monotonically 


non-decreasing sequence converging to (2, y). 


5. The space ?’. We show now that the completion P’ of P consists 
of the set ¥ of all set functions f for which the set f(Z) has finite measure, 
and that the lower semi-continuous extension of H(P) to P’ agrees with 
B(f) on &. 

We first note that the definition of distance d(f,g) given in section 3 
for polyhedral transformations can be carried over without change to the 
set F; we shall henceforth designate this space by &. 


(a) FF is complete. For, let {fx} be a fundamental sequence in @. 
For each £&,", consider the sequence {f»,(#;")} of measurable sets. This 
sequence is fundamental in the sense that for every « >0 there is an NV 
such that m,k > N implies m(f;,(Ri") There is a sub- 
sequence {f;,} such that m(fi,,(Ri") * fi,( Ri") < 1/2/, for every j. Let 


fi,(Ri") and define the functions S;*=f(R"). It is a routine 


k=1  j=k 
matter to check that f is a set function and that lim d(f, fn) = 0. 

(b) P is dense in ¥. Let fe F be given by 8" —f(R"). For any n, 

there are polygons i=1,-- -,27", such that m(Q," * 8;") < 


There is a polyhedral P such that P(R,") * Q" < 2-@"*) for every i, so that 
N 
dn(P,Q) = =m (P(Ri") * f(Ri")) 
N N 
SX m(P(R") * QO") +B * 8") < 1/2", 
i=1 i=1 


where N = 22" and d(P,f) < 1/2", so that P is dense in ¥. Steps (a) 
and (b) furnish the proof of 


THEOREM 4. §F is the completion of P, so that F = P’. 


The proof that B(f) is the Fréchet extension of E(P) to P’ will also 
be given in two steps. 


(a) B(f) is lower semi-continuous on P’. Suppose B(f) <<. There 
is an n such that B,(f) > B(f) — (4)"™".. Now, if d(g,f) < (4)"", then 
< (3)"", so that B(g) = Ba(g) > Bn(f) — (4)™ > B(f) — 1/2". 
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If B(f) oo, then for every M there is an n such that B,(f) > M+1, so 
that if d(g,f) < (4)"** then again B(g) = Ba(g) > Ba(f) —1>M. 


(b) For every fe P’ there is a sequence {Pn} of polyhedral trans- 
formations such that limd(f, Pn) =0 and lim E(P,) =B(f). To prove 


this we simply observe that the polyhedral P in (b) above may be taken so 
that H(P) < B,(f) —1/2". Writing P, for this P we have lim d(P,, f) =0 


| and lim H(P,) lim B,(f) = B(f), by Theorem 3. But lim #(P,) 2 B(f) 


by the lower semi-continuity of B(f). 


Now, the content of the Fréchet process is that there is a unique 
extension of a lower semi-continuous functional on a metric space to its 
completion with properties (a) and (b). We have thus proved 


THEOREM 5. The unique extension to P’ of E(P) on P given by the 
Fréchet process 1s the Banach area B(f). 


We conclude this section with a remark on an analogue of the Fréchet 
equivalence relation [7] for the case under consideration. The discussion 
will be restricted to the subspace J C P’ of point transformations since if f 
is a set function and + is a one-one transformation between J and itself, even 
polyhedral, then fr may be meaningless. We shall say that 7, is equivalent 
to T, if, for every « > 0 there are one-one transformations 7, and r, between 
I and itself such that d(7,, Tot.) <¢ and d(T,7;,T2) <<. It is evident 
that this is an equivalence relation. Writing ¢(z,y;7) for the multiplicity 
function of the transformation T, we prove that 7, is equivalent to T, if and 
only if y; = y;T1) almost everywhere ; in other words, surfaces 
may be identified with multiplicity functions. 

Lemma 2. Jf limd(T,T,) =0 then lim inf (2,4; Tn) 2 $(2,y;T) 


n> © 


almost everywhere. 


Proof. We need only observe that for every m, lim inf ¢(2, y; Tn) 
n> 


= ¢m(z,y;T) almost everywhere. This holds since for every « > 0 there 
is an N such that n> WN implies m(T,(Ri) * T(Ry") < €/2™ for every 
i=1,2,---,2?™, and a standard real variable argument supplies the proof. 


1. If T; is equivalent to T, then o(2,f;T1) =$(2,y;T>) 
almost everywhere. 


Proof. For every r, it is evident that (2, y; T:r) = ¢(2,y;7,). Let {tn} 


14 


| 


6386 CASPER GOFFMAN. 


be such that lim d(7,7n, T2) =0. Then lim inf y; Tima) 2 $(2, y; T2) 


so that ¢(2,y¥; 7.) = ¢(2,y;T-), almost everywhere. Similarly y; 
= o(x,y;T;) almost everywhere. 


The converse also holds. 


Lemma 3. If $(x,y;T1) =$(%,y;T2) almost everywhere then T, is 
equivalent to T>. 


Proof. We need only show that for every « > 0 there is a 7 such that 
d(T,r, T:) < «. Let 1/2" < «, and consider the sets T,(R,"), 1 = 1, 2,-- +, 2°. 
Since ¢(2,y;T,) = ¢(2,y;T2), there are sets Si, t= 1,2,- - 27", such 
that 7,(S;) = T.(R") for every i. Now, let + be a one-one transformation 
which takes each onto S;. Then 7,7(R;") = T:(S;) = T2(Ri") for every i 
so that d(T,7, T2) 


Remark. In the above proof, if 8; should be of cardinality less than c 
the necessary adjustment may be made by adjoining to it a set of cardinality 
c whose image under 7, has measure 0. 


THEOREM 6. Two transformations T, and T, are equivalent if and 
only if they have the same multiplicity function. 


6. Mappings in 3-space. Most of the above results on mappings in 
2-space go over with little change to mappings in 3-space. Thus, the com- 
pletion P’ of the space P of polyhedral mappings p: pi(u,v), i= 1, 2, 3, 
contains all mappings x: 2;(u, v), 1 = 1, 2, 3, for which the a;(u, v) are Baire 
functions, and the Banach area B(x) is lower semi-continuous on ?’. How- 
ever, we do not know whether B(f) always agrees with the Fréchet extension 
of to P’. We know only that B(x) = always, and that 
B(x) = (zx) if (zx) is infinite. We proceed with the details. 

A polyhedral mapping p: v), i= 1, 2,3, into (2, 73)-space has 
three associated polyhedral transformations P;(u,v), i—1, 2,3, some of 
which may be degenerate, where P;(u,v) is the pair (U,V), V)) 
and all integers are taken module 3. Let ? be the space of all polyhedral 


3 
mappings with d(p,q) = > d(Pi, Qi) ; clearly, P is metrizable. 


A set mapping f: fi, 11, 2,3, is a triple of set functions. We show 
that the completion P’ of P is the space ¥ of all set mappings f for which 


8 
the images of the f; have finite measure, and where d(f,g) = > d(fi, 9:)- 


=1 
The proof that ¥ is complete is the same as for set functions. We need 


| 
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accordingly only show that ? is dense in ¥. For this, we fix n, and for 
each Rj”, j =1,2,- - -, 27", and i—1, 2,3, consider the set f;(R,;") in the 
(Tis1, Tixe)-Plane. Now, this is a measurable set of finite measure so that there 
is a polygon Qj; in the Zi,2)-plane for which m(fi(Rj") * < ($)?"*. 
We construct a polyhedral mapping p such that the image of each FR,” contains 


3 N 
the three polygons = 1, 2,3, and E(p) <<} Sm(Qiy) < where 
4=1 j=1 


N An easy calculation shows that d(f, p) < 1/2", so that F¥ =P’. 
For every fe F, we let 


{Lm fale) + Lm (fal Bey]? + Lm (fa 
and define the Banach area by B(f) =—lim B,(f). We show that B(f) is 


n> 


lower semi-continuous on F = P’. Suppose fe P’ and B(f) <0. Let n 
be such that B,(f) > B(f) —«/2 and 1/2" < «/6. If d(f,g) < 1/2" then 


gi) < 1/2" so that dn(fi, gi) < 1/2" for every 1,2,3. In other 
8s N 
words, S m(fi(Rj") * gi( Rf") < 3/2". But now, 
4=1 j=1 
N 2 N 3 
| Bu(f) —Br(g)| =| CX [m(fi(B") PB = Em Bi") ) | 
j=1 j=1 ¢=1 


N 


=3| [m (fa [m (gi( Ry") ) | 


= > =m (fi( Ri") * gi(Ri")) 
by the use of a well known inequality [8]. Thus, 
B(g) = Ba(g) = B,(f) — 3/2" > B(f) —«/2 — = B(f) —«. 


The proof of lower semi-continuity when B(f) =o is similar and will be 
omitted. We observe that for polyhedral p, B(p) = F(p), so that B(f) is a 
lower semi-continuous extension of E(p) to P’. Since the Fréchet extension 
6(f) is the maximal lower semi-continuous extension of E(p) to P’, we 
have &(f) = B(f). 

We have already shown that if f is planar, then 6(f) —B(f). We 
remark that if S(f) is infinite, then so is B(f). This follows immediately 
from the obvious inequalities 


S and @(f;) =@(f)S (fy), for any f. 


For, if (f) is infinite then at least one B(f;) = ®(f;) is infinite. 
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In conclusion, it may be of interest to look at certain peculiar examples 
of the Lebesgue area theory. The Gedcze surface of zero Lebesgue area and 
infinite Banach area is commonly cited as an example to show that the latter 
definition does not possess the lower semi-continuity character. It is clear 
that the sequence of polyhedra converging to the Gedcze surface which illus- 
trated this point does not converge with respect to the distance function 
used here. A second surface of interest is one first given essentially by 
Nobeling [9] and then used by Besicovitch [10] as an example of a homeo- 
morphic image of the unit square in euclidean 3 space whose 3 dimensional 
Lebesgue measure exceeds its 2 dimensional Lebesgue area. The sequence of 
polyhedra converging to the given surface which shows this turns out also 
to be a fundamental sequence with respect to the distance d(f,g). However, 
now, instead of converging to the continuous surface given by Besicovitch it 
converges to a discontinuous one. Actually B(f) oo for the Besicovitch 
surface. 
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ON THE ZEROS OF SOLUTIONS OF SECOND-ORDER LINEAR 
DIFFERENTIAL EQUATIONS.* * 


By NEHARI. 


1. The present paper is concerned with the relation between the analytic 
function p(z), defined in a domain D of the complex plane, and the location 
of the zeros of the solutions of the differential equation w’’(z) + p(z)w(z) = 0. 
As in the case of a real variable, we shall refer to an equation as non-oscillatory 
if none of its solutions has an infinity of zeros on the set under discussion. 
If at least. one solution has an infinity of zeros, the equation will be said 
to be oscillatory. 


2. Our first result is a complex non-oscillation theorem for an infinite 
half-strip. 


THrorEM I. Let p(z) be regular and let | p(x+ ty)| tm the 
half-strip OS (2 =2+ where e(x) > 0 forz>0. 
If, for sufficiently small positive n, 


(1) J, ple + de 
0 
then the differential equation 
(2) w’"(z) + p(z)w(z) =0 
is non-oscillatory in any half-strip OS <0, —1+8SyS1—8 


(0<8<1). 


Since p(z) is regular for —1+8SyZ1—4, it is obvious that no 
solution of (2) can have an infinity of zeros on any of the linear segments 
=const.,. —1+8=y=1—8. It is therefore sufficient to consider only 
zeros whose abscissas are different from each other. 
If a and b (Re{a} < Re{b}) are two such zeros, we may transform (2) 
into an integral equation by means of the Green’s function of the differential 
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equation w”(z) =0. Formally, the result is the same as in the real case, 
but the Green’s function will now depend on the choice of an integration 
path C between a and b. We take C to be a rectifiable Jordan are connecting 
a and b which is entirely contained in the half-strip in which p(z) is regular. 
On C, we choose an arbitrary point £, and we denote by C, and C, the sub- 
arcs of C which connect a and £, and ¢ and 3, respectively. The Green’s 
function in question is defined as follows. 


(3) = (z—a)(b—£) (b—a)”, zeC,; 
= (€—a) (b —z) (b—a)*, zeC,. 
In view of the identity 


Leg’ —9w'] + = (wg” —gw”’)dz, g=g(z,f), w=w(z), 


it follows then that a solution of (2) which vanishes for z—a and z=} 
will satisfy the integral equation 


(4) w(g) g (2, £)p(z)w(z) dz. 


We now identify £ with the point (or one of the points) at which | w(z)! 
attains its maximum on C. If M is this maximum, we conclude from (4) 


and therefore 


(5) | | ds, | dz | =ds. 
It may be remarked that (5) contains an inequality of Hartman and Wintner 
[3], to which it reduces if C is a segment of the real axis and p(z) is real 
on C. 

We next specialize the integration path C. Ifa—a’ + ia’, b = 0b’ + ib” 
(a’, a”, b’, 6” veal), we shall use a path consisting of three linear segments, 
and running from a to a’ +1(1—8), to b’ + i(1—8), to B. 


By elementary geometry, 
|(6—£)/(b—a)| S (1+ 4/(0’ S14 2/('—a’)*? (Ee C), 


and we have the same bound for |b—z||b—a|-, and also for 
|¢—a||z—a|* if zeC.. Hence, it follows from (3) that 
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| 9(z,€)| S| z—a|(1+ 2/(0’—@’)?), ze0,, 
| 9(z,0)| —a@’)’), ze Os. 
If, as we shall assume, a’ = 2, we have | z—a| <2, and thus 
(8) | 9(2,€)| Sa(1 + ze€. 


We shall, however, use (6) only on the horizontal part of C. On both 
vertical parts it follows from (3) by elementary considerations that 
| g(2, €)| S2(1 + 2/(b’—a’)*). Combining this and the inequality (6) 
with (5), and assuming that b’—a’ > 2, we arrive at the inequality 


(1) | pla + de 


| pla’ + iy)| dy +2 f | + ty)| dy. 
-14+5 


It is evident that we lost no generality by assuming that a’ = 2 and b’—a’ >2: 
if there is an infinity of zeros, we can choose a subset for which these 
assumptions are true. By the same argument, it is sufficient to consider 
subsets of zeros whose spacing is arbitrarily large. If 0<6< 1, we shall 
thus assume that for two consecutive zeros (the word “ consecutive ” referring 
to their real parts) a and b we have b’—a’ > 2(1— 6). 

With this choice of a subset of zeros, the left-hand side of (7) reduces 
to 6. Suppose now that we have n+ 1 consecutive zeros do, 4;,° * *, Qn, and 
add the n corresponding inequalities (7). This yields 


2 an’ n 1-6 


ay’ y=0 e 6 
(the right-hand side has been increased by doubling the first and last term 
of the summation). We now use the assumption (1). If 


foe ple + i(1—8)]| de <M, 
0 

we arrive at the inequality 

-5 


| p(ay’ + iy)| dy. 


1 


(9) no=M+43 
y-0 


We now have to estimate the integrals on the right-hand side of (9). 
To this end we consider the Cauchy integral 


(10) p(a) p(z) (2—a)7, 


= 
— 
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where the integration is extended over the perimeter of the rectangle 
a’ /2 —1i(1—8/2), 2a’ —i(1— 8/2), 2a’ + 1(1— 8/2), a’/2 + 1(1— 8/2). 
On the upper horizontal boundary of the rectangle we have 


|z—a|? =| + i(1— 8/2 —a”) |" S |1—8/2—a” 


the last inequality following from —1+8Sa”=1—8. Hence, if I, 
denotes the integral over this segment, 


|S (1/28) ple + —82)]]| ae 
S | ple + i(1—8/2)]]| ae. 


By (1), there exists a positive constant M, such that 


(11) f zl ple + 8/2)]| de < Me, 

Hence, 

(12) | |S 


and this bound is also good for the contribution of the lower side of the 
rectangle to the integral (10). On the left side, we have 


and on the right side 


|2—a | 2a’ + —a”) S 


Thus, by (10) and (12), 


1-5/2 
(13) | p(a)| + (1/na’) f | pla'/2 + dy 


1-8/2 
+1/(2na’) p(2a’ + dy. 


By assumption, | + iy)| Se(x), where e(z) for zo. It 
therefore follows from (13) that 


| p(a) | S 4M, + (1/r)e(a’/2) + (2/m)e(a’). 


Combining this with (9), we obtain 


no <M +. (32M,/z8) 1/4,’ + (8/n) ¢(a,’/2) + (16/x) > 


| 
] 
( 

| 

a 
t 
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For sufficiently large a,’, this inequality will be true a fortiori if e(a,’/2) and 
«(ay’) are replaced by «(a ’/2). Writing, for short, h(ao’) = (32M,/m8a,’) 
+ (24/m)€(a’/2), Mz(ao’) = M + 32M,/n8ao’ + (24/2) €(do'/2) S Ms, we 
thus have 

(14) n[@— h(do’) |] = Ms, 


where h({do’) > 0 for ao’ >, and the constant M, is independent of ay’. 

We now recall that the choice of the constant 6 (0<6< 1) affected 
the spacing of the zeros of (2) to which our considerations refer, but did 
not influence the choice of the first zero, i.e., of a. If there were an infinity 
of zeros in our strip, we could identify a) with a zero whose abscissa is large 
enough so that h(a’) = 6/2. It would then follow from (14) that n S 2M;/8, 
implying that the number of zeros with abscissas larger than a)’ is finite. 
The assumption that there is an infinite number of zeros in our half-strip 
thus leads to a contradiction, and Theorem I is proved. 


3. In the proof of Theorem I, the assumption | p(x + ty)| S ae(z) 
played the role of a “‘ Phragmén-Lindeléf ” condition whose task was to ensure 
that the inequality (1) should imply lim p(#-+ iy) =0 as xo through- 
out the half-strip. We may, instead, prescribe the asymptotic behavior of 
| p(«+iy)| for large x and —1<y<1. While this procedure does not 
yield a non-oscillation condition, it leads to restrictions for the density of 
the zeros. 

To illustrate the procedure, we assume | p(a + iy)| < C/a2*, C = const., 
o>0, and use (8). This yields 


id nd + + C2 (ay’)~, 


y=1 


where Cy, Ci, C2, are suitable constants. Our aim is to find a type of 

asymptotic behavior of the a,’ which contradicts (15). Since a,’—0 for 

n—>o, the sum on the right-hand side of (15) can clearly be neglected in 
| this connection. It follows that the assumption (a,’)*-* = o0(n) will contra- 
t dict (16). However, before we formulate a result concerning the zeros of 
_ the differential equation, we have to consider the fact that a,,qa2,° - - were 
» not all the abscissas of the zeros. Since we are dealing with a subsequence 
for which a,,,— a, is larger than a certain constant, the assumption that 
dy is o(n*), 0 << &k <1, will contradict (15) trivially, and no conclusion may 
therefore be drawn in this case with respect to the asymptotic distribution of 
the zeros. 


=] 
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TuHeoreM II. If p(z) is regular in the half-strip 0S¢t<o, 
abscissas of the zeros of any solution of the equation w”(z) + p(z)w(z) =0 
in the half-strip in question, then the assumption 


is incompatible with ay’ = 
Theorem II is an analogue of a result of Hartman and Wintner [2] for 


the case of a real function p(x) of the real variable z. 


4. Our next result gives a non-oscillation criterion for the unit circle. 


TueoreM II]. If p(x) is regular in |z| <1 and 


(16) d0 <2, 
then the equation 
(17) w"(2) + p(z)w(z) =0 
is non-oscillatory in |z| <1. 
We remark that the integral on the left-hand side of (16) is defined as 
the limit, for p—1, of the non-decreasing function p(pe*®) | dd. 


The proof of Theorem III will again be based on the inequality (5). 
If a and b ({a| <1,|b| <1) are zeros of a solution of (17), we take C 
to be the uniquely determined arc of the circle orthogonal to | z | = 1 which 
passes through a and 6 and is contained in |z| <1. (3) shows that in this 
case (as generally in the case of a convex are C) | g(z,¢)| S| 9(€,¢)|, and 


we may therefore deduce from (5) that 1 max | g(G0)1 | | p(z)dz |. 
Cc 

Since |b —¢|=|]b—a| and |£—a| b—a|, it follows from (3) that 

Henee, 


(18) 1S |b—a p(e)dal. 


Our next step depends on the following result of Fejér and Riesz [1]: 
If g(z) is regular in | z| < 1, then 


| 
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If g(z) =F’(z) and C, denotes the linear segment —1=2=1, this may 
be written in the form f | F’(z)dz|< af | | F’(z)dz|. Evidently, this 
Co 2|=1 


inequality is unaffected by a transformation of the variable z. Hence, (19) 
implies the more general inequality 


(20) <4 f 


where I’ is a rectifiable Jordan curve and C’ is an are which is transformed 
into a diameter of | z|—1 by a conformal mapping of the interior of T 
onto the unit disk. 

In particular, (20) will hold if © is the circumference | z| 1 and C’ 
is the intersection of | z | <1 and any circle orthogonal to | z | —=1, since a 
linear substitution w= (z—«)/(1—a*z), |a| <1, which maps the unit 
disk onto itself will transform C’ into a diameter if a is appropriately chosen. 
If C’ belongs to the same orthogonal circle as does the integration path C 
in (18), we have C C C’ and therefore, by (20), 


Combining this with (18), we obtain 
(21) 2/|b—al< | p(e)| dé. 


It may be remarked that the constant 2 on the left-hand side of (21) can be 
easily improved to 4 by means of elementary geometric considerations; how- 
ever, for our present purposes this is unnecessary. 


Suppose now that there exists a solution of (17) which has an infinity 


of zeros in |z| <1. If a,,a:,: ++ is a converging infinite subset of these 
zeros, we have lim | |= 0. Since, by (21), 


2/| An+1 — An | | | dé, 
0 


it follows therefore that (16) is incompatible with the existence of an infinity 
of zeros in the unit disk. This completes the proof of Theorem ITT. 


5. Theorem III is easily generalized to the case of a domain which is 
bounded by an analytic Jordan curve. 
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THEOREM IV. If p(z) is regular in a domain D bounded by an analytic 
Jordan curve C, and 


(22) 
c 

then the equation 

(23) w’"(z) + p(z)w(z) =0 


is non-oscillatory in D. 


The integral in (22) may be defined as follows. We choose a point ¢ 
of D and denote by C’, the level curve g(z,f) =m (m > 0) of the harmonic 
Green’s function g(z,£) of D. A conformal mapping if D onto the unit 


circle which carries ¢ into the origin shows that I(m) = f. | p(z)dz| grows 
Cm 


monotonically with decreasing m. For m—>0, Cy tends to C, and lim J(m) 


may thus be taken as a definition of the integral in (22). It is obvious 
that in the case in which D is the unit circle and £ the origin, this definition 
coincides with the one used in Theorem III. 

Let now z=h(u) map D conformally onto |u|<1, and rewrite 
equation (23) in terms of the variable u. This yields 


(24) + =0, 
where w[h(u)] —o(u)v(w) and o(w) is a regular and non-vanishing func- 
tion in |w| <1. By classical results, p(z) and g(w) are connected by the 
identity 

(25) p(z)h?(u) =q(u) —3{h(u), u}, 

where {h(w), uw} = (h”(u)/h’(u))’ is the Schwarzian 
derivative of h(w). Since C is an analytic curve and h(u) maps the interior 


of C onto the unit circle, the function h(uw) is regular for |w|=1. Hence, 
h’(w) is regular and ~0, and {h(w),u} is regular, in |w|S1. By (25), 


lim f° | p(2)de | tim f q(u) — | | 00, 


where u = pe*?, 0 < p <1, and it follows therefore that 


lim f | p(2) de | 
m>0 JCm 
and 
27 
(27) lim "| g(pe!)| ao 
pol 0 


will be either both finite or both infinite. 


— 
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By Theorem III, equation (24) will be non-oscillatory in |u| <1 if 
(27) is finite. But since the solutions v(w) of (24) and the solutions w(z) 
of (23) are related by w(z) —v(u)o(u), where o(u) is regular and 40 
in |u| <1, (23) and (24) are either both oscillatory or both non-oscillatory. 
Hence, (23) is non-oscillatory if (26) is finite. This proves Theorem IV. 
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LIE ALGEBRA KERNELS AND COHOMOLOGY .* 


By G. HocuscHILD. 


Introduction. Let ¢ be a homomorphism of a Lie algebra / onto another 
Lie algebra L, and let JJ denote the kernel of ¢. In this situation, we can 
associate with each element x of LZ the set y(x) of derivations m — [e, m] 
of M, where e ranges over the subset ¢“*(x) of #H. Then y is actually a Jf 
homomorphism of LZ into the Lie algebra D(M)/I(M), where D(M) is the [| 
derivation algebra of M, and I(M) is the ideal consisting of all the inner 
derivations m — [e,m], with ee M. The pair (M,w) is called an extendible 
L-kernel. Generally, an L-kernel is a pair (1, yw) consisting of an arbitrary 
Lie algebra M and a homomorphism y of LZ into D(M)/I(M). Not every 
J-kernel is extendible, i.e., not every L-kernel arises, in the way explained 
above, from an extension (#,¢) by LZ. A criterion for extendibility of a 
kernel can be given in terms of cohomology theory. With every L-kernel, one 
can associate a certain 3-dimensional cohomology class for Z in the center 
of M, and the Z-kernel is extendible if and only if this cohomology class is 0. 
This is the precise analogue of a result of Hilenberg and MacLane [1], for 
the case of groups, and can easily be proved in the same way, mutatis mutandis. 

Pursuing the analogy with the theory of Eilenberg and MacLane a little 
further, one is led to consider equivalence classes of kernels, two kernels being f 
equivalent if (as L-modules) their centers coincide and if, furthermore, they 
differ (in a sense to be made precise later) only by an extendible kernel. [| 
One can then show that the set of equivalence classes of kernels, for a fixed 
Lie algebra LZ, and a fixed Z-module NW as center, can be made into a vector 
space in such a way that the correspondence between kernels and cohomology 
classes induces a linear isomorphism of the space of the kernel classes into 
that of the 3-dimensional cohomology group H*(L, NV). 

The first difficulty arises when one tries to show that this map is onto, 
i.e., that every element of H*(Z,N) arises from an L-kernel with center N. 
This requires the construction of an Z-kernel from a given 3-cocycle for L 
in NV. It appears that this is difficult to accomplish directly. (The case of a 
semisimple Lie algebra of characteristic 0 shows that one must abandon finite 
dimensionality in such a construction.) However, if one employs the uni- 
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versal enveloping algebra of Z one obtains cocycles of the associative type as 
representatives for the elements of H*(L, NV), and with these one can construct 
corresponding Z-kernels by a slight modification (actually a simplification) 
of the construction of associative kernels which we used in [2], where we dealt 
with the analogous theory for associative algebras. 

The corresponding theory for restricted Lie algebras over fields of charac- 
teristic p is considerably more complicated. Here, the enveloping algebra 
ceases to be purely auxiliary but plays a principal part. Also, contrary to 
the case of ordinary Lie algebras, the passage from a restricted kernel to a 
restricted cohomology class (which cannot be defined without reference to the 
enveloping algebra) is even more laborious than the passage from a (restricted) 
cohomology class to a (restricted) kernel. For this reason, our procedure 
below is in the direction opposite to that of our sketch above. The final result 
is that the space of the restricted kernel classes is isomorphic with that of the 
restricted 3-dimensional cohomology group H*,(L, NV), where N is a restricted 
[-module with N'?! (0). For our terminology and for the elementary 
results concerning restricted Lie algebras and restricted cohomology, we refer 
the reader to [3]. Also, one of the main results of that paper is used in 
section 2. 

It goes without saying that the case of ordinary Lie algebras is implicitly 
contained in that of restricted Lie algebras. One merely omits everything 
relating to the p-map. Note, however, that our above account of the ordinary 
case ties up with our remarks at the end of section 1 to show that section 3 
is completely superfluous in the case of ordinary Lie algebras. 


1. Restricted kernels. Let M be a restricted Lie algebra over a field F 
of characteristic p. If m is an element of M we denote by D,», the inner 
derivation of M which is effected by m, i.e., Dn(u) =[m,u]. A derivation 
{ of M is said to be a restricted derivation if £(m'*!) — D,,?-1(¢(m)), for all 
meM. In particular, every inner derivation of M is restricted. The restricted 
derivations of M constitute a restricted Lie algebra, with the commutation 
[{,7] and the p-map We denote this restricted Lie 
algebra of the restricted derivations of M by D,(M). The inner derivations 
of M constitute a restricted ideal J(M) of D,.(M), so that we have a restricted 
factor Lie algebra D.(M)/I(M). 

Let Z be another restricted Lie algebra over F. The structure of a 
restricted L-kernel on M is defined by giving a restricted homomorphism » 
of Z into D,(M)/I(M). This homomorphism y defines the structure of a 
restricted Z-kernel on the center N of M. This restricted L-kernel N is called 
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the nucleus of the restricted L-kernel 1/. Note that, since N is abelian, the 
y-structure of N is the structure of a restricted L-module, while the restricted 
Lie algebra structure of N is reduced simply to a p-semilinear map n— nl?! 
of N into the annihilator N¥ of Z in N. When we refer to a nucleus we 
intend this to comprise both of these structures. 

Let M, and M, be two restricted L-kernels with a common nucleus N. 
We construct a composite restricted L-kernel M with nucleus WV as follows. 
In the direct sum (M,, M,) of the restricted Lie algebras M, and Mz, let J 
be the restricted ideal consisting of the pairs (n,—n) with ne N. We take © 
M = (M,, M.)/J, where we identify the center (N,N)/J with N by means ; 
of the isomorphism which is induced by the homorphism (1,2) >, + 1 | 
of (N,N) onto N. Clearly, M, and M, may be regarded as restricted ideals | 
of M, and when this is done we have M—M,+ M., M,N M.=—N, and 
[M,,M.] (0). The L-kernel structure of M is defined from the given f 
restricted homomorphisms y; of LZ into Do(M;)/I(M;) as follows. If ve L, 
let £; be a restricted derivation of M; which belongs to the class y;(z). There 
is a unique restricted derivation £ of M which coincides with g; on Mj, for 
i—1,2. The coset mod.I(M) of ¢ is evidently determined uniquely by | 
yi(z) and y.(z), and we denote it by y(z). Then y is a restricted homo- 
morphism of LZ into D,.(M)/I(M) and defines the structure of a restricted 
L-kernel on M. Clearly, the nucleus of M is N. 

We may evidently identify M,+ M, with M.+M,, and M,+ (M.+ 
with (M,+M.)+M;. The restricted [-kernels with a common nucleus N 
do not constitute an abelian group with this composition. However, one 
obtains the structure of an abelian group in the set of certain equivalence 
classes of such kernels. The equivalence relation which we shall introduce 
is based on the point of view from which extendible kernels are disregarded. 
Here, a restricted L-kernel M is said to be extendible if there exists a restricted 
extension (H,¢) of M by LZ which induces the given L-kernel structure on M, 
i.e., which is such that, for every ee EF, the restriction of D, to M belongs 
to the class y(¢(e)) of restricted derivations of M. 

If M, and M, are restricted L-kernels with a common nucleus N we say © 
that M, and M, are equivalent if there exist extendible restricted L-kernels [ 
U and V with nucleus N such that M,+U—M,.+ V. This relation is ¢ 
evidently symmetric and reflexive. The transitivity follows at once from the 
fact that a composite of extendible kernels is extendible. Indeed, if (F:, ¢:) 
and (£2, 42) are restricted extensions by Z whose kernels M, and M, have 
the same nucleus N, we can consider the restricted subalgebra of the direct 
sum £.) which consists of all pairs in which = ¢2(e2)- 
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If we reduce this restricted Lie algebra by the restricted ideal of the elements 
(n, —n), with ne N, we obtain a restricted Lie algebra # containing M, + M, 
as a restricted ideal. There is an evident restricted homomorphism ¢ of E 
onto L (induced by ¢: or ¢2) such that (H,¢) is a restricted extension of 
M, + M, by L which induces the proper kernel structure on M, + M,, whence 
M, + M, is extendible. 

We shall see that the equivalence classes of the restricted L-kernels with 
a fixed nucleus NV constitute an abelian group, with the composition of kernel 
classes that is induced by the composition of kernels as defined above. More- 
over, in the case where N'?! — (0) (which is the case of principal interest 
here), we can even define the structure of a vector space over the basic field F 
on this group of kernel classes. This is done as follows. 

Let M be a restricted Z-kernel with nucleus N, and let «eK, «0. 
Denote by J, the ideal of the direct sum (M,N) which consists of the elements 
(n, —an), with ne N. If NM! — (0) we have J,'?! — (0) so that Jq is, in 
particular, a restricted ideal of (Mf, N). In the general case, it is still true 
that J, is a restricted ideal of (M,N) whenever « belongs to the prime field 
of F, and in particular for ¢—-—1. Now we form (M, N)/Jq and identify 
its center (N,N) /J, with N by means of the isomorphism which is induced 
by the homomorphism (7;, m2) > an; + n, of (N,N) onto N. The L-kernel 
structure of M induces on (M,N)/J, the structure of a restricted L-kernel 
with nucleus VY. This kernel will be denoted aM. It is easy to see that 1M 
may be identified with J/, namely by the map which is induced by the map 
(m,n) —>m-+n. Also, a(BM) may be identified with («8)M. In fact, 
the elements of are represented in (M,N, N) by triples (m,n, nz), 
and it is easily seen that the map (m, 7,2) — (m, an, + nz) induces the 
required isomorphism of «(@M/) onto («#8)M. Furthermore, one checks almost 
immediately that «(1/, -+- M.) may be identified with aM, + «M/,. Observing 
that the passage 1/-—> «Mf can be carried over to restricted extensions of M 
by Z, we may conclude that aM is extendible whenever M is extendible. It 
follows that the maps 1/-» aM induce corresponding maps in the set of 
equivalence classes of kernels. 

It is quite possible to give non-computational proofs for the fact that 
the equivalence classes of kernels constitute an abelian group with the com- 
position defined above, and that our maps M—>«M induce automorphisms 
of the group of kernel classes that represent the multiplicative group of F in 
the automorphism group of the group of kernel classes. The only difficulty | 
resides in the treatment of the additive structure of F. Note that we have 
not even defined the zero multiple of a kernel. The above definition fails 


15 


| 
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for «=0, because the center of (M,N)/J.) may very well be larger than 
(N,N) /Jo. 

Before settling these questions concerning the group structure and the 
vector space structure (in case N'! = (0)) of the set of kernel classes, let 
us enumerate the facts we have to establish. The 0-element of the group of 
the kernel classes will be the class consisting of the extendible kernels. This 
means that if M and U are restricted L-kernels with a common nucleus such 
that U and M + U are extendible then M is extendible. The negative of the 
equivalence class of J/ is the equivalence class of (—1)M. This means that 
the kernel M + (—1)WM is extendible. Finally, if M is a restricted L-kernel 
with nucleus N such that N!! == (0), and if a, 8, and «+ 6 are non-zero 
elements of If then «MV + BM + (—«—£)M is extendible. 

All these facts will follow easily from a criterion for extendibility of a 
kernel which we shall establish next. Let (1/,w~) be a restricted L-kernel 
with an arbitrary nucleus N. We can find a linear map r—?, of LZ into 
D,.(M), called an operator map, such that ¢, belongs to the class y(x), for 
every We have then [t,, ty] — so that we can find an 
alternating bilinear map + from Z to MM, called a hindrance, such that 
ty] = ttey) + Drizy, for all and y in Since is a restricted 
homomorphism, we have t,2 — t,ti1eI{1), and we choose an element x* in 
M such that t,? =t,0:-+ D,.. It is easily verified by induction on k& that 


k-1 

(ty) = ty, + Dm, where yx = D,*(y), and m= (2, It 
i=0 - 

follows at once from this that [D,+, t,] is equal to the inner derivation of M 


-1 
which is effected by the element —7r(a!!, y) + > (2%, Yp1-i)). This means 
i=0 


that the element 7',(y) = t,(«*) — y) belongs 
i=0 


to N. We shall say that T (or sometimes T,(y)) is a secondary deviation 
of M. 

We obtain what we shall call a primary deviation of M by taking the 
formal coboundary of +, i.e., the alternating trilinear map $r defined on L 
by the formula 


87 (x, y, 2) = te(r(y, z)) — ty(r(x,2)) + t2(7(2, y)) 
7([z, y], z) ( [z, z], 7 ( [y, z], a). 
A straightforward verification shows that 8r takes its values in the center NV 
of M, and that it is actually a 3-cocycle for I, in N whose cohomology class 


in H*(L,N) depends only on (M,w), and not on the particular choice of t 
and + (The proof is given in [4]). 


— 
| 
| 


LIE ALGEBRA KERNELS. 703 


If M is extendible, we can choose the operator map ¢, the hindrance + 
and the elements x* in such a way that the deviations become 0. In fact, let 
(E,¢) be a restricted extension of M by L which induces the given L-kernel 
structure on M. Then we can find a linear map x—e(zx) of L into H such 
that ¢(e(z)) =<, for all we ZL. We may then define t,(m) = [e(z),m], 
for ve LZ and me M. For a corresponding hindrance, we may take r(z, y) 
= [e(r),e(y)] —e([2,y]). We may take 2* = e(a)'”! — e(2!!), and it is 
easy to verify that we then obtain T,(y) —0. Also, a direct computation 
shows that 5s = 0. Conversely, we have the following result. 


Lemma 1.1. Let MW bea restricted L-kernel, and suppose it is possible to 
choose the operator map t, the hindrance +, and the elements x*, corresponding 
to the elements x of a basis for L over F, in such a way that 8+ =0, and 
T2(y) =0, for all ye L and each basis element x. Then M is extendible. 


Proof. Since 6s = 0, we can define an extension (H,¢) of M by L as 
follows. As a vector space, # is the direct sum (L,M) of LZ and M. The 
commutation in # is defined by the formula 


[ (2, u), (y; v) | (L2, yl; te (v) — t,(u) + [u, v| + y)). 


Indeed, the Jacobi identity is equivalent to 6s =0. The homomorphism ¢ 

of £ onto L is defined by ¢(z,u) =a. There remains to define a p-map in 

E such that (#,¢) becomes a restricted extension of M by LZ which induces 

the given kernel structure on M. Let Rg denote the universal enveloping 

algebra of H, and let (a;) be the basis for Z over F for which the 2;* have 

been defined. We write a given element x of Z in the form r= Dd ae; If 
i 


ueM we define (2, u)'! within Rg by the following formula, in which it is 
assumed that Ry, has been identified with a subalgebra of Rg in the canonical 
fashion: (2, = (x, wv)? — (uP — ul!) — ¥ (ai, 0)? — 


By the usual computation with p-th powers in an associative algebra (all the 
formal results of this kind which are needed are collected in [3]), one sees 
that this element is actually the sum of a linear combination of commutators 
of elements of HZ and the element wu! + 3S a,?(2;!?1, 2;*), so that it actually 


lies in H. The map (2, u) — (a, has the additive property of a p-map in 
virtue of our definition, and it is also evident that (a(2, u))@! = u) 
Hence, in order to conclude that it is actually a p-map, it suffices to show 
that (2, u)? — (z,u)™! is in the center of Rp. Now we—ul?! is in the 
center of Ry. Also, since ty is a restricted derivation, (y, 0) (uw — ul!) 
= (we — ull) (y,0), for every ye L. Hence u? —u!! is in the center of Rg. 


= 
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Hence it suflices to show that each (2;, 0)? — (a;'?!,a,*) lies in the center 
of Ry. Since tz,» — tz,01 = Dz,+, each of these elements commutes with every 
element of M. On the other hand, the computation we made above in con- 
nection with T,(y) shows that 


((xi, 0)? xi*))(y, 0) (y, 0)((a%, 0)? a;*)) (0, T2,(y)) = 0. 


Hence we have indeed a p-map with which (1, ¢) is a restricted extension 
of M by LZ. It is evident that this extension induces the given kernel struc- 
ture on M. This completes the proof of Lemma 1. 1. 

Now we observe that we can carry deviations along with our operations 
on kernels in a linear fashion. More precisely, if Mf, and M, are two 
restricted [-kernels with a common nucleus and if we have deviations for 
M, and M,, the natural carrying over of the operators, hindrances, and 
elements x* will yield deviations for 1, -++ IM, which are the sums of the 
deviations for M, and M,. Similarly, the a-multiples of deviations for M 
are deviations for «//. These facts can be seen directly from our definitions, 
and require no computational proofs. It is clear how one can now apply 
Lemma 1.1 in order to show that our operations on kernels induce the 
structure of a group or, if N'?] — (0), of a vector space in the set of equiv- 
alence classes of restricted L-kernels with a common nucleus JN. 

As we have already remarked above, the primary deviations of a restricted 
LI-kernel M with nucleus N all belong to a single cohomology class in 
H*(L,N). Furthermore, it follows from what we have-.just seen that this 
yields a homomorphism of the group of the kernel classes into H*(L, NV) 
which is, moreover, a linear map in the case where Ni] = (0). If every- 
thing concerning the p-maps is ignored, and ordinary kernels for ordinary 
Lie algebras are considered, it is easy to see that the linear map of the space 
of the kernel classes into H*(L, N) is an isomorphism. In fact, if a kernel 
has a primary deviation 57 which is a coboundary, i.e., of the form 380, 
where o is a 2-cochain for Z in N, then we can replace the hindrance 7 
by the hindrance s—o, and thus obtain 0 as a primary deviation. By 
Lemma 1.1, it then follows that the kernel is extendible. Finally, in the 
ease of ordinary Lie algebras, the construction of kernels which we shall 
give in the next section leads at once to the conclusion that every cohomology 
class in H*(L, N) arises from an L-kernel with nucleus N, so that H*(L, NV) 
is isomorphic with the space of the kernel classes, by the linear map which 
is obtained from the primary deviations, just as in [1] for the case of groups. 


2. Construction of kernels. We begin with the construction of an 
associative kernel for an arbitrary associative algebra U over the field F. 


[ 
| 
| 
| 
| 
| 
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Then we shall take for U the u-algebra of a restricted Lie algebra L and 
turn the U-kernel into a restricted L-kernel by replacing the multiplications 
with commutations. 

We denote by u—w’ a linear isomorphism of U onto a copy U’ of the 
underlying vector space of U. Let J denote the sum of the homogeneous com- 
ponents of degree 1 and 2 of the tensor space over U’, relative to F, i.e., 
I[=U’+U’®@U’. We introduce an auxiliary associative algebra C of dimen- 
sion 2 over F': C = Fe + Ff, where ec? =e, f? =f, ef =f, fe—e. We make 
the direct sum J — C+ I into an associative algebra over F’, containing C 
as a subalgebra, and such that IJ = (0), while ev = v = fv, for every vel. 
Now let K denote the direct sum J + U@U. It is easy to verify that there 
is a multiplication in K with which K is an associative algebra over F' con- 
taining J as an ideal, and such that the following requirements (which 
evidently characterize the multiplication) are satisfied : 


(U'@U’+U@U)K = (0); for u, uw, and uw, in U, 
e(u; us) uy’ us’, @ ue) = 0, and 
u’ us) = (uu,)’ us’ — uw’ @ 


One can verify directly that the center of K is (0). Finally, let N be a 
left U-module over F, and put M—K-+N. We make M into an associative 
algebra over F’ such that MN = (0) —NM. Then N is the center of M. 

Now let g be a given 3-dimensional cocycle for U in N. This means 
that g is a trilinear map from U to WN satisfying the identity x- g(y, z, t) 
—g(xy, 2,t) + g(x, yz, t) —g(a, y, zt) =0. We can make M into a left 
U-module containing NV as a U-submodule, and such that U-J = (0) and 
@ us) = uu, uu, + g(u,u,u2). We claim that u- (ab) 
= (u-a)b, for all a and } in M and all wu in U. Indeed, since J is an ideal 
in M which is annihilated by U, u- (ab) and (u-a)b are both 0 if aed. 
IfaceU@U+ N then ab —0 and u-aeU®U+N, so that (u-a)b =0. 
This suffices in order to establish our claim. 

Next we define right U-operators in M such that 


e-u—w, f-u—0, and 
(uu)? + u,’ w’. 
These right U-operators commute with the left U-operators. On J, this 
follows at once from the fact J-U CJ and U-J=(0). On U®U+N, 


this follows from the fact that U-(U@®U+N)CU®U+N and 
(U®U+ N)-U= (0). 


| 
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Furthermore, (ab) -w=a(b-u), for all a and 6 in M and all w in U. 
Indeed, if b lies in U’@U’+U®8U-+WN then so does ab, whence 
(ab)-u—0—a(b-u). Hence we may assume DeC+U’. Then, if 
aceI+U@U+N, ab=0 and b-uel so that a(b-u) =0. Hence we 
may asume that ac C and be C+ U’, and in this case it is easy to verify 
directly that (ab) -uw—a(b-u). 

Finally, we claim that (a-u)b—=a(u-b). Indeed, if aeU@U+N 
we have (a:u)b —=0—a(u-b). Hence we may suppose that aeJ. Then 
a:uel. Now if be J +N we have u-beWN and therefore (a-u)b —0 
—a(u-b). Hence we may now assume that aeJ and be UQU. Now if 
ae Ff+U’S@U’ we have a-u=0O and a(u-b) =0. There remains the 
caseae Fe + U’ andbe In this case, the relation (a: u)b —=a(u-b) 
is easily verified directly. 

With our right U-operators, M is not a right U-module. Instead of 
(U,U2), we have actually u,) U2— a+ = a(u; 
Indeed, each side is 0 whenever ac Ff+U’@U’+U8U+N. The 
remaining case ae Fe + U’ is easily checked directly. 

Now take U—Uy,*, where L is a restricted Lie algebra over F. 
(U;=R,/Pr, where Py, is the ideal generated by the elements of the form 
a? — lel, with ee L. is the ideal of Uz which is generated by the 


elements of Z.) Then NW is already defined as a restricted L-module, and 
the given cocycle g represents an element of the restricted cohomology group 
H,°(L,N). We regard M as a restricted Lie algebra with [a, b| = ab — ba 
and a'?] —a?. If xe L we denote by ¢, the restricted derivation of M which 
is defined by #,(a) = 2:-a—a-wx (working in the ring of linear transforma- 
tions of Fr @ M, one sees at once that t,(a!!) == D,?1(t,(a))). We have then 


(tzty — — (a) 
= (yx) —(a-2)-y+a- (ay) =[e@y—y a]. 


Hence ¢ induces a homomorphism of LZ into D.(M)/I(M). If uz denotes 
the left z-operator on M and v, the right z-operator we have tz—= uz— tr. 
Since uz, commutes with v,, we have therefore t,? = u,”—v,”. Since M is a 
left U;*-module, u,? = u,1. On the other hand, one finds by induction on & 
that, for k > 1, v,*(a) =a: + a(a*" 2), and so v,? = — where 
Thus t,? — t,01 == D,+, which shows that our homomorphism 
of LZ into D)(M)/I(4) is a restricted homomorphism, so that we have the 
structure of a restricted L-kernel on M. In order to exhibit the dependence 
of this structure on the given cocycle g, we shall now denote our kernel by %/,. 
- Observe that the nucleus of M, is N, with N'] — Ne — (0). 
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If h is any 2-cochain for U;* in N it is easily seen that Ms, is extendible. 
In fact, we can actually define the structure of an associative algebra on the 
space U;*-++ Ms, which contains the associative algebra M5, as an ideal. 
The multiplication is defined by the formula 


(u+a)(v+b) 


where wu and v are in U;*, and a and 6} in Ms,. We make this associative 
algebra into a restricted Lie algebra in the same way as we made Jf into a 
restricted Lie algebra. Then Z-+ Ms, is a restricted subalgebra, and if we 
define the pair (L + Mu, is a restricted extension of Mo, 
by L which induces the given kernel structure on M5. 

Now we can easily find deviations for J/, which depend linearly on g. 
In fact, it is clear from the above that we may take r(z,y) =ct@@y—y®@uz 
for the hindrance. .\ direct computation shows that the corresponding primary 
deviation 8r is given by =g(2.y.2) —9(%,2,y) +9(2, Y) 
—g(z,y,2) + 9(y, 2,2) —g(y, 2, 2). 

In order to compute the secondary deviation, write y, for D,*(y), and 
note that 


te! (©, Yp-r-i) ) te! @ Yp 11 OL) = (4 Yp — Yp-14 7) 


for i= 1. If we sum these terms from to i= p—1, and then add 
the term x — 2, we find that 


tel (2; Ypi-i)) = (9 (24, — 9 (24, t)) + cl 
i=1 


i=0 
p-1 
i=0 
By a well known identity, valid in any associative algebra of characteristic p, 


1 
we have s atyy i= ye. If we substitute this above, we find that, using 


in T,(y), we have 


Te(y) = 9(y, 21,2) + (9 (2+, 2, Yp-si) — 9 
4=1 


By what we have seen in section 1, it follows from the linear dependence 
of these deviations on g that the map which sends g into the equivalence 
class of Mf, is linear. Since Ms, is extendible, this linear map induces a 
linear map of H,,°(I, N) into the space of the kernel classes with nucleus N, 
Where Nir] — (0). 
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We shall show next that this linear map is an isomorphism. This means 
that if M, is extendible then g is a coboundary. Suppose that (£,¢) isa 
restricted extension of M, by L which induces the given kernel structure on 
M,. Then we can find a linear map x—-e(zx) of L into F such that 
and t,(m) = |[e(x), mJ, for all ee L and all meM,. Then, 
for x and y in L, [e(x), e(y) |] —e([a, y]) is an element of M,, and the inner 
derivation which is effected by this element coincides with [ to, ty] — toy 
which is the inner derivation effected by e®y—y®@zx. Hence we have 
fe(x),e(y)|] —e([a,y]) —tSy+ySreN. Similarly, the inner deriva- 
tion which is effected by the element e (x) — e(a”!) coincides with t,” — 
which is the inner derivation effected by x?*®z. Hence we have 


e(x)) — — @zreN. 


The component J of M, is easily seen to be a restricted ideal of #, and 
M,/J may be identified with the component = of M,. 
Put S = L/J, and let ¢’ be the homomorphism of S onto L which is induced 
by ¢. Then (S,¢’) is a restricted extension of Q by L. If s(x) denotes 
the coset mod. of e(x) we have, from the above, [s(z),s(y)] —s([@, y]) 
and s(x)” Since [Q,Q| 
= (0) = Q!*!, we can apply the theory of [3] to (S,¢’). In particular, this 
theory shows that there is a 2-cocycle h for Uz* in Q such that (S, ¢’) is 
equivalent to the restricted extension (4£,,¢,) which is constructed in the | 
standard fashion from h, i.e., the space of /; is the direct sum (L, Q), the 
commutation is given by the formula 


[ (41, 91), (Xe, Go) |] = Te], G1 — + h(a, —h(22,%)), 


the p-map is given by (a, = - + x)), and g) =«. 

Now let us consider the extensions (S/N, ¢*) and which 
are obtained by reducing the above extensions mod. N. These are equivalent 
restricted extensions of Y/N by L. From our above relations involving the 
map s of L into S, it is clear that (S/N, ¢*) is equivalent to the extension 
(Ex+, ox*), Where & is the 2-cochain for U;* in Q which is defined by 
k (u,v) =u®v, and k* is the 2-cocycle (for =g) for in Q/N which 
is obtained by reducing kmod.N. On the other hand, (#;/N, ¢:*) is 
equivalent to (H+, n+), where h* is obtained by reducing h mod. N. It 
follows from the theory of restricted extensions that there is a 1-cochain c* 
for U;* in Q/N such that h* + 8c* —k*. Choose a 1-cochain c for U;z* in 
Q which reduces to c* mod. N. Then the 2-cochain k — h — 8c takes values 
in N, and 6(k— h — 8c) = 8k =g, i.e., g is a coboundary. Hence we may 


| 
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conclude that the map g— M, induces a linear isomorphism of H,°(L, V) 
into the space of the restricted kernel classes with nucleus NV, and N'?! = (0). 
There remains to show that this actually is an isomorphism onto, and this is 


the purpose of the next section. 


3. Construction of cocycles. We require a construction which attaches 
a 3-cocycle for U;* in N to a given restricted L-kernel 2 with nucleus N, 
such that V2] — (0). Let «—>t, be a linear map of Z into D.)(M) which 
represents the kernel homomorphism of LZ into Do(M)/I(AM), and let + be a 
corresponding hindrance, so that [t,, ty] = Drew. The restricted 
derivations ¢, of M extend uniquely to derivations of Uy which we still 
denote by ¢,. We have then, for all « and y in L and all uweUy, 
— — tpey)) (U) = y)u— y). 

We shall construct a non-associative algebra Q over the tensor product 
(relative to the base field #) U;,®Uy which contains Uy as a subalgebra 
and whose deviation from associativity will yield the required 3-cocycle for 
U,* in N. The underlying idea of the construction of Q is that Q is made 
to come as close as possible to the u-algebra of a Lie algebra, which would be 
a restricted extension of J/ by Z inducing the given kernel structure on M, 
if it actually existed. At any rate, Q/QN will actually be the u-algebra of a 
restricted extension of J//N by L. 

We choose a well-ordered basis (2;) for L over F and write the elements 
of Uz, as linear combinations of ordered monomials in these basis elements, 
with all exponents less than p. The elements of Q will be written in the form 
2 Guittes where the qg; are such ordered monomials, now regarded as elements 


of Q, and the wu, are arbitrary elements of the subalgebra Uy of Q. This 

notation will be consistent with our definition of the multiplication in Q, 

for we define, for we Uy, The products u(qv), 
k 


for uw and v in Uy and q an ordered monomial, are defined inductively on the 

degree of q as follows: u(xjv) = —t,,(w)v. Suppose we have already 

defined u(qv) whenever q is of degree no greater than some m, in such a way 

that u(qv) = S qx(uxv), where each gy, is a partial product of the factors 
k 


of qg, and u,e Uy. Then, if r is an ordered monomial of degree m + 1, we 
have r= 2x;g, where q is an ordered monomial of degree m. We then define 
u(rv) = ¥ ign (wv) — te,(u) (qv). Note that, since g, is a partial product 


of the factors of g, xig; is indeed an ordered monomial with all exponents 
less than p. 


— 
= 
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For each basis element 2;, we select an element 2;* in M such that 
te,? — tz,01 = D,,+, on M. Then, on Uy, (tz,? — (u) = — ua;*. 

Now we define the products 2;(qv) for ordered monomials q by induction 
on the lexicographic order ((degree of q),71). We may suppose that q is of 
degree > 0, and that the products of smaller lexicographic order than 2;(qv) 
have already been defined. We write g = 2;‘r, where 0 < e < p, and r is an 
ordered monomial containing only 2;’s with k > j, or r=1. Also, we set 
s=2j°"r, so that g = ajs. Now define (It is implied throughout that our 
definitions of products ab are extended by bilinearity to products of linear 
combinations of terms like a and b.) 2 (qv) =aqv, if either 1 < j, or 
t=j and e< p—1; ai(quv) = (a?! + (rv), if += 7 and e=p—1; 
ai(qu) = 2;(xi(sv)) + ([2i, (sv), if +> 7. 

Finally, the definition of the multiplication is completed by defining, 
inductively on the degree of the monomial in the first factor, (qxiw) (rv) 
= 4(2;(u(rv))). 

It is evident from our definitions that if a and 6 are arbitrary elements 
of Q@ and weUy we have (ab)u—a(bu). We shall prove also that 
(au)b =a(ub) and (ua)b = u(ab). This will be proved by induction on 
the degrees of the monomials occurring in a, using preliminary results which 
we must establish first. 

If aq is an ordered monomial beginning with 2;, and u,v are in Uy, 
our definitions show that w(aiqv) = 27;(u(qv)) —ts,(u) (qv). From this, 
it is easily shown by induction on the degree of the ordered monomials 
occurring in a that u(va) = (uwv)a, for all ain Q and all u,v in Uy. Using 
only this and the last item of our definition of the multiplication in Q, one 
finds that (aw)b = a(ub), for all a and b in Q and all u in Uy. 

Next we shall show that, for all w and v in Uy, and all ordered monomials 
qg, we have u(2;(qv)) = (ua;) (qv). Indeed, this is clear from the above 
whenever 2g is an ordered monomial, i.e., whenever g = 2;r, where either 
j >i, or j =i and e << p—1. Now suppose that Then we have 
u(a(qv)) =u(az)(rv)) + (ua*) (rv). Assuming our result has already 
been proved for monomials of smaller degree than gq, this can be written 


u(ai(qv)) = (ua?! + (rv) = + — (u) (rv) 
= (xP*(u(rv))) —t,,?(u) (rv). 


Now we have 2;(u(rv)) =u(arv) + tz,(u)(rv), and if we apply 


this p—i1 times we obtain 2*(u(rv)) * rv) 
k=0 


p-1 
= > (—1)*tz,*(u)(a?*“*rv)._ If we multiply by 2; and then replace z;t,,"(u) 
k=0 


b 
a 
( 
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by ta#*(uw) + te,*(u)2; most of the terms from the above sum will cancel out, 
and we obtain = tz,?(u) (rv) + (ua) (qv). Substituting 
this above, we find w(zi(qv) ) = (ua) (qv). 

There remains the case g = as, with 7 <7. Here we make the induction 
on the lexicographic order ((degree of q),7). Then, assuming the result 
proved in lower lexicographic order, we have u(x(qv)) = u(2;(a%(sv) ) 
+ [ai, 2] (sv) 4- 2))(sv)) = (ua) (ai(sv)) + (ul ai, aj] + ur(ai, In 
the first term, replace wa; by xju —t,,(u). Then we can write it in the form 
a;(u(ai(sv))) (vi(sv)). By inductive hypothesis, this is equal to 
a;( (way) (sv) ) — = (ajay — ajtz,(u) — tz, (u)a;)(sv). Hence 
we have 


(u(ai(qu)) = — ajt,,(u) — te, + ula, + ur (a, (sv). 
On the other hand, using the inductive hypothesis, 


(was) (qu) = ai(u(qu)) — te, (qv) = (way) (sv) ) — (te, (4) 29) (80) 

= ( (xju — te, ) (sv) ) — (te, (U4) (sv) 

= — (aite, (14) + te, (sv) 

It is easy to check that the first factors in our final results for u(a;(qv)) and 
(ua;) (qv) are equal. Hence we may conclude that u(aa) = (ua;)a, for all 
ain Q and all wu in Uy. 

We are now in a position to prove inductively on the degree of the 
monomials in a that (wa)b—u(ab). The critical inductive step is the 
following : 

(u(xjqv))b = ( (ux) (qv) )b = ( (au — te, (u)) (qv) )b 

= (a,(u(qv)))b— (te, (qv) )b. 
By inductive hypothesis, the second term is equal to t,,(w)((qv)b). On the 
other hand, since xq is an ordered monomial, it is easily seen from our 
definitions that the first term is equal to 2;(u((qv)b)), and hence, by the 
above, is equal to (a;uw)((qv)b). Hence we have 


(u(aiqv) — te (w)) ((qu)b) = (wes) ((qr)b) 
= u(xi((qv)b)) = 
which is what was to be proved. 


Now observe that our kernel structure on M induces the structure of a 
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restricted Z-kernel in M/N. By Lemma 1.1, this restricted L-kernel M/N 
is extendible. Denote by (H,¢) the restricted extension of M/N by L which 
is constructed as in the proof of Lemma 1.1. The space of EF is the direct 
sum (L,M/N). We define a linear map p of Uz; into Ug by making the 
image of an ordered monomial gq in the basis elements of LZ equal to the 
ordered monomial in Ug which is obtained by replacing each 2; by (2,0) eL. | 
Let o denote the canonical homomorphism of Uy onto U uyn C Up. Then 4 
p@o is a homomorphism of Q onto Ug whose kernel is the ideal QN of Q Fy 
which is generated by the elements of N. Since Uz is associative, it follows | | 
that a(bc) — (ab)ce QN, for all a,b,c in Q. 

We have a linear decomposition of Q into the subspace Uz, and the 
ideal U;, ®@ Uy*. For any element a of Q, denote by a® its component in Uy, 
so that a= a°®+ a*, where ate We claim that there is one and 
only one linear map y of QN onto N which is the identity map on N and 
which is such that y(ab) =a°-y(b), for all be QN and all ae Q, where NV 
is regarded as a U;-module in the natural fashion. Indeed, there can 
evidently be at most one such map. In order to prove its existence, it suffices 
to show that if (n;) is a basis for NV over F, and a; are elements of Q such 
that } ayn; = 0, then each a; must belong to U, ®Uy*. Let u; denote the 

j 


product of the finite set of elements n,?-', where & runs over all those indices 
for which a, 40 and k~j. Since each n,” = 0, right multiplication by u; 
gives ajnjuj; = 0, and this evidently implies that a;eU,@ Uy’. 

We shall now have to consider multiplications in Uz, and multiplications 
in Q simultaneously. In order to avoid confusion we shall therefore denote 
elements of Uz;* by 2, y,2, ete., and their corresponding elements in Q by 
x’, y’, 2’, ete. With this understanding, we define a trilinear function g from [| 
to N by g(2,y,2) =y(2'(y'2’) — (2’y')2’). 

Now we observe that if any one of the elements a,b,c is in QM then 
y(a(bc) — (ab)c) =0. We use the fact that QM = MQ=—U,® Uy", so 
that annihilates MQN =—QMN. With me WM, our assertion now follows 
from the relations 


(ma)(bc) — ((ma)b)c = m(a(be)) — (m(ab))c = m(a(be) — (ab)c) © MQN; 

a((mb)c) — (a(mb))c = a(m(bc)) — ((am)b)c = (am)(bc) — ((am)b)e, 
which belongs to MQN by the first case, because am e MQ; | 

a(b(mc)) — (ab)(mc) = a((bm)c) — ((ab)m)c = a((bm)c) — (a(bm))ec, 
which belongs to MQN by the second case, because bm e MQ. 
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From this we can show that g is a 3-cocycle for U;* in N. In fact, 
we have 
y, 2 t) = g(y, 2, t) — g(ay, 2, t) + g(a, yz, t) — y, 2t) 

= (20) — — (ayy + 

+ — — (y'(aty’) + 
Since (zy)’ —a’y’ ¢ QM, etc., we may replace (zy)’ by 2’y’, etc., in the above 
without changing the value of the total expression. If we make these replace- 


ments we obtain 
89(2, y, 2, t) = y(((a’y’)2’ — 2’(y’2’))t’). 


Now (a’y’)z’ — a’(y’2’) e QN, and if ne N we have nt’ = t’n —t-n, whence 
y(ant’) = 0, for all a in Q. Hence we may conclude that 6g = 0. 

Next, using that «’y’ — = [a, y]’ + 7(2,y), for z,y in L, ete., one 
verifies that, for x,y,z in LD, 


g(a, 2) — z, y) + g(2, — y, + gly, 2, — gy, 2, 2) 
= 8r(2, y, 2). 


This means that the primary deviation of the kernel AZ, which was constructed 
from g in section 2 coincides with the primary deviation of Mf. We wish to 
show that M is actually equivalent to V,, i.e., that the kernel M+ M_, is 
extendible. We shall prove this by means of Lemma 1.1. A hindrance p 
for M+ M_, is given by p(x, y) + r@y—y@u. By what we 
have just seen, the primary deviation $p is then 0. 
Put Then we have, in M+ 
— t,,te1 = We shall show that the secondary deviation computed 
| from p and the 2;° is 0. It suffices to prove that T,,(2;) = 0, for all ¢ and j. 
First take the case We have T,,(2i) = te,(ai°) — p(ai'?!, 2;) 


tr,*(p(a, This reduces to = te,(xi°) — 24). 
k=0 
On the other hand, we have, in Q, 
— a; = a; + — + a; 


This last expression is a secondary deviation for the kernel M, and therefore 
lies in NV. Hence it is equal to its image under y, i.e., to g(a, 7", 2;). On 
the other hand, we have, in M_,, 


te, @ x;) = a, — x, @ — 2;). 


From these two results we see at once that the above 7,,(2;) = 0. 


| 

| 
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The case j £1 is considerably more difficult. In order to dispose of it, 
we require further associativity relations in our our algebra Q. These are 
the following: let a:,---,@m be a set of elements of Q such that, with 
some given pair of indices 1 and j, 147, the monomials occurring in the a’s 
involve only x; and 2z;, the sum of their degrees in 2; being no greater than 
p—1 and the sum of their degrees in 2; being no greater than 1. Then 
all products a,- - ‘dm (with all possible arrangements of the parenthesis) 
are equal. In order to prove this, we observe that each of these products is 
obtained by first writing down the factors 2;, z;, or we Uy in the order in 
which they occur in a@;- - ‘dm and then making a series of straightening 
operations as follows: 


or according to the schemes obtained from these by interchanging 7 and j. 
Clearly, each such straightening yields a linear combination of (symbolic) 
products each of which is of the same type as @,- * -@m, the index j being 
replaced by some possibily different index /, resulting from [2;,2;]. Hence 
it will suffice to show that the order in which these straightenings are carried 
out is immaterial for the final result. It is clear that differences can result 
only from inversion of the order of the straightenings of a triple, as shown 
in the following scheme, where (a,b) denotes the result of straigtening the 
symbolic product ab, ete. : 


If we write down all possible permutations of 3 elements from the set 
U, Vi, Vi, Vi, Zj, We see at once that there is only one triple in which such 
alternatives exist. This is if 1> 7; or if But, since u 
associates with every pair, the two straightening procedures indicated above 
give the same result. This completes the proof of our assertion. 

What remains to be proved is that, for 7 41, T,,(2;) =0. This is the 
sum of a secondary deviation for Jf and a secondary deviation for M_,. We 
have already computed the secondary deviation for M_, in terms of g. The 


result was — 5 (9(v#, Do? —g(r#, 
k=1 


By the «-sociativity relations we have just proved, the values of g which occur 
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in the sum are 0, except possibly for k==p—1. Hence the secondary 
deviation in M_, reduces to 


— + xj, v1) — %, 2). 


Now suppose first that j > 7. Then we have, in Q, = (a? 
whence g(x, =0. Similarly, 2, 2;) =0. Hence the above 
deviation for J/_, reduces to — g(a;, v?-1, z;). We must therefore show that 
the corresponding deviation for M takes the value g(a;,z?",2%). Let ¢ 
denote commutation with a; in Q, i.e, €(4) =axa—az; Consider the 
product xv? in Q. Since the products of at most p—1 factors 2; and 
one factor z; are associative, we have, by an associative identity in charac- 

p-1 
teristic p which we have already used in section 2, > 
k=0 
= + Hence we obtain = 
k=0 


(x In the first term, we may replace xa; by ; 
k=0 
in the sum, we may re-associate the products because £?-*-*(2,;) is of degree 1 


in the 2’s, and then replace by — €?*(a;). Then we 
obtain a; = — £?(a;). Hence we find 


— (xvi? *) = te, — — 25) + (24). 


If we write out £?(v;) in full, we find that the right side of the last equation 
is precisely the value of our secondary deviation for M. Hence the associator 
on the left lies in N and therefore coincides with g(a;, 7, x). Hence the 
secondary deviation for MW is equal to g(2;, v?-1, x), as we wished to show. 
There remains only the case 7 In this case, = 0, so 
that the value of the secondary deviation for M_, reduces to g(a;j?"1, xj, %) 
— g(a", x, xz;). Accordingly, we consider the difference of associators in Q, 


(ajay) — (xP a; — (aaj) 4+ a; 


= — — (xP a; + ala; 


By an associative identity like the one we have used above, we have, with the 


same justification as before, (— and similarly 
p-1 k=0 
= (— In the first of these equations, we 
k=0 
isolate the term with / = p—1 as before. Then we substitute the sums in 


e 
h § 
n ff 
) 
is 
n 
g 
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the above expression for the associator difference and re-associate permissibly 


to obtain 


p-1 p-2 
=0 k=0 


+ alle; + 
This reduces to — £?(x;) — + —1,,(a;*). As we have already 
seen above, this is the negative of the secondary deviation for M, and hence 
lies in N and so coincides with g(x", — %). 
Hence we may now conclude that the deviations for M+ M_, are 0, so 
that M + M_, is extendible, i.e., M is equivalent to M,. This completes the 
proof of the following main result. 


THEOREM 3.1. The map g—M, of the space of 3-cocycles for Uy* in 
the U;*-module N induces a linear isomorphism of H,?(L,N) onto the space 
of equivalence classes of the restricted L-kernels that have N as nucleus, with 


— (0). 
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REMARKS TO AN EARLIER NOTE (Vol. 57, pp. 539-540). 


By AvuREL WINTNER. 


By using the elements of the theory of normal families, the “best” 
domain of regularity, as determined in [2], will in this note be transferred 
to the case in which the differential equation is regular only in the unknown 
function. This will supply the determination of the “best” convergence 
interval of the expansion introduced by Perron [1]. 


1. Under the normalization that (0,0) is a point of the (2, y)-region 
on which a given function f(x,y) is continuous, consider the differential 
equation and the initial condition 


(1) dy/dx = f(x,y), y(0) =0, 


and suppose that, on the real x-interval under consideration and for small | y |, 
the function f(z,y) is a uniformly convergent power series in y, say 


(2) f(a, y) — 3 


Then the solution y(x) of (1) is not in general a power series @a-+ °°: - 
+a,2" +--+ but, corresponding to the mere continuity of the coefficients 
fx(x) of (2), a curve y= y(wx) about which not more than the existence of 
a continuous slope dy(z)/dz is assured. Correspondingly, the assumptions 
imposed on (1) do not exclude the case, important in many applications, in 
which the coefficients f;,(x) of (2), instead of being regular power series in 2, 
have at x0 singularities of the type of 23, wsin (z"),--- or are not 
analytic at all. Then such methods as that of the successive approximations 
(Cauchy-Lipschitz) are still applicable. But these classical methods, being 
applicable also when f({2,y) is not a power series in y, fail to utilize the 
advantages which, even from the point of view of a computer of y(), might 
result from the power series structure of (2). 

In a little-known paper, Perron [1] considered an algorithm which does 
not fail to utilize the assumption that f(x,y) is a power series in y (but not 
necessarily in 2). His procedure is as follows: First, replace y in (2) by 
ty, where ¢ is a parameter, and, for a fixed value of t, denote by yt = y*(z) 
the solution of what results from (1) upon this replacement; so that 
(3) dy*/dz == f(z, ty), y*(0) =0. 


* Received January 30, 1954. 
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Next, assume for the solution y‘(x) of (3) a power series in ‘t, say 


(4) yt (2) = 


(the coefficient functions ¢,(z) of (4) can be calculated uniquely, since 
substitution of (4) into (3), when followed by a comparison of the coefficients 
of ¢” on both sides of what then becomes of (3), leads to the recursive formulae 


where P, is a universal polynomial, with positive coefficients, in its 
(n + 1) + n arguments fi, ¢;). Finally, observing that (3) is the case t =1 
of (3), hope that the case {1 of (4) will supply a convergent expansion, 


(5) y(2)=3 bn(2), 


n=0 


of the solution y(x) of (1). 


2. In his paper referred to, Perron justifies this hope, on the interval 
(6) $b/M), 


under the following assumption: The coefficient functions f;,(#) of (2) are 
continuous on the interval 0S a-4-« and satisfy the inequality 


(7) | S for OS rSat+e (k =0,1,2,- - -), 


where a, b, AJ and « are positive constants. In fact, Perron proves by a 
majorant argument that, by virtue of (7), the series (5) is absolutely con- 
vergent on the interval (6) and represents there the solution y(x) of the 
case (2) of (1). [The letters a, 6, M, used above, correspond to the standard 
notations, those occurring in (8) and (9) below; in these notations, Perron’s 
four constants K, M, b —a, y are represented by M 1/b, a, 0 respectively. | 

A glance at Perron’s explicit solution of his majorant of (1) shows that, 
under his hypothesis (7), the occurrence of the factor 4 in the limitation 
(6) of the convergence range of (2) cannot be helped. The occurrence of 
the 4 seems nevertheless uncalled for, since, if a, b, WM have their meaning 
customary in the treatment of (1) by successive approximations, then the 
latter are known to lead to the range 


(8) 0=27Smin(a, b/M), 
which does not contain the $ of (7) and is the “best possible” range for 


function-theoretical reasons; cf. [2]. It is understood that the customary 
meaning of the three constants a,b, M is that contained in the assumption 
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that (for some fixed « > 0 which, except for the sake of a comparison with 
(7%), could be replaced by « = 0) 

(9) | f(a,y)| SM—e for OS rSa+e, |y| <b, 

where, (2) being a power series, y is allowed to be complex. Thus | y| <b 
is a circle, and not an interval, in the second proviso of the first inequality 
in (9). 

Accordingly, by Cauchy’s estimate of the coefficients of the power series 
(2), Perron’s inequalities (7) are implied by (9). The converse inference 
is false, as seen by choosing f;,(2) independent of & in (2), that is, by 
choosing f(z, y) =fo(x)/(1—y) (if |y| <1). In other words, (9) is 
more general than (7). This, when combined with the fact that, as empha- 
sized in [3], Cauchy’s principle of majorants can never supply “best” 
x-ranges of validity (“best ” from the function-theoretical point of view: 
ef. [2]), makes one suspect that Perron’s estimate (6) for a valid expansion 
(5) can be improved to (8), that is, to the classical estimate (8) for a valid 


range of successive approximations. 


3. In what follows, this suspicion of a mischievous part played by 
Cauchy’s majorants will be justified. In fact, it will be shown that if f(z, y) 
is continuous in (2, y), satisfies the first of the inequalities (9), and is regular 
in y, i.e., is of the form (2), whenever z and y satisfy the conditions specified 
in (9) (where with r—|y|<b, and not just —b<y<b, is 
meant), then Perron’s expansion (5) is convergent on the entire interval 
(8) and represents there the solution y(z) of (1). In addition, the conver- 
gence of (5) is absolute on (8). Incidentally, it will be clear from the proof 
that the convergence of (5) is uniform on (8) and that, at every point of 
(8), the derivative occurring in (1) can be obtained by term-by-term differ- 
entiation of (5). 

Proof. The occurrence of the « >0 in (9) implies the existence of a 
positive number A which is somewhat greater than 1 and has the following 
property: If ¢ is any complex number contained in the circle 


(10) |¢| <a, (A> 1), 
then 
(11) | f(z, ty)| <M for OS jy| <b. 


Consider (3) for a fixed ¢. The sequence of the standard successive 
approximations belonging to (3) (for a fixed ¢) is assigned by 


(12) Y'm(2) = f f(u, tytms(u))du, where =0. 
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In view of (11), all the functions (12) are defined on the interval (8), and 
(13) | ytm(x)| <b if OS eS min (a,b/M). 


Furthermore, since (2) and (11) assure the satisfaction of Laipschitz’s 
condition, 
(14) yt(x) as m—>oco on the interval. (8), 


where y*(z) denotes the solution of (3). 

On the other hand, since f(z, tz) results if y is replaced by ¢z in the 
power series (2), it is readily seen from the recursion formula (12) that 
each of the functions y*)(x), - is a power series in say 


n=0 


and that, corresponding to the induction leading from (11) and (12) to (13), 
each of the power series (15) is convergent on the circle (10) if x is on 
the interval (8). Hence, for fixed x on (8), the functions y‘m(x) of ¢ are 
regular on the circle (10). In addition, they are uniformly bounded there, 
by (13), and converge there to y*(x), by (14). Consequently, if 2 is fixed 
on the interval (8), then, by the elements of the theory of complex functions, 
y*(x) is a regular function of ¢ on the circle (10) and possesses, therefore, 
an expansion (4) convergent on (10). Hence, in order to conclude the 
validity (and the absolute convergence) of (5) at every point 2 of the interval 
(8), it is sufficient to note that, in view of the parenthetical inequality in 
(10), the value ¢ = 1 is allowed in (4). 

It is clear from this proof that the method and the result can be 
generalized from (1) to the case of a system. 
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ABSTRACT PROBABILITY SPACES AND A THEOREM OF 
KOLMOGOROFF.* ? 


By I. E. SEGAL. 


1. Introduction. In the more recent mathematical work on prob- 
ability the mathematical model for a probabilistic system that is customarily 
employed is a “ probability measure space.” This consists of a set M, a o-ring 
9n of subsets of M which includes the set J itself, and a countably additive 
non-negative measure m on 9M normalized by the requirement m(#) —1. 
In the present paper a somewhat different sort of model is presented which 
while equivalent to the standard one is more cogent for certain technical 
purposes, and seems to us to be more in agreement with the historical and 
conceptual development of probability theory. Among other technical advan- 
tages, the use of this model makes it possible to extend a well-known theorem 
of Kolmogoroff concerning the existence of random variables having pre- 
assigned joint distributions from the case of real-valued to that of general- 
valued random variables. 


2. Abstract random variables. Random variables are usually defined 
as measurable functions on probability spaces. This is a long way from either 
the practical statistical or intuitive conceptual formulation of the notion, 
especially as the probability spaces required for dealing with simple concrete 
situations may be mathematically relatively sophisticated. For example, in 
treating sequences of coin tosses one is interested to begin with in the pro- 
portion of heads in n throws, in elementary algebraic expressions formed from 
such propositions, and in their expectations. At present however, in order 
to give a rigorous treatment of these simple matters, one is required to 
introduce the infinite direct product spaces in which each factor is a fixed 
two-point space, and to formulate the proportion of heads as a measurable 
function on this rather ungeometrical space, which must yet be endowed with 
the appropriate measure. A more direct approach to the matters of basic 
interest seems desirable, and if it is possible to avoid the explicit use of 
infinite product spaces, so much the better. 


* Received November 10, 1953; revised March 29, 1954. 
? This research was supported in part by The Office of Naval Research. 
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Such an approach can be given by starting with axioms of simple con- 
crete significance for the random variables of interest and their expectations, 
rather than with relatively technical axioms concerning a measure on a space 
of “elementary events.” Instead of deriving the random variables from the 


space of events, one can go in the other direction, but actually for many 
purposes the space of events is not only not necessary and to some extent 
arbitrary but introduces tiresome mathematical complications concerning sets 
of measure zero. To arrive at an effective set of axioms, notice that in the 
coin-tossing example above, the random variables that one is primarily con- 
cerned with are products of proportions of heads in n throws (for various 
values of mn), and linear combinations of these. From the conventional “ space 
of events ” outlook, these are just the random variables that depend only on 
a finite number of throws. On this set 22 of random variables there is defined 


a key linear functional EF, the “ expectation,” which is determined by linearity 
from the knowledge of the proportion of heads in n throws and its various 
moments. Now whenever all such moments are given, the linear set deter- 
mined by the corresponding set of random variables will be an algebra, as is 
in particular the case with @. In the general case it is reasonable to start 
with some undefined “ random variables,” all of whose moments (including 
joint moments) are given, and one is led immediately by forming linear 
combinations to an algebra with a distinguished linear functional. 

Now to be specific, we define a probabilily algebra (®,H) as an algebra 
é@ (commutative and over the real field) on which there is a distinguished 
linear function /, having the following readily interpretable properties: 


1) For any element a of &, H(a)* = 0, and F(a?) =0 only if 


2) For any element a of ®, there is a constant » such that F(ab’) 
<= pF (b*) for all elements 6b of ®. 


3) @ has a unit e and H(e) —1. 


The second axiom is the most restrictive, as it means implicitly that it is 
being required that the random variables in @ are bounded. This is actually, 
however, a light restriction, since in a real experiment one can measure only 


such random variables, and mathematically there is no difficulty in treating, 
as usual, the unbounded random variables in terms of the bounded ones. 

The elements of @@ are the basic random variables, there may of course 
be additional ones that can be obtained from the basic ones by a limiting 


process. Without going into detail at this point about the precise character 


of this limiting process, we mention that the term “ probability algebra ” is 
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a- justifiable along conventional lines by means of the result that the most 
sf general probability algebra is essentially an algebra of conventional random 
ee | = variables (i.e. measurable functions) on a probability space, and this space 
ie is essentially unique, if it be required that the random variables of the algebra 
y | determine the probability space (in e.g. the sense that the smallest o-ring 
it with respect to which they are all measurable is the o-ring of the space, 
ts modulo null sets). The limit random variables obtained from the original 
le ones then correspond to the remaining random variables on the space. The 
1- next section is devoted to the elucidation and proof of the result just quoted. 
Is 
ef 3. Structure of probability algebras. The measure ring of a measure 
n space is defined as the ring of all measurable sets, modulo the ideal of all 
d null sets, and the measure of an element of the measure ring is thus uniquely 
y defined if set equal to the measure of any measurable set that is a repre- 
8 sentative for it. By a random variable on a probability space we shall mean, 
. to be perfectly explicit, an equivalence class of real-valued measurable func- 
8 tions on the space, two functions being defined as equivalent if they differ 
t | only ona null set. Any collection of random variables on a probability space 
y generates a subring of the measure ring, in the following fashion: if ft is a 
r random variable with representing measurable function f, the elements of the 
measure ring corresponding to sets of the form [a: f(x) SA], for arbitrary 
1 real X (i. e. to their residue classes modulo theideal of all null sets) constitute 
a o-subring of the measure ring, which is a o-ring. The o-subring generated 
by all these subrings as ft varics over the collection is the subring of the 
measure ring determined by the collection. A collection of random variables 
is said simply to be determining if the subring determined by them is the 
entire measure ring. 

THEOREM 1. Any probability algebra is isomorphic to a determining set 
of bounded random variables on a probability space, the isomorphism pre- 
serving alegbraic operations and expectation. This space is unique, in the 
sense that for any two such spaces there exists a probability-preserving iso- 


morphism between their measure rings which connects their isomorphisms 


with the probability algebra. 


To prove the existence part of the theorem, it is convenient to introduce 


complex random variables, in order to make use of a known representation 
theorem dealing with the complex case. Let @ be the complexification of ®, 
so that a general element of @ has the form x + iy, with 2 and y in @, and 
extend to @ by defining E(x + iy) = E(x) (y). It is easy to verify 
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that # as extended is complex linear. Now define an adjunction operation 
u—->u* on as follows: («+ iy)*—ax—iy. It is immediate that 
E(u*) = [#(u)]* (where * represents complex conjugation), E(u*u) = 0, 
and H(u*u) =0 only when u=0. Now introducing a function (u,v) on 
ordered pairs of elements in @ by defining (u,v) = H(v*w), it is clear that 
(u,v) is a positive definite inner product on 4, with respect to which @ 
has a Hilbert space # (say) as completion. The map w—> u* clearly extends 
to a conjugation J on & and there is no difficulty in verifying that the system 
(#,J, @) is a Hilbert algebra in the sense of [4]. As it is also commu- 
tative, it follows from a well known representation theorem that there exists 
& measure space P = (M,%9,m) such that the space L2(P) of square- 
integrable complex-valued functions is unitarily equivalent to & via a unitary 
transformation W from & onto LZ.(?) which transforms J into complex 
conjugation and takes @ into an alegbra of bounded measurable functions on 
? that is dense in L.(P). (More precisely, we are dealing here with equiv- 
alence classes of measurable functions, but in the present proof it would 
seem somewhat circumlocutory to insist on the distinction.) 

As an algebraic isomorphism preserves units, the unity element of @ 
maps into the unity function on M, whose square-integrability implies that ? 
is finite. More precisely, for any element u of @, E(u) = (u, e) = (Wu, We) 
and substituting ue it results that || We ||? 1, i.e. the integral over M 
of the unity function is one, so that P is a probability space. Now the image 
of @ is dense in L.(P), and the desired result now follows from the fact 
that any dense subset of Z.(P) is determining. To see this, note that the 
functions in L.(P) that are measurable with respect to the ring of sets 
determined by a dense set D constitute a closed linear manifold in L.(?), 
by the Riesz-Fischer theorem. This manifold is a proper submanifold if D 
is not determining, but it cannot be proper since it contains D, which is dense 
in Lo(P). 

To prove uniqueness, suppose that P’ = (M’, In’, m’) is a space with 
the same properties as P. Let D and D’ be the respective subsets of L.(P) 
and L.(P’) which map onto @ via the respective isomorphisms W and W’. 
The W-W’ is an algebraic isomorphism V of D onto D’. It suffices to show 
that V extends to a unitary map of L.(P) onto Z.(P’) that is multiplicative 
on bounded functions. For such a map takes idempotents into idempotents, 
and so will take a characteristic function (4 into a characteristic function C... 
As the map is unitary, A and A’ will have the same probability, and the 
induced map A-> A’ is unique modulo null sets. It is thus a probability- 
preserving map of the measure ring of ? into that of P’, and it follows from 
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the fact that V is an algebraic isomorphism on the bounded functions that 
this induced map is a ring isomorphism of the measure ring of ? onto that 
of P’. It is well known that such a map has a unique extension to an algebraic 
isomorphism of the random variables on ? onto the random variables on P’. 

Now to show that V has an extension of the sort indicated, note first 
that it has a unique unitary extension V, from L.(P) onto L.(P’), as it is 
isometric and maps a dense subset of L.(P) onto a dense subset of L2(P’). 
Next, if f is a bounded function on f and g is in @, then V,(fg) = Vi(f)Vi(g). 
For let {fn} be a sequence of elements of @ which converges in L2(P) to f. 
Then as g is bounded, {f,g} converges to fg in L2(P), so that Vi(fng) con- 
verges to V,(fg) in L.(P’). On the other hand, Vi(fng) = Vi(fn)Vi(g), 
and as V,(g) is bounded and {Vi(f,)} converges to V,(f) in L.(P’), it 
results that V,(f.g) converges to Vi(f)Vi(g). The stated equality now 
follows directly. Now the set of all elements g of Z.(?) for which 
V.(f9) = Vi(f)V:i(g) for every bounded function f on ? is a closed subset 
of L.(P), since either side of this equality is a continuous function of g. 
As it includes the dense subset @, it must be all of L.(P). 


Remark 1. It was noted above that a dense subset of L.(P) is deter- 
mining. Conversely, a determining subset of L.(P) ts dense, if it is an 
algebra of bounded functions.? To establish this measure-theoretic analog to 
the Stone Weierstrass theorem, let S denote the collection of all finite unions 
of sets of the form A; ((=1, 2,- -,m)], where A; is an interval 
(of any type) and the f; are elements of D. It is clear that 3 is closed 
under unions, and it is closed under differences, since 


[x: fi(w) © Ai] [w: 9; (x) By] 
a 
=f) [#: fi(x) Ai] [v: By), 
and the complement of B; is the union of at most two intervals. Thus 3 is 


*This result is stated by Kakutani in Proceedings of the National Academy of 
Sciences, vol. 36 (1950), p. 320, without the restriction to bounded functions. That it 
is invalid in this generality can be deduced from the non-uniqueness of the Stieltjes 
moment problem in general. The latter implies the existence of a function F of bounded 


variation on (0,©) such that =0 (n=1,2,---), while dF(A) = 0. 


Putting = flare), the of polynomials on vanishing at 0 
0 

determines the ring of Borel sets on (0,0) but cannot be dense in the space of func- 

tions square-integrable with respect to the measure determined by G, as every element 

is orthogonal to dF/d@. 
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a Boolean ring. It is clear that the o-ring @ generated by 3 is the same as 
the o-ring determined by D. Now if A is a finite interval, the characteristic 
function of A is a uniformly bounded limit of polynomials, which implies 
that the characteristic function of [x:f(x) eA] is a uniformly bounded 
limit of a sequence of elements of 9, if f is in D. 

Now if g is an element of Z,(P) that is orthogonal to all the elements 
of D, it is orthogonal to all uniformly bounded limits of sequences of 


elements of 9, and so in particular, f g=0. if Q=[a:fi(z) A; 
Q 


(t= 1,2,- --,mn)], where the f; are in and the A; are bounded intervals. 
By the countable additivity of the indefinite integral, the same holds when the 


A; are not necessarily bounded. It follows that f g = 0 for all elements 
s 


S of 3. As the collection of all measurable sets § such that J. g = 0 is 


closed under unions of monotone sequences and contains the ring 3, it 
follows that this collection includes the o-ring generated by 3.. Now every 
measurable set is equivalent to some element of ®@, since D is determining, 


so that j g = 0 for every measurable set 8. This means that g = 0, i.e. 
Js 


YD must be dense. 

In a similar fashion it can be shown that a determining algebra of 
bounded measurable function on P is dense in the algebra of all bounded 
measurable functions of P, in the topology on this algebra induced from the 
weak topology on the dual of the Banach space L,(?), by the identification 
of a linear functional with the bounded measurable function that is its kernel 
in the Riesz representation for the functional. 


Remark 2. An isomorphism of the type involved in the uniqueness 
part of Theorem 1 automatically has further properties justifying the 
identification of the two spaces from a probability viewpoint. It can be 
uniquely extended to an algebraic isomorphism between the random variables 
on the respective spaces, that preserves order (non-negatively) and expec- 
tation (i.e. maps a random variable of finite expectation into a random 
variable of the same finite expectation). 


4, Existence of random variables with preassigned distributions. A 
well known theorem of Kolmogoroff asserts that there exist real-valued random 
variables having any preassigned joint distributions satisfying the obviously 


necessary consistency conditions. The proof of the theorem does not extend 
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to the case of general-valued random variables, the basic difficulty being that 
a measure that is countably additive on each of a monotone increasing 
sequence of o-rings of subsets of a set need not be countably additive on 
their union. This difliculty essentially disappears when the theorem is 
approached from the standpoint of probability algebras. This is because the 
trouble in the conventional proof arises, roughly speaking, from trying to 
hang the random variables on the wrong space, while the use of probability 
algebras leaves the choice of the space of events quite arbitrary, within wide 
limits. 

In order to deal effectively with general-valued random variables, a 
modification of the usual notion must be introduced. The basic conceptual 
concern in sta‘istics is not so much with the values of the measurable function 
f representing a random variable at points of the space V of events as with 
the sets in M where f takes on certain values (and with the probabilities of 
those sets). That is, the inverse map / = f-! from (Borel) sets on the reals 
to subsets of WM is more central from some viewpoints associated with the 
origin of the theory. Taking the values of # modulo null sets, one obtains 
a Boolean oc-homomorphism Ft from the Borel subsets of the reals to the 
measure ring of ®. which determines uniquely the random variable ft repre- 
sented by f, and every such o-homomorphism preserving the Boolean units 
arises in this way. Thus ft and Ft are mathematically equivalent notions, 
and there are some mathematical as well as conceptual advantages in using 
Ft, particularly in the case of general-valued random variables. Accordingly 
we take 


Definition. <A generalized random variable X on a probability space P 
is a Boolean-c-homomorphism from a Boolean o-ring @ with unit to the 
measure ring of ®, which preserves the Boolean units. If @ consists of 
subsets of a set B, then V is called B-valued, and in any case is said to be 
defined relative to 


The usual notion of joint probability distribution of a set of random 
variables extends to generalized random variables as follows. For generalized 


random variables X; defined relative to B; (i—1,2,---,n), the joint 

probability that X; has its value in b; (‘(=1,2,---,n; b:¢ B;) is defined 

as the probability of the element N X;i(bi) of the measure ring of the 
i=l 

probability space. This probability, as a function of -, is the 


joint distribution of X,,X2,- -,Xn. 


Kolmogoroff’s theorem may now be extended as follows. 
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THEOREM 2. Let By) be a Boolean o-ring with unit uy, where » range: 
over an index set A. For each finite subset d1,A2,° + +,An Of A let there be 


given a function my, ,---», on the (set-theoretic) direct product [J B), to 
k=1 


the interval [0,1]. In order that there exist a probability space P and 
generalized random variables X, on P defined relative to By (XE A) such 
that the joint distribution of any finite set of the X),,X),°°-°,Xy, be 
VE necessary and sufficient that: 


2) if the elements by), of By, are fixed for 
then Or» Or» ts countably additive as a function of dy,. 


3) tf =u, for 7 >k, then 
(Ors >> Or.) = (Ory Ory Org)- 


The necessity of these conditions is obvious, as in the case of real-valued 
random variables. In proving the sufficiency we use 


LemMMA 2.1. The simple functions modulo null functions on a finitely- 
additive finite normalized measure space constitute a probability algebra. 


The distinguished linear functional of the -probability algebra in the 
lemma is understood to be the integral. If Q@=—(G,9%,g) is a finitely- 
additive finite normalized measure space—i.e. g is a finitely-additive non- 
negative measure on the Boolean ring % of subsets of G, where G is in 9 
and g(G) = 1—a simple function is a finite linear combination of charac- 
teristic functions of sets in 8. Such a function is a null function if it 
vanishes except on a set on which g vanishes. The simple functions constitute 
a ring in which the null functions are an ideal, and the quotient algebra @ 
is the algebra of the probability algebra whose existence is asserted by the 
lemma. 

If the simple function f has the form f = > Ca, With A; in &, the 

i=1 
integral H,(f) is defined as = ag(A;), and there is no difficulty in verifying 
that is linear on the simple functions, that £)(f?) = 0 while E,(f?) =0 
only if f is a null function, and that Ey(f?h) S »£(f*) for arbitrary simple 
f and fixed simple h, » being a constant depending on h. If ft denotes the 
equivalence class represented by f, and if H is defined on @ by setting 
E(ft) = £)(f), it follows that # is uniquely defined and that (CZ, F) is a 
probability algebra. 
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es Proof of Theorem 2. It is well known that a Boolean o-ring @) is 
De '  jsomorphic to a o-ring @) of subsets of a set By, modulo a o-ideal D> in it, 
where B, is a representative for the unit of @)/ Y),. It is therefore no loss 


to 
of generality to assume that 8, = 6,/D . Now set By, and let & 
d heA 
, — be the ring of all finite unions of “ cylinder sets,” i.e. sets of the form [J A), 
4 AcA 
e — where Aye @ for all A and A, = By except for finitely many A. Define a 


functional m on cylinder sets by setting 
m( Ay) = Ao, Any) 


heA 
where °,An} are the indices for which A,B), and setting 
m(B) = 1 to cover the case where there are no such indices. Then m has a 
unique finitely additive extension to the ring 0, as is well known. Applying 
the preceding lemma and Theorem 1, it results that there exists a probability 
space ? and a determining algebra @’ of random variables on P that is 
isomorphic (both as regards algebraic operations and the respective distin- 
guished linear functionals) to the algebra of simple functions modulo null 
functions on (B, 0, m). 
For each fixed » in the index set A, a generalized random variable X,, 
: on ® and relative to @,, may now be defined as follows. Let a, be an 
| element of @,, let A, be a representing subset of B, for a,, put f, for the 
equivalence class of the characteristic function of A’, = C,, where Cy = By 
for \Ayp and C, = A,, and let f’, denote the image in @’ of f, under the 
isomorphism just described. As f, is idempotent, so also is f’, which implies 
that there exists a unique element a’, of the measure ring of P such that f’, 
is the equivalence class of the characteristic function of any representing set 
for a’,. The map a,—da’, is designated as X,. From the fact that the 
preceding isomorphism is in particular an isomorphism when restricted to 
characteristic functions modulo null functions, it follows directly that XY, 
is a Boolean homomorphism of @, into the measure ring of P. It is clear 
that it preserves the Boolean units involved. To show that it is actually a 
o-homomorphism, let a,,a,@,: +--+ be a sequence of disjoint elements of 


n n 
B, of union a, Then a, = U a,, so that = U since Xp 
i=1 
is a homomorphism, and letting n become infinite, it follows that X,(a,) 
=U X,(a,). Now if two comparable elements of the measure ring of 
4=1 


have the same measure, they must be equal, so it suffices now to show that 


m’(X,(a,)) == m’( X,,(a,)), where m’ denotes the measure function on 
i=1 
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the measure ring of By construction, m’(X,(au)) = mz(an), while 
m’ ( U Xy(a,)) = 3 m’(X,(ax)) (for m’ is countably additive and the 


are disjoint, being a homomorphism) = my (a4) 
= m,(a,) by the countable additivity of m,. 
It remains only to show that 


n 
m’ ( X), (a,) ) (Os "5 
j=1 


for arbitrary @),,@,,° °°, 4@,, in B),, B),,- -, By, respectively. Using again 
the fact that the isomorphism involved above is in particular a ring iso- 
morphism on the equivalence classes of the characteristic functions, and the 
definition of the 1, the equivalence class of the characteristic function of 


n 
() X,,(a,) corresponds to the equivalence class of the characteristic func- 
j=1 


tions of a A’,,. As the isomorphism preserves the respective distinguished 
j=1 


linear functionals, 


n n 
m’( Xp, (a;)) == m( A’n,) = (Gry Bre 
g=1 j=1 


5. Weak distributions on linear spaces. In the case of linear-space 
valued random variables a special approach may be appropriate. In certain 
situations, the relevant object is not actually a probability distribution on the 
linear space, but a kind of weak distribution. For example, it is well known 
that there exist no probability distributions on Hilbert space having incisive 
invariance and smoothness properties. Nevertheless there exists a weak dis- 
tribution on Hilbert space which is significant for applications to quantum 
field theory and to the theory of Brownian motion. 


Definition. A weak distribution on a topological linear space B is a 
linear map u* => F'(u*) from the space B* of continuous linear functionals 
on B to the random variables on a probability space. 


In case B is finite-dimensional, there is clearly a unique distribution of 
the conventional type on B, such that the identity map on B* coincides with 
the given weak distribution, but in the infinite-dimensional case this need 
not be the case. In the case of a (real) Hulbert space 9/, there is an essen- 
tially unique weak distribution on 9% such that (identifying 9* with 9 in 
the usual way) F(x) is normally distributed with norm zero and variance 
while F(z,),---,F(a,) are independently distributed whenever 
@,° * *,%, are mutually orthogonal. When & is finite-dimensional, this is 
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essentially (i.e. via the correspondence between weak and ordinary distribu- 
tions just mentioned) the normal multivariate distribution in the standard 
form, while for infinite-dimensional 9 it constitutes a proper generalization 
of the normal distribution. 

The question of the existence of weak distributions with preassigned 
joint distribution for -, F(un*), Un* being an arbitrary 
finite set of elements in B*, can be resolved by the use of the original theorem 
of Kolmogoroff : it is necessary and sufficient for such existence that the usual 
consistency conditions hold, and moreover that if w* = au* + Bv*, then the 
prescribed joint distribution of F(u*), F(v*) and F(u*) ts such that 
F(w*) =aF(u*) + BF (v*). To derive the existence of the normal weak 
distribution on Hilbert space described above, the joint distribution of 
F(u:),- - +, (un) is prescribed as normal, with norm zero, and with co- 
variance matrix || (w;, u;) ||. The validity of the general consistency conditions 
follows at once from well known properties of the normal distribution, as 
does also the special condition for linearity. 


6. Concluding discussion. The present somewhat abstract point of 
view regarding variables has further theoretical advantages, in such areas as 
measurability problems in the theory of continuous stochastic processes, and 
in reformulating the theory in such a way that it can be extended to the 
non-commutative case. 

In these measurability problems the key difficulty is that of obtaining a 
unique and measurable least upper bound for a non-denumerable collection 
of random variables. If the random variables are formulated as measurable 
functions, the pointwise least upper bound will in general not be measurable, 
and in any case will depend upon the specific representations for the random 
variables that are selected. On the other hand, due to the completeness of 
abstract random variables as a partially ordered set, the least upper bound of 
an arbitrary collection of abstract random variables always exists uniquely 
as an abstract random variable. This approach has been applied by Nelson 
[3] to give a more intrinsic approach to such questions as the continuity of 
sample functions. A similar method could be used to treat generalizations 
of such questions as continuity of the sample functions in the case of 
generalized random variables, although in this case there is no analog for 
the sample functions; this is possible because there are analogs for the statis- 
tically significant aspects of the sample functions. 

The non-commutative theory essentially arises if the assumption of 
commutativity is omitted in the axioms for a probability algebra, and if 
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corresponding assumptions made in the assumptions on the distinguished 
linear functional. This transition from a commutative algebra to a non- 
commutative one is precisely the transition from a conventional probability 
space to a quantum-mechanical system in a definite state. In the one case 
it develops that the elements of the algebra can be represented by measurable 
functions on a measure space, while in the other it develops that they can be 
represented by operators on a Hilbert space. It is convenient in the non- 
commutative case to deal with a complex algebra admitting an adjunction 
operation *, although the conceptual loss thereby is to a considerable extent 
avoidable. In this case it is appropriate to assume, as is the case in the 
conventional scheme of quantum mechanics, that the distinguished linear 
functional has the properties: H(aa*) =0, H(b*ab) pal (b*b), and 
E(e) =1. The basic notions of probability theory, including conditional 
expectation can be extended to such a system, which always admits a repre- 
sentation in which the elements of the algebra are operators on a Hilbert 
space and F has the form H(A) = (Ay, w) for a fixed unit vector y. The 
theory remains fairly rudimentary, however, unless it is assumed further that 
E (ab) = EF (ba), in which case most probabilistic concepts and many results 
can, with appropriate reformulation be extended. Among the interesting 
examples of such systems are the approximately finite factors of Murray and 
von Neumann; these are essentially direct limits of matrix algebras, the dis- 
tinguished linear functional being the corresponding limit of the traces on 
the matrix algebra, after normalization so that the unit matrix has unit trace. 
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THEORY OF POSITIVE LINEAR DEPENDENCE.* 


By CHANDLER DavIs. 


1. Introduction. In problems involving real linear spaces, the concept 
of a positive linear combination of a set of vectors has acquired an impor- 
tance almost comparable to that of an (unrestricted) linear combination. 
As examples, one notes the usefulness of the Krein-Milman-Smulyan theorem 
in functional analysis; and the recent theories of games and linear pro- 
gramming. Accordingly, it seems desirable to develop an analague to the 
standard theory of linear subspaces and linearly independent sets of vectors. 
The present paper is an attempt in this direction. 

A central concern, accordingly, will be the geometry of frames of solid 
cones! in finite-dimensional Euclidean space (they being the analogues of 
bases in the standard theory). This is a problem of wider interest than 
might appear in connection with frames of arbitrary convex polyhedral cones. 
Indeed, it is known (and easy to show) that a frame of any such cone 
consists of a frame of its lineality space (therefore, a frame of a solid cone 
of lower dimension), together with a set of vectors arbitrary except for the 
requirement that it map onto a frame of the cone’s image when the space is 
reduced modulo the lineality space. Therefore, there are really only the two 
extreme cases to study: frames of pointed cones, and frames of solid cones. 
The first problem reduces in turn to the study of arbitrary convex polyhedra ; 
the second is attacked here. 

After certain necessary preliminaries (§ 2) I continue in § 3 with those 
parts of the theory which can be easily developed in rough analogy with classical 
methods in the standard (unrestricted-linear) theory. These results are all 
essentially known; many of them appear in [7]. The structure of bases in 
the present case turns out to be quite complicated, but by means of an auxiliary 
construction (§ 5) they are classified. Some numerical examples are collected 
in § 7. 


* Received May 18, 1953; revised January 4, 1954. 

* Terminology in this paragraph follows [6]. “Frame” is defined in §2. The 
lineality space of a cone is the maximal linear subspace it contains; the lineality space 
is zero for a pointed cone, while for a solid cone it is the whole space. 
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CHANDLER DAVIS. 


2. Preliminaries. The space studied is real linear space of n dimen- 
sions, which will be thought of as Euclidean space #”. Its elements will be 
denoted by lower-case letters; real numbers by lower-case Greek letters. The 
inner product of a and 6 will be written a:b. 

A positive combination of a,,- a, is a linear combination +: -- 
+ A,r with A; = 0; if all A; > O and all a; ~0, we speak of a strictly positive 
combination. An equation giving 0 as a positive combination of certain 
elements is a positive relation between those elements ; strictly positive relations 
are defined similarly. 

A convex polyhedral cone, or simply a convex cone, is the set of all 
positive combinations of a finite set of vectors, not all zero. This set of vectors 
will be said to positively span the convex cone; or simply to span it. (No 
confusion will arise with the notion “linearly span.”) 

A set of vectors {a;,: - -,a,} is positively dependent if one of the a; is 
a positive combination of the others (hence, in particular, if any a; is zero). 
Otherwise the set is positively independent. 

+, ar} is a frame of a cone C provided it is positively independent 
and spans C. If C is the whole space, or a linear subspace, any frame of C 
will be called a positive basis, or simply a basis. 

Other non-standard terminology will be explained as it is introduced. 

In selecting, from a set {a,,- - -,a,}, an a; having a specified property, 
I will sometimes assume without apology (or loss of generality) that the one 
selected is a,. (See for example Theorem 3. 7.) 


3. Some simple theorems. 


THEOREM 3.1. {a1,° + -,a,} spans E" if and only if, for every non-zero 
b, there exists an i=—1,---,r such that b-a;> 0. 


Proof. “Only if”: Any b may be expressed as b = + 
with 4; =0. Now +--+ so some term in the 
sum must be positive. 

“Tf”: Assume the convex cone A spanned by {a,---,a,} is not E* 
Then (see [6] or [4]), there is some hyperplane such that A lies entirely 
on one side of it; that is, there is some 6 0 such that ae A implies b-a 0. 


THEOREM 3.2. The set {a:,- --,a,} is positively independent if and 
only if no proper subset spans the same convex cone. 


The proof is trivial and may be omitted. 
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THEOREM 3.3. a, is a positive combination of {d2,- ar} tf and only 
if, for all 6 such that b- a, > 0, there is an t=2,:-+,7 such that b-a;> 0. 
{a,° * is positively independent tf and only if, for 
there is some b such that b-a; > 0, but, for 71, b-a; SO. 

Using the Minkowski-Weyl Theorem ([6], Theorem 11), it is not hard 
to reduce this to Theorem 3. 2. 

THEOREM 3.4. Any subset of a positively independent set of vectors is 
positively independent. 


THEOREM 3.5. Any set having a subset which spans E”, itself spans E”. 
4 g ] 


These are obvious. 


THEorEM 3.6. Let with a; 0, linearly span E". Then 
the following are equivalent: 
(i) {a:,° +,a,} positively spans 
(ii) for every i=1,---,7r, —a; is in the convex cone spanned by 


the remaining 


(iii) there is some strictly positive relation (see $2) between the ai, 


Proof. Assume (i). Certainly — a, is expressible as — a, = Aid, + Act: 
+--+-+-,a,, with ;=20; if this equation is solved for a, and the sign 
changed, it will be seen that —a, is given as a positive combination of 
(ii) is proved. 

Assume (ii). It tells us there are relations 


(1 + Aur) + +° + Ara, = 0, 
+ (1 + Ase) de = 0, 
Aridi + + (1+ Ar) a = 0, 
with \;, = 0. Adding these equations gives the required expression 
(1) pid, > + = 0, pi > 0, 


and (iii) is proved. 

Finally, assume (iii), and let b be an arbitrary vector of #". Since 
{a:,* - -,a,} linearly spans E” (this hypothesis enters here for the first time), 
there is some relation va, -+---+v,a,—0. By adding to this equation 
a sufficiently high positive multiple of (1), we get an expression of b as a 
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positive combination of {a,,---,a,}. (i) is established and the proof is 
complete. 


THEOREM 3.7. If +, a@,} positively spans E", then 
linearly spans 


Proof omitted. 

In the remainder of this paper, {a:,- - -,@,} will always be assumed to 
be a basis; when other sets of vectors are discussed different symbols will be 
chosen. 

It is clear that the standard theorem that any two linear bases of E* 
are affine equivalent has no analogue for positive bases. Thus in £* there 
are four essentially different types of positive basis, represented in Examples 
I-IV. (It is not true, of course, that any positive basis of H* is affine 
equivalent to one of Examples I-IV. These four examples are essentially 
given in [8].) These bases do not agree even in the number of vectors, which 
ranges from 4 to 6. Also, it is evidently not true that any subset of a positive 
basis is a positive basis of the smallest linear subspace containing it. The 
situation will become more complicated than in the case of linear dependence. 

However, one sees immediately the following 


THEOREM 3.8. r=n-+1, and r—=n-+1 is possible. 


Proof. If rn, then either {a,,- - -,a,} fails to span H” linearly, or 
it is exactly a linear basis of #”. In either case it fails to span EH” positively. 
On the other hand, for any n one can choose {@;,- - -, dn} a linear basis and 
= — Ardy —* *—Andn, 4 > 0, making {a1,- a positive basis. 

A basis for which r= n-+ 1 will be called minimal. By Theorems 3.4 
and 3. 7, every minimal basis is of the type just constructed. 


4, Spanned subspaces. Let us therefore investigate non-minimal bases. 


TueorEM 4.1. The following are equivalent: 
(i) r>n+1; 
(ii) some proper subset of {a,,- - -,a,} is linearly dependent ; 
(ili) some proper subset of {a:,- - -,a,} is a positive basis of a linear 
subspace (which by Theorem 3.2 must be proper) of E". 


Furthermore, the subset in (iii) may be chosen to include any preassigned 
ai, and the basis may be required to be minimal. 


Proof. (iii) immediately implies (ii) ; and one concludes easily from 
Theorem 3.7 that (ii) implies (i). 
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Assume r>n-+ 1. The vectors are more than n in number, 
so there is some non-trivial linear relation between them. By adding the 
proper multiple gf this equation to the equation (1) which must hold, one 
can certainly arrange to obtain +--+ ++ = 0, with pi’ = pi’ = 0, 


but some py’ ZeTO, SAY = = 0. 

{a;,° * *,@p} linearly spans some subspace S of H". The relation 
+++ ++ pp dy, pi’ > 0, means (Theorem 3.6) that {a1,° ap} posi- 
tively spans 8S. S is not EH", by Theorem 3.2. Also (Theorem 3. 4) 
*,@p} is positively independent. 

The basis of (iii) has been chosen so as to include a,. It remains to 
show it may be replaced by a minimal basis. But this is simply a matter of 
applying what is already proved to S. If {a:,- - +, ap} is not minimal, that 
is if the dimension of S is less than p—1, then some proper subset of 
{a:,* * *,@p} is a positive basis of a proper subspace of S; and so on. This 
process must end, since it decreases the dimension each time; when it ends, 
the minimal basis has been reached. 

A subspace S such that those a; in S are a basis for S will be called a 
spanned subspace; if the basis is minimal, a minimal subspace. 


THEOREM 4.2. Any spanned subspace is a linear sum of minimal sub- 
spaces, and conversely. 


Proof omitted. 
In some ways the spanned subspaces are analogous to the coordinate 
subspaces determined by an arbitrary linear basis. 


THEOREM 4.3. For the spanned subspace S spanned by {a1,- ~~, ap} the 
following are equivalent : 


(i) S is minimal; 


(ii) the positive relation satisfied by a1,° - +, @p 1s unique (up to multi- 
plication by a positive constant). 


Proof. For any such § there is a strictly positive relation between ay, 
"* +5, by Theorem 3.6. If § is not minimal it has a proper spanned 
subspace spanned, say, by {a1,° -,@g}, 9 <p; the strictly positive relation 
between -,@, is a positive relation between a,,° -,a, which is not 
strictly positive. Hence (ii) implies (i). 

On the other hand, if (ii) fails then the dimension of 9 is less than 
p—1, so, by Theorem 4.1, (i) fails. 


THEOREM 4.4. Every linear relation between 


is implied by 


— 
—— 
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the collection of strictly positive relations associated with the minimal sub- 
spaces according to Theorem 3. 6 (iii). 


It is convenient to delay the proof to § 5. 


5. The convex cone associated with a basis. Instead of simply listing 
the members of the basis {@,,- - -,a,}, one can describe it in another way. 
Consider Euclidean space E* with a distinguished coordinate system, the unit 
coordinate vectors being u,- - -,u,, and let A be the linear transformation 
determined by the conditions Au; = a;,i—1,---,7 (cf. [5]). The elements 
of the null-space V(A) of A correspond one-one to the linear relations 
holding between the a;; and, since such relations determine the a; to within 
affine isomorphisms in HE”, the geometrical properties of the basis can be 
described in terms of those of NV(A). 

In particular, the positive relations between the a; correspond to the 
elements of V(A) Pr. Here Pr is the positive orthant of EH", that is, the 
set of all elements of £" without negative components. 

The facets of the convex cone N(A) 1 P* are intersections of N(A) 
with the facets of the convex cone P". (A q-1-dimensional facet of a qg-dimen- 
sional cone is a maximal q-1-dimensional cone contained in its boundary; 
facets of lower dimension are defined by induction [6].) This follows from 
a general theorem, whose proof will only be sketched since it falls logically 
within the province of [6]. 


THEOREM 5.1. Let C be a convex cone, 8S a linear subspace. Then the 
convex cones SQ F, with F a facet of C, are all the facets of SNC. 


Proof Sketch. Every Sf F is proved to be a facet by Theorem 30 of 
[6]. Next any ze 8 /q C is shown to lie in the relative interior of some S/O F, 
and application of Theorem 35 of [6] completes the proof. 

Now each facet Q of Pr’ is determined by the set of the u; which it 
contains; in fact it is the set of all positive combinations of those 1%. 
N(A)NQ is a facet of N(A) NP if and only if some element of the 
relative interior of Q is in N (A) ; that is, if and only if some strictly positive 
combination of the u; in Q is mapped to 0 by A; that is, if and only if there 
is some strictly positive relation between the Au; for which u,eQ; that is, 
if and only if those Au; =a; for which u;e@Q span a spanned subspace. 


THEOREM 5.2. The lattice of spanned subspaces (for the basis given by 
A: E"-—» E™) is isomorphic to the lattice of facets of N(A) NP" under the 
natural correspondence just exhibited. 
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This has essentially been proved. That the spanned subspaces form 4 
lattice when ordered by inclusion follows from Theorem 4.2. The facets of 
a cone are of course to be given their natural order in terms of incidence. 
Remaining details are left to the reader. Note that the lattice of spanned 
subspaces, though it is join-closed as a sub-partly-ordered set [1] of the lattice 
of all linear subspaces of HE”, has a different meet operation; that is, that the 
intersection of two spanned subspaces even when non-zero is not necessarily 
the largest spanned subspace contained in both. For a counterexample we 
clearly must find a case where it is not a spanned subspace at all. But in 
Example V the spanned subspace spanned by a,- - -,@4 and that spanned 
by ds,* * *,@s have a 1-dimensional intersection which does not even contain 
any a; (Cf. Theorem 6.6 (iii) below.) The source of such cases is seen 
easily from the lattice-isomorphism: facets of Pr" such that N(A) intersects 
the relative interior of each, but not that of their intersection. 

Notice that extreme rays of N(A) MP’ correspond one-one to minimal 
subspaces under the correspondence of the last theorem. They are the 
“points ” (minimal non-null elements) of the respective lattices. 


Note too that if Q corresponds to S then the elements of Q give all the 
positive relations involving only those a; lying in 8. 


Proof of Theorem 4.4. Let 2,- - +, 2s be non-zero vectors lying one in 
each extreme ray of *N(A) Pr. The conclusion of the theorem is equi- 
valent to the statement that the linear combinations of 2,,- - -,2, are all of 
N(A). But their positive combinations are V(A) MP". Therefore it will 
suffice to show that an arbitrary ye N(A) is a difference of elements of 
N(A) Pr. Only the case y needs discussion. 

By Theorem 3.6, some element z, of N(A) lies in the relative interior 
of Pr. On the line joining y to namely {Ay + (1—A)a; OSAS1}}, 
there is some value A) > 0 giving a point z.e P", otherwise 2, would be on 
the boundary: z2=Asy + (1—Ao)4:. The equation can be solved for y, 
proving the theorem. 

Theorem 5.2 showed that the “ essential features” of the basis in the 
original space (more precisely, the incidence relations of the spanned sub- 
spaces and the membership of a; in them) are given in a simple way by looking: 
at N(A) Pr. (Note that the intrinsic geometry of this cone does not 
suffice, but the way in which it is obtained as an intersection of a linear 
subspace with Pr is needed; but see §6.) The proof just finished shows that 
all affine properties of the basis are so given. Further examination of 
N(A) /N Pr is suggested. Write hereafter N,(A) = N(A)N Pr. 


| 
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Let 2,,° * -,%, be as in the proof of Theorem 4.4, and let X be the 
matrix whose elements are given by X;;—=2;-u;. That is, X maps the unit 
vectors of Euclidean space of s dimensions onto the 2;, which give a frame of 
N,(A). Let a real matrix whose columns, considered as vectors, span the 
intersection of the positive orthant with a linear subspace, be called a 
+-matriz. It is proved in [3] that the transpose of a +-matrix is again a 
-+-matrix, and the cones spanned by the columns of these +-matrices are 
intrinsically dual to each other. Let a +-matrix whose columns are posi- 
tively independent, and whose rows are positively independent, be called a 
+-canonical matriz. In the present case, XY is a +-matrix whose columns 
are positively independent. Hence it may be written (perhaps after changing 


the order of the rows) as ee where X, is +-canonical and the rows of X, 


are positive combinations of those of X,. Also, the cone associated with YX, 
is intrinsically affine-isomorphic to that associated with X (namely N,(A)), 
since each is dual to that associated with the transpose of X. 

The principal task of this section is to give an independent characteriza- 
tion of those XY which can occur.” The question is slightly reformulated, and 
the answer stated, in 


THEOREM 5.3. All positive bases of Euclidean spaces are obtained in 
the following way: Take Y any +-canonical matrix, Z any matrix whose rows 
are non-zero positive combinations of those of Y. Let ~ 


Use each column of X as the set of coefficients in a homogeneous linear relation 
+: +--+ Xirb-=0. If +, 6, solve these relations and no further 
independent homogeneous linear relations, then {b,,---,b,} is a positive 
basis of the space tt linearly spans. 


Proof. There are two parts. 
The proof that no bases are omitted has almost been completed already. 
With X, and X, as above, it must be shown, first that no row of Xz is zero, 


2? They will be characterized only up to the following inessential changes: Any 
number of interchanges of columns, or multiplications of columns by a positive constant 
(this changes merely the ordering or the lengths of the a, in P’, therefore leaves 
a,,- + +,@, unaltered). Any number of interchanges of rows, or multiplications of 
row by a positive constant (this changes the picture in H’, but since it corresponds 
merely to changing the order or the lengths of the a; it is not interesting either). 
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second that every row of X, is repeated among those of X2.. Ifa row of X 
were zero, some u; would be orthogonal to all a;, therefore to N(A) ; but this 
contradicts the existence, proved above, of an element of N(A) in the interior 
of P’. If on the other hand, some row of X,, say the first, is not repeated, 
then it is not a positive combination of the remaining rows ([6], Theorem 9). 
Using the first statement of Theorem 3.3, one shows that there is some set 
of s numbers As such that, if each equation Xj,a, = 0 
is multiplied by the appropriate 4; and the results added, an equation is 
obtained in which only the coefficient of a, is positive. This means that a, 
is a positive combination of the other a;, contradicting the positive indepen- 
dence of {a:,: - *,@,}. This finishes half of the theorem. The proof of the 
other half follows. 

Since every row of X is a positive combination of the remaining ones, 
there is no set of numbers A,,° - *,As as above. (Use Theorem 3.3 again.) 
Therefore the relations imposed on 0,,: --,06, do not imply any positive 
dependence; therefore there is none, by hypothesis. It remains to show that 
{b.,: - -,b,} positively spans the space which it linearly spans. By Theorem 
3.6, it is enough to produce a strictly positive relation between the bj. This 
can be done by adding up the positive relations imposed on them. This 
completes the proof. é' 

The notation of this theorem will be followed in the succeeding section. 


6. Further details. The matrix Y alone determines some of the features 
of the basis: Since it determines the intrinsic geometry of V,(A), it determines 
the incidence relations of the spanned subspaces, by Theorem 5.2. Once Y 
has been fixed, adjoining of a new row to Z has the effect of increasing by 
one the dimension of #” and the number of basis elements. This observation 
can be sharpened to the following. 


THEOREM 6.1. Let B be the projection of E” on the orthogonal comple- 
ment of a, If {Ba,,- --,Ba,.} is a basis of BE", then the matric X 
associated with it by Theorem 5.3 is obtained from that associated with 
+, by omitting the r-th row. 


B will be said to have eliminated a, in this case. 


Proof. Xia: +: +--+ implies X,Ba,+--- 
+ Xi,Ba, = 0, which because of Ba, = 0 gives +---+ = 0. 
Essentially all that needs to be added is that there can be no further indepen- 
dent relations satisfied by the Ba; 7 =1,---,7r, or they could not linearly 
span an n — 1-dimensional space. 
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Notice the corollary that such a B carries spanned (minimal) subspaces 
into spanned (minimal) subspaces. 


THEOREM 6.2. a, can be eliminated if and only if * the r-th row of X 
is in Z. Thus the necessary and sufficient condition for no a; to be eliminable 
is that Z be absent. 


This follows from the last two theorems; the proof is omitted. 


THeorEM 6.3. In the case of a minimal basis, every a; 1s eliminable, 
provided n > 1. 


Proof. In this case X has only one column, and (for n > 1) more than 
two rows. 

This describes completely the relation of Z to the geometry of the »basis. 
It is natural to ask what properties of the basis correspond to special properties 
of Y. First, consider the case where Y, and therefore also X, are reduced 


matrices: 
xX, 0 | 
x= 


THEOREM 6.4. In case Z ts absent, Y is a reduced matrix if and only 
if E" ts the direct sum of two non-zero spanned subspaces. 


Proof. Assume Z absent, Y reduced. Then the minimal subspaces fall 
into two classes, with no a; in common: one corresponding to the columns 
of X,, the other to those of X,. Take the linear sum of each class. There 
result two spanned subspaces S, and S, whose linear sum is obviously E". 
It must be shown that 8; Suppose be SiN 82,60. Then is 
a non-trivial positive combination of the a; lying in S;; —) is a non-trivial 
positive combination of those lying in S,; adding these two relations gives 4 
positive dependence among the a;, that is, an element ye N,(A). It is easy 


to see that y is not a positive combination of the columns of (3°) and those 
of i ) ; this is a contradiction, so 8S, M S.=—0. 
2 


For the other half of the theorem, let S, and S, be two such spanned 
subspaces. Every a; is in one or the other; X will be shown to be reduced once 
it is shown that every minimal subspace is contained in either S, or S2. 
Suppose -, * *,@, Span a minimal subspace 8’, where a,,- - dp 
are in S, and a,,---,a, are in S.. For some p> 0, 


* With the assumed proviso that if a row of X is identical or proportional to one 
in Z the corresponding a, can be eliminated; this is again a mere matter of the order 
in which the rows are written, cf. fn. 2. For a trivial example of this, see Example III. 
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Now + ppdpe Sy and ++ are not zero, other- 
wise S’ would not be minimal. But they add to zero, contradicting the linear 
independence of S; and S,. The theorem is proved. 

The extreme case here is where Y is a diagonal matrix. Notice that Y 
is in fact always diagonal for n=3 (Examples I-IV). This is also true 
for n = 4, as will be proved in Theorem 6.8; but for n = 5 there exist bases 
with non-trivial Y, such as Example V. Before diagonal Y are discussed, 
it is convenient to mention this simple fact: 


THEOREM 6.5. Let d1,° + +,@p be all the a; lying in the minimal sub- 
spaces corresponding to 2,,+ + -*,%q. Let S, of dimension m, be the linear 
subspace of E” linearly spanned by a,,:-+,ap. Let the dimension of the 
smallest facet of N,(A) containing be t. Then m= p—t. 


Proof. S is a spanned subspace, that one in fact which corresponds to Q 
by Theorem 5.2. There are no linear relations among @,° - -,@, which are 
linearly independent of those given by elements of Q. Now if A is considered 
as a function on A-*S to S, its domain has dimension p, its null-space has 
dimension ¢, and its range has dimension m, which proves the theorem. 

A particularly evident special case is m =n, pr. Then ¢ is the dimen- 
sion of N(A), that is, rank (X), which in turn is equal to rank (Y); so 
n=r—rank (Y). 


THEOREM 6.6. The following are equivalent: 
(i) Y ts a diagonal matriz; 


(ii) the number of minimal subspaces equals the number of basis 
elements minus the dimensionality of the space: s=r—n. 


A necessary condition is 


(iii) the intersection of any two minimal subspaces is linearly spanned 
by those a; it contains. 


Proof. Since Y has s columns, s = rank (Y) =r—-n. The equivalence 
of (i) and (ii) therefore reduces to the following lemma: 


Lemma. A +-canonical matriz is diagonal if and only if the rank equals 
the number of columns. 


Proof. Let the matrix again be called Y, and have s columns, all linearly 
independent. The transpose Y’ is also +-canonical. Its columns are points 
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in E* giving a frame of the intersection of a linear subspace with the positive 
orthant P*. The number of linearly independent columns of Y’ is s, hence 
the linear subspace is all H*, hence Y is (perhaps after a rearrangement of 
rows) diagonal. The other half of the lemma is trivial. 

It remains to consider condition (iii) of the theorem. The a common 
to two minimal subspaces are linearly independent in general, by Theorem 
4.1. The only possibility of failure of (iii) is that the dimension of the 
intersection may be greater than the number of common q. 

Let the two minimal subspaces, corresponding to x, and 22, be of dimen- 
sions m, and m.; let the common a; be p’ in number; let the linear span 
of the subspaces have dimension m and contain p of the ai. Clearly 
p=(m,+1)+(mz.+1)—p’. The dimension of the intersection is 
m,+m,—m. For (iii) to fail for 2, 22 is for m, + m,—m > p’ to hold; 
that is, for p—m > 2 to hold; that is, for ¢ >2 to hold, ¢ being as in 
Theorem 6.5; that is, for the two extreme vectors 2, 7 of N,(A) not to lie 
in a single 2-facet of V,(A). 

Thus (iii) cannot fail if N,(A) is formed from a simplex by projection 
through a vertex, that is if Y is diagonal. Theorem 6.6 is proved. 

Note that (iii) is not sufficient for (i); a counterexample is given by 
Example VII. 

It is clear that, for any n, H” admits a basis consisting of the elements 
of a linear basis together with their negatives. Such a basis might be called 
maximal, in contrast to minimal bases. The following theorem (which in 4 
sense complements Theorem 3.8) justifies the terminology. (Theorem 6.7 
is essentially due to Robinson and Blumenthal; see [2], sections 76 and 120. 
It is seen to follow more naturally from the present strictly algebraic theory 
than from their metric approach.) 


THEOREM 6.7. rS2n, and the following are equivalent: 
(i) r=—2n; 
(ii) Y is diagonal and Z is absent; 


(ili) every minimal subspace is one-dimensional, so that the basis 1s 
maximal. 


Proof. As above, n=r—rank (¥Y). Now rank (Y) S38’, the number 
of rows of Y, with equality only if Y is diagonal, by the dual of the Lemma 
above. Also 2s’ =r, with equality only if Z is absent, by the basic Theorem 
5.3. Therefore 2n = 2r— 2s’ =r, with equality only if Y is diagonal and 
Z is absent. It remains to show that (ii) implies (iii), (iii) implies (i), 
but these are trivial. 


| 
| 
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THEOREM 6.8. For n <4, every basis gives diagonal Y. 


Proof. The number s’ of rows of Y satisfies ’ Sr/2Sn=4. Any 
non-diagonal +-canonical matrix must have at least four rows (because three 
vectors in a single orthant are either linearly independent or positively 
dependent, evidently) ; so we need consider only s’ = 4, r= 8, n=4. But 
in that case also Y must be diagonal, for rank (Y) =r—n=4=9’. 


7. Examples. 


in § 5: the columns of A are 


II. 


III. 


IV. 
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Here is one more application to a special case: 


FOF CO 


In each of the examples I-VII below, A and X are as 
-,a,, the rows of X’ (the transpose of X) 


X’=(1 1 1 1). 


& 


1 
or ee 
0100 

0010 0 
etree: 
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In the last two examples, only Y’ is given, Y being as in Theorem 5. 3. 


fi 2 10 0) 
913210 

3490 04 
[1 0 0 4 3 
1001001 0 0}° 
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MODAL OPERATORS, EQUIVALENCE RELATIONS, AND 
PROJECTIVE ALGEBRAS.* + 


By CHANDLER Davis. 


1. Introduction. In modal logic, as initiated by C. I. Lewis in 1918, 
one applies to propositions a, b,- - - not only the ordinary Boolean operations 
nN, U, and ’, but also a modal operator which for present purposes will be 
written C. This is a constant symbol with the following interpretation: If a 
is thought of as assertion the truth of a proposition, then Ca is thought of as 
asserting its possibility; similarly, (Ca’)’ asserts its necessary truth. 

Depending on the sense put on the words “ possible ” and “ necessary,” 
different properties will be assumed for C applied to propositional variables 
[12]. These properties may be expressed as identities, such as those below. 
Waiving the logical questions, some of these sets of identities may be studied 
as definitions of kinds of operators on Boolean algebras. Particularly inter- 
esting are S4 operators, satisfying 


(1.1) a, 

(1. 2) CO =O, 

(1.3) CCa = Ca, 

(1. 4) C(aU b) =CaUCb. 


Here (1.1)-(1.3) are, with the condition of “ isotoneness ” 
(1. 4’) C(aU b) D Ca, 


just the assumptions made on “closure operators” on an arbitrary lattice 
[1,19]; while all of (1.1)-(1.4) are satisfied if C is the operation of 
topological closure in a topological space, a and Db being subsets and the 
Boolean operations being set operations. Boolean algebras with an S4 
operator have been studied as generalizations of topological spaces [16]. 


* Received September 12, 1953. 

* This material is essentially excerpted from the author’s thesis [5]. The work was 
done without knowledge of the earlier work of Chin and Tarski (see [4]), which it 
presumably overlaps a good deal. It is unfortunate that their investigation has never 
been published. 
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S5 operators satisfy, beside (1.1)-(1. 4), 
(1.5) aNCb=O implies CanCb=0O. 


In the case of a topological space, the condition (1.5) on the closure 
operator implies that every open set is closed; in fact, (1.5) with (1.1) 
implies in general 

(1. 5’) C(Ca)’ = (Ca)’. 


Boolean algebras with an S5 operator do not therefore generalize any top- 
ologically interesting spaces, but they do provide generalizations of the simpler 
and “more set-theoretical ” notion of equivalence relation. 

This will be discussed more fully, together with other preliminary 
remarks on 85 operators, in Section 2. In Section 3 is a discussion of the 
reason for considering Boolean algebras with several S5 operators. The 
remainder of the paper gives theorems on the structure of these systems. 
They are shown to be in a rough sense “ coordinatizable ” by deriving them 
from the projective algebras of Everett and Ulam (Sections 4-5) ; only the 
two-operator case is treated here. The relation of these systems to the first- 
order functional calculus of logic is stated in Section 6. 

The terminology to be used for Boolean algebras is as follows. “ Meet” 
means “(,” “join” means “U.” The relation a—avUbJ, i.e, aD will 
be called “a is above 6” or “bd is below a.” J and O are the “unit” and 
“null” elements respectively. A prime denotes complement if it follows an 
element of a Boolean algebra, though it may have other meanings in other 
connections. If @ is a set of elements of a Boolean algebra, U @ will be the 
join of all elements in the set. 


2. Properties of S5 operators. First of all, the following more eco- | 


nomical set of conditions will be convenient: 


Lemma 2.1. If @ is a Boolean algebra, any function C on B into B 
is an S5 operator provided it satisfies 


(1.1) a, 
(1. 4’) C(aU b) Ca, 
(1. 5’) C(Ca)’ = (Ca)’. 


Proof. (1. 4’) is a consequence of (1.4) alone. As already mentioned, 


(1.1) and (1.5) give (1. 5’): Cam (Ca)’ =O, hence by (1.5), CaN C(Ca)’ E 


= 0, that is, C(Ca)’ C (Ca)’, and the inequality cannot be strict because 2 


of (1.1). 


i 
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The other half is to show that (1.1), (1. 4’), (1. 5’) imply (1.1)-(1. 5). 
Now (1.1) implies so, by (1.5), CO=C(CI)’= (CI)’ =0; 
(1.2) is proved. Next, (1.3) results from (1.5’) alone: CCa—=C(Ca)” 
—0(C(Ca)’)’ = (C(Ca)’)’ = (Ca)” =Ca. The proof of (1.5) is also 
easy: Suppose a Ch—O. This is equivalent to aC (Cb)’, so that by 
(1. 4’) and (1. 5’), Ca C O(Cb)’ = (Cb)’, that is, Can Cb=O. (1.5) is 
proved. Only (1.4) remains. 

In the usual terminology, call any fixpoint of C “closed”; the closed 
elements are those of the form Ca, by (1.3). Well-known wanes show 
that 
(2.1) aCCb implies CaC Cb, 


(2. 2) a and 6b closed implies aM 0 closed. 


Indeed, a C Cb gives Ca C CCb = Cb by (1.4’) and (1.3). And for a and 
b closed, (1. 4’) gives C(an b) CCaNn b; the inequality cannot 
be strict because of (1.1), so (2.2) is also proved. 

Complements and meets of closed elements are closed, by (1. 5’) and 
(2.2) respectively, so C® is a subalgebra and 


(2. 3) a and 6 closed implies a U 6 closed. 


The proof of (1.4) is now easy. By (1.1),aU bC CaU Cb; by (2.3), 
Ca U Cb is closed; by (2.1), C(aU 6) C Ca UCb. But the reverse inequality 
is obvious from (1.4’). The proof is complete. 

It is easily shown that (1.1), (1. 4’), (1. 5’) are independent conditions 
if B has at least eight elements. 

The following theorems make precise the sense in which S85 operators 
generalize equivalence relations. 

First the relation to equivalence relations is stated in the case where @ 
is a full Boolean algebra, that is, isomorphic to the Boolean algebra of all 
subsets of some set: 


THEOREM 2.1. Let @ be a full Boolean algebra. For each equivalence 
relation @ on the points of B, let Cg be the operator defined as follows: for 
ae B, Cga is the join of all points of B which are ¢-equivalent to any point 
below a. Then Cg is 85; and the correspondence 6—>C%¢ is a lattice iso- 
morphism of the partition lattice P(®) of all onto the lattice of all S5 
operators on 8. Also CB is always a subalgebra of B complete in B and a 


full Boolean algebra; and the correspondence ¢—> 048 is a lattice dual iso- 


morphism of P(®) onto the lattice of all such subalgebras. 
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Explanations. S5 operators are ordered as functions into a poset [1], 
namely, C, > C, means that, for all ae B, Cya > C.a. Equivalence relations 
on a set of points p,q, - - are ordered by defining ¢ > y to mean that pyq 
implies p¢qg. Subalgebras of @ are ordered as usual: by inclusion. <A point 
of a Boolean algebra is a minimal element strictly above O. The join referred 
to in the second sentence of the theorem is of course the join operation of 8, 
not set union. 


This theorem is not really new (see [16,17]; also [18,19]); also the 
ideas involved are perhaps made sufficiently clear by the remarks in Section 1. 
Accordingly, the proof is omitted; it is given in [5]. 

Next, a representation theorem relates the general case to the special 
case above. 


THEOREM 2.2. Jf C is an S5 operator on the Boolean algebra 8, and 
if B’ is the full Boolean algebra in which B is imbedded by the Stone repre- 
sentation theorem, then there is an S85 operator C’ on B’ which is an extension 
of C. If several S5 operators are extended to B’ in this way, the correspon- 
dence is order-preserving in both directions. 


This is a special case of a representation theorem of Jénsson and Tarski 
[10], but the proof is particularly simple in this case.’ 

For p a point of 8’, define C’p = 1 {a;a > p,ae CB}; define further 
CO =O. For other be 8’, define C’b as U {C’p;p Cb, p a point}. Then 
C’ may be proved to be an S85 operator by Lemma 2.1, or as follows. 

Define p¢q, for points p and q of B’, to mean that p C Ca implies g C Ca 
for ae B. The relation ¢ is obviously reflexive and transitive. To verify that 
it is symmetric, take and p Ca; then, by (1. 5’), pC (Ca)’ = C(Ca)’, 
so gC (Ca)’,q@ Ca. Hence ¢ is an equivalence relation. The S5 operator 
Cg corresponding to it according to Theorem 2.1 is identical with C’, by 
definitions. 

C’ must be shown to agree with C on @. The definition shows that if 
a is closed under C it is closed under C’. From this it follows that, for all 
be B, C’b C Cb. The reverse inequality is more difficult. 

Take any be B; C’ =U {C’q;q Cb, ¢ a point}; and each such 
is the join of an equivalence class under ¢. Hence to show that Cb C C’b 
it is enough to show, for any point pC Cb, the existence of a point q Cbd 
with pdq. 

Consider the set e—{bNd;d2p,deCB}. First, « contains with 


* The referee contributed to its simplicity. 
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any two elements their meet: if dea, b dea, then, because dM 
and dN de CB,bndnbn dea. Next, does not contain O:forbNd=O 
and de C® would imply b C d’e CB, d’ D Cb D p, which would forbid d D p. 
A standard argument shows there is some point q of 8’ which is below every 
element of a; in particular, gC b. Also pdgq, by the definition of « There- 
fore Cb C 

This proves the first statement of the theorem. 

Now let C,’ and C,.’ be the S5 operators on @’ obtained in this way from 
the 85 operators C, and C, on 8. That C,’ C C.’ implies C; C Cz is obvious. 
The converse follows almost as easily: If C, C C., then C.@ is contained in 
0,8, so by definition C,’/p C C2’p for p a point of B’, so C,’b C C.’b for all 
be B’, which was to be shown. 


3. Discussion of S5 operators. All the considerations of this paper 
were motivated by an interpretation of the notion of possibility, therefore of 
modal operators, which will now be set forth.* It is related to the ideas of 
Wajsberg [22], Carnap [2,3], and McKinsey [14]. 

Let us consider a logical system as a way of representing the relationships 
between statements which may correctly or incorrectly be made about the 
universe. (The “universe” here may be any “ isolated” physical system or 
conceptual system.) No two such statements need be considered as distinct 
unless it is believed that one of them might hold without the other’s doing so. 
In other words, let us consider a statement to assert something only insofar 
as it distinguishes a state of affairs which it asserts from an otherwise con- 
eeivable state of affairs which it denies. This point of view leads naturally 
to limiting oneself to propositions “in extension”: To begin with, one 
imagines a set of “ possible ” alternate universes. Thereafter, a proposition a 
is considered “in extension,” that is, it is identified with the set of alternate 
universes in which it holds. (Each proposition considered is to be one which 
asserts something about the universe: one, therefore, which can assert some- 
thing about each alternate universe, and can hold or not hold in each alternate 
universe. It is not to be a proposition about sets of alternate universes; nor 
yet a proposition about propositions.) Now if the proposition asserted by « 
holds in some alternate universe one can say that the proposition asserted by 
Ca, “it is possible that a,” holds in all the alternate universes. 

Now in the formal system constructed following these ideas, logical 


*The discussion of logic in this section, as in §1, is informal. Nowhere in this 
paper is there a formal discussion of logical systems generalizing Lewis’s S85, though 
the mathematical systems treated would be matrices for such systems. 


| 
| 
1 
| 


CHANDLER DAVIS. 


connectives between propositions will correspond to operations on the elements [| 
of a complemented lattice of subsets of the set of all alternate universes. This | 
lattice might be taken to be the Boolean algebra of all subsets, as is done by | 
Carnap [3], who distingiushes each of his alternate universes initially by the 
subset of a given set of atomic propositions which hold on it. Here, the 
lattice will be taken to be some field of sets, therefore a Boolean algebra 
(although it is not at all clear that other lattices are uninteresting in this 
connection) ; it will not be assumed full. 

The operator C on this Boolean algebra which was discussed above gives 
CO = 0, and for Ca=I. It is an operator—that which corre- 
sponds, if the Boolean algebra is full, to an equivalence relation with only 
one equivalence class. 

Non-trivial §5 operators, or more than one S5 operator, might appear 
in the same context. An early argument for the introduction of two modal 
operators [6,21] need not be discussed here (see [11]). A strong argument, 
however, is made by Weyl [23] for considering more than one “ degree of 
possibility.” He gives the following example. Suppose a train leaves 
Seattle at 9:00 a.m. Pacific Time. It is physically impossible for it to 
arrive in Chicago one microsecond later. It is technically impossible, though 
physically possible, for it to arrive in Chicago at 10:00 a.m. Pacific Time the 
same day. Weyl’s example illustrates the desirability of a totally ordered set 
of “degrees of possibility ”*; it can be amplified as follows to illustrate the 
case for a partly ordered set. It may be technically possible for the train to 
arrive at 9:00 a.m. two days later, but impossible because no track is avail- 
able for it to enter the Chicago station at that time; the technically impossible 
arrival time of 10:00 on the day it started might however be possible by the 
latter criterion. Thus we have two “ degrees of possibility ” neither of which 
implies the other. The example is artificial, and a different one will serve 
_better. 

Let the alternate universes be “ all ” division rings. Assume for simplicity 
that the Boolean algebra being considered is that of all sets of alternate 
universes ; that is, that there is, among those propositions being considered, a 
proposition distinguishing each of the division rings. Let C,a assert that the 
proposition asserted by a is possible as far as commutativity or non-commu- 
tativity is concerned. (Equivalently one could say, let C, be Cg of Theorem 
2.1, taking for ¢ the equivalence relation having two equivalence classes: the 
commutative division rings, the non-commutative ones.) Thus if a asserts, 


* Possibility operators to be ordered as functions into the Boolean algebra of 
propositions, see § 2. 
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“This division ring has two elements,” then Cya is true for all fields (all 
members of the first equivalence class) and for no other division rings. Let 
(.a assert that the proposition asserted by a is possible as far as finiteness 
or non-finiteness of the division ring is concerned. If a asserts, “ The identity 
of this division ring has infinite order,” then C,a is true for any division ring 
with an infinite number of elements and for no others; and so forth. C, and 
(, are again S5 operators, and neither C, CC, nor C, 2 C2. 

Return for a moment to the first, relatively trivial interpretation of possi- 
bility above. It would have been equally reasonable to replace the propositions 
a by propositional functions F',(-) in the following way. For each proposition 
a about the universe, let F(x) assert that a holds in the alternate universe z. 
(For fixed 2, F,(x) is not an assertion of the same form as a, so that 
expressions involving the F,(-) cannot meaningfully be combined with 
expressions involving the a by logical connectives.) The association of a to 
F,(-) is of course a Boolean isomorphism. To what does the S5 operator C 
described above correspond? By its definition, Ca TJ if (J y)Fa(y) is true, 
Ca =O otherwise. If one is willing to regard the proposition (3 y)Fa(y) 
as a propositional function (whose variable is not written and is without 
effect on the truth-value), then the conclusion may be simply stated: The 
operator C on propositions corresponds to existential quantification on the 
associated propositional functions. This correspondence between modal pro- 
positional calculus and the ordinary “einstellige Pradikatenkalkul” was 
mentioned but not discussed by Wajsberg [22]. 

So much for the case of one trivial S5 operator. The introduction of 
more than one modal operator naturally suggests the extension from the 
calculus of propositional functions of one variable to the first-order functional 
calculus, in which propositional functions may have more than one argument. 
In fact, there is a close relationship between the two situations; this relation- 
ship is the subject of the following sections. 


4. $5 operators and projective algebras. The projective algebras of 
Everett and Ulam [7%] were intended as a preliminary attempt at supplying 
a model for the first-order functional calculus similar to that supplied for 
propositional calculus by Boolean algebra. The definition is as follows.° 

A projective algebra is a Boolean algebra 2 (whose unit element will 
be written “i”), with a distinguished point po; with unary operations 
(“ projections ”) a—> a, and a—ay,; and with a binary operation a,b > aO b 
defined for a Ci, and b Ci,; subject to the following laws: 


5 See also [4]. 
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Pl. (aU bz; (aU b)y =a, U By. 


P2. toy ine Po- 


P3. a=O, a,=O, and a, =O are all equivalent. I 
P4. Azz = Az, Ayy = Ay. d 
P5. Let OCaCi, Then also, 
for any ce 2, cp =a and » imply cC (aD). 

V 


P6. 1,0 pois; po Oty = ty. 
(aU 4) Oi, = (aDi,) U (0 U 5) = (i, U (i, 5). 
Dl. cDO0—00b—0. 


The typical case motivating these conditions is the projective algebra of [| I 
subsets of a direct product, in which 2 is a Boolean algebra of some subsets 
of a Cartesian product of a set by itself, and where the projections a—a, 
and a—> 4d, are the projections of a set in 2 onto the respective axes. Every 
projective algebra is isomorphic to a projective algebra of this sort [7,15], [ 
but not necessarily to a projective algebra of all subsets of a direct product. | 4 

Every projective algebra 2 can be made into a Boolean algebra with two 
S5 operators in a natural way. Namely, $5 operators on Q as a Boolean 
algebra are given by defining | 


(A) Cia Ar ty, C.a te dy. 


This result is a special case (where e is taken equal to 1) of Theorem 4.1 
which follows.® 


THEOREM 4.1. Let Q2 be a projective algebra and e a fixed element E 
of 2. On the Boolean algebra of elements below e (complement being defined ( 
as relative complement) the following are S5 operators: 

(B) (a, OW) Ne, Ca = (iz O ay) Ne. 

THEOREM 4.2. Every Boolean algebra with two 85 operators is obtained | ; 
in this way. 

st 

Proof of Theorem 4.1. It is enough to give the argument for C,, since S) 
the proof for C, would go just the same. The conditions of Lemma 2. 1 tl 
must be verified; accordingly, it will be shown that 


~ a 


(4.1) (a,D ty) NeDa; 


® Cf. remark at end of § 5. 
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(4. 4) ((aU Ne= ((azD1,) U (b,0Y)) Ne; 
(4. 5”) C,((Cia)’ N e) = (Cia)’ Ne. 


Here (1. 5’) takes the form (4. 5’) because the prime is still being used to 
denote the complement in 2 (not the relative complement); (4.4) is the 
statement of (1.4), which is stronger than (1. 4’). 

These identities can be verified using the representation theorem for 
projective algebras, or formally, as follows. In the proof, results from [7] 
will be cited by number, without quoting them. 

First, if a= O then (4.1) becomes trivial and (4.4) follows from P3, 
Di. If none of the elements involved is O, one has, by C1, C9, and C10, 
(a, Oty) NeD (ae Oay,) NeDa, proving (4.1); while C19 gives (4.4). 

If a= 0, (4. 5’) reduces to Cie = e, which follows from (B) and (4.1). 
It remains to prove (4. 5’) for OC aCe. Now 


(C,a)’ Ne = (a, i,)’Ne (obvious from (B) 
= ic) Diy] U [ie (i Ne (by C22) 
== Niz) Ot) Ne (by D1). 
Also 
Ne) = ([(a2 Diy)’ N Ne (by (B)) 
C (([(a2 0 ty)’JoN ee) Oty) Ne (by C4, C9) 
= ty) N (eg Diy) Ne (by C11) 
= ([ (a Diy)’Je iy) Ne (by P5, C9) 
= ([(a¢ ty) Ne (see above) 
= Ne (by P5). 


Hence “ C ” holds in (4. 5’). But the reverse inequality is immediate from 
(4.1). The proof is finished. 


5. Continuation. Proof of Theorem 4.2. Imbed the given Boolean 
algebra 8 in B’ by Theorem 2. 2; let the extensions of the given 95 operators. 
C, and C, be written simply C, and C,. The proof is in three parts: con- 
struction of a certain full projective algebra 2’; construction of a certain 
sub-projective-algebra 2 of 2’; and showing @ can be obtained from 2 by 
the method of Theorem 4. 1. 

Elements of @’ will be writen as before (in particular, “p” and “q” 
will denote points) ; lower-case Greek letters will denote elements of Q’ 
(thus its unit element is written “.”). 


| 


756 CHANDLER DAVIS. 


Part I. The points x of Q’ are to be as follows: 
(5.1) for every point p of 8’, a point [p]; ; 
(5.2) for every point Cp of and point C.q of a point [Cip, C.q]; 
(5.3) for every point Cip of C,8’, a point [Cip]; : 
(5.4) for every point C.q of C.8’, a point [C2q]; 
(5.5) an additional point zo. 


Here it is clear that the same element r of 8’ may give rise to more than one 
point of 2’—e. g., if r is a point both of B’ and of C,®’. In such a case the 
notations [r],, for the point of the form (5.1), and [1], for the point of 
the form (5.3), could be used to distinguish the two; however, the arguments 
which follow will be sufficiently clear without doing this. The point zp will 
be the distinguished point (§ 4) of 2’. 

Let 2’ be the full Boolean algebra generated by the points (5. 1)-(5. 5). 
Denote by x» the join of all points of the form (5.n), n = 1, 2, 3, 4, 5. 

The projections of 2’ will be defined first for points =. 


me: [plo= [Crp] ty: [ply = [C2p] 
Coqg]2 = [Cp] [Cip, C2q]y = [C29] 
[Ciple = [Cp] [Ciply = To 
70 [Cop], = [Cop] 
(70) 2 = 703 (70) y = 


Next, for arbitrary «e 2’, define a point}; and a, 
similarly. 

Of the conditions (P1-P? and D1) which must be checked to prove 2’ 
a projective algebra, P1, P3, and P4 are already clear from the definitions. 

Now tz = xz U xs, ty = xs U xs, in the notations above. Recognizing this, 
one verifies P2 immediately. 

Also for «e 2’, a Cu, if and only if « is a join of points of the forms 
(5.3) and (5.5), with a similar condition for 8 Cu, For such 2 and 8B, 
the operation must be defined. First, OO—OOB=O may be 
assumed; this is D1. Next, for OC aC. and OCBCy, let «08 
=U {2372 C C a point}. This makes P6 and obvious. 

P5 must be checked. As a sample, here is the demonstration that 
Consider any point rCa. If then r= 
where p= 7 Up is any point below 8. (Remember 8+~O.) The other case 
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is [C.p]: then either r= [C,p, with [C.q] = [Cip, Coq], CB; 
or, if 8 =m one has 7» = 7,. (It might also happen that r= [p]., 
with [p],y CB, but this can not be guaranteed; this was of course the reason 
for introducing the points (5.2) originally.) In either case, tC (#0 8).. 
Therefore « C (#0 

It has been proved that Q’ is a projective algebra. 


Part II. Next the subset 2 of 2’ will be defined. Here @, Gi, G, 
represent sets of points of (respectively) 8’, C,B’, C.B’. 


(5.6) If U @eB, then U{[p];peQ}eQ; 


(5. 7) if U die CB, U a. € C.B, 
then U {[C,.p, C.q]; Cipe Gi, Cage Ge} 


(5.8) if U then U {[C.p];C.ipe GijeQ; 

(5.9) if U @,eC.8, then U {[C.p];C.pe e292; 

(5.10) meQ. | 

Let 2 be the sub-Boolean-algebra of 2’ generated by these prescribed elements. 


This ensures that 2 is closed under the Boolean operations. The proof 
that it is closed under the other operations follows. 

Any «¢2 may be written as the join of the aM xa, n =1, 2, 3, 4, 5. 
Also @M xn€ 2, and, by P1, xn)2eQ for all n implies So the 
next task is to prove, for « C yn, that «e 2 implies a,€Q. 

For n = 5, that is, « =, this is automatic. 

For n = 1, 3, 4, it is almost as obvious. Take for example «C yi, ae Q. 
It is easy to see that a is of the form (5.6): «=U {[p];pCa,ae B}. 
Then a, = U {[Cip]; Cip C Cia}; since C,a e C,B, this is of the form (5. 8), 
hence in Q. 

Finally, consider « Cy2, «¢Q2. Call an element of the form (5.7) a 
“Q-rectangle.” Now « is a combination of a finite number of 2-rectangles, 
together with elements disjoint from x2, under Boolean operations. Note that, 
for any Q2-rectangles @ and y, BN y and ~’N xy are joins of Q-rectangles. 
Therefore g is a join of Q-rectangles; and therefore, by P1, only the case 
where « is a single 2-rectangle needs to be considered. But this case is clear. 

Thus Q is closed under z-projection; similarly for y-projection. There 
remains the operation “0.” 

Assume U{[C,p];Cip CaeC,8}, B= U{[C.p]; Cop Cb CB}; 
so that a,8e2. In order for a point 7 of Q’ to be below 208, it must 
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satisfy 7, Ca andaz,Cf. This is equivalent to saying that either z= [p], 
with pCan b, or r= [Cyp, Cg], with Cip Ca, Cog b. The join of all 
a of the first kind is an element of the form (5.6), because a and b, hence 
afb, are in @; the join of all + of the second kind is an element of the 
form (5.7). Therefore «08, the join of all of either kind, is in 2. 
This was not quite general enough, it assumed m Ca, m CB. However 
the remaining cases involve no difficulty, and one concludes that 2 is closed 
under the operation “ 0.” 
It is now easy to see that 2 is a projective algebra. 


Part III. The first two parts of the proof were preliminary: the 
supplying of a projective algebra 2 from which an isomorphic image of 8 
could be obtained by the construction of Theorem 4.1. There remains the 
proof proper: showing that this can in fact be done using the 2 prescribed. 


Define the function ® on B’ by da—U {[p];p Ca,p a point}. Then 
8’ consists of those elements of Q’ below ®J, which is the element called yi 
above. Any ae 8’ is in ®B if and only if it is in 2; and @ is a Boolean 
isomorphism of 8 on 6B (in fact of B’ on ©8’), if complementation in 
8’ is understood as relative to 1. 

So much is obvious. Now by Theorem 4. 1, C, and C, defined as follows 
are S5 operators on $8: (¢,0i,) N Ol, NOI. It 
must be shown that ®C,a = 0,¢a, 6C.a = 

Assume [p] C Cie, a=—@a, ae B. Since [p] Ca,Oy, one has 
[ple %; and every point below «, is [Ciq] for some [q] Ca, that is, 
for Hence, for such a This proves 

Let p be a point of B’ below C,a, ae B; then [p] CC,a. By the 
proof of Theorem 2. 2, there is some point q below a such that pC C,.qg. For 
such a q, [p]o= [Cip] = [Ciq] = C (fa)2. Thus p, being a point 
below $I whose z-projection is below (@a),, is below (6a Ow) = C,4a. 
This proves ®C,a C Ca. 

An identical argument relating C, with C. completes the proof of the 
theorem. 


It should be remarked that in case e of Theorem 4.1 is i, then for any 
ty) =i, iy (see [%, C10]). But there exist 
8 such that C.C,a assumes other values than O and J. Therefore if Theorem 
4.1 had asserted only the special case e =i, the converse Theorem 4.2 would 
have been false. 
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6. Relation to the first-order functional calculus. This relation is 
most easily expressed by dealing with Mautner’s Boolean tensors [13] instead 
of with the logical theory itself. The logical coordinate system of Mautner’s 
theory will be left fixed,’ so that a Boolean r-tensor may be treated as simply 
a function on the sett SXSX::-: XS (r factors) to the two-element 
Boolean algebra. 

The Boolean r-tensors f = +, form a system B, closed 
under the Boolean operations. However @, is not closed under the following 
operations : 


(6.1) outer sum formation 

(6.2) outer product formation, dual to (6.1) ; 
(6.3) setting arguments equal 


(6.4) sum contraction (i.e., existential quantification) ® 


(6.5) product contraction, dual to (6. 4). 


All of (6.1)-(6.5) correspond to admissible processes in forming formulas 
of the functional calculus.® 

If all of (6.1)-(6.5) are forbidden, @, is merely a Boolean algebra. 
This corresponds to the situation of the “ alternate universes” of Section 3 
before S5 operators were introduced. Corresponding to the introduction of 
S5 operators there, a limited class of Boolean tensor operators will be specified 
which will take 8, into itself. (Note that (6.1)-(6.3) will still be forbidden, 
and will have no analogue.) Namely, for any ye 8 let 


Clearly C,f is in B,, but its k-th argument is without effect. Conversely, 


7It will be clear that Theorem 6.1 has invariant significance in Mautner’s sense 
and Theorem 6.2 does not. Note that my notation for Boolean tensors in this section 
differs from Mautner’s. 

*The notation in the left-hand member implies that the k-th variable is bound by 
the summation. 

*It is unconventional to include setting arguments equal (6.3) as a process 
analogous to the others; that is, “F(#,a)” is not usually regarded as having been 
formed from “F(a, y).” But cf. [9; Kap. 5a]. 
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a function in @, whose value is independent of its k-th argument is of the 


form C;f. This characterization of C;,@, makes it clear that it is a complete 
sub-Boolean-algebra of B,. Also C;, can be seen to take any fe 8, to the is 


meet of those elements of C;,@, above f. Therefore (Theorem 2.1) C;, is an 
S5 operator. 


Furthermore, if € is any set {k,,- - -,k;} of the indices 1,- - -,7, then 
the operator 
(6. 7) Ce = “Ck, 


clearly depends only on é (not on the order of the C;,) and is an S85 operator. 

Some further properties of the Cg may be pointed out.: It is natural to 
order them as functions into @,, in fact to consider them as a subset of the 
lattice of all §5 operators on @, (Section 2 and [1]). Now if one defines 
an operator corresponding to the null set of indices by 


(6. 8) Cof =f, 


then one concludes that Czy,—CegU Cy. It is not hard to show also that | 


Ceng=CeN Cy. (The proof, based on Theorem 2.1, is given in [5].) 
Finally, if {1,---,r}, then for all 
provided there is any set of arguments for which f(a,-°--,a,-) =I. 


Collecting these facts gives the following. 


THEOREM 6.1.7 The operators Cz on B, defined by -(6.6)-(6.8) are S5 
operators. They form a sublattice of the lattice of all S5 operators (containing 
its null and unit elements), and Ce—é is a lattice-isomorphism onto the 
Boolean algebra of all sets of indices 1, --,r. 


Thus Boolean algebras with $5 operators generalize Boolean tensors only 
if the operations allowed on the latter are restricted; they provide a Boolean 
model for the functional calculus only in a limited sense. Quantification is 
included (notice that the dual of (6.6) is (C;f’)’) but (6.1)-(6.3) are not. 

Projective algebras are evidently in the same situation. In fact, identify 
every f © 8, with the set of all points of S X S which it maps into J. Fix a 
point z of 8. Define *° f, as the set-intersection of {2} XK S with O,f (as 
defined by (6. 6)) ; similarly fy = (S {z}) N C.f. Also define, for subsets 
T and U of 8. 


({4} XT)O(U X {a}) =UXTCSXS. 


20 [7, C28]. 


j 
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THEOREM 6.2.7 With respect to the distinguished point {%} X {2} and 
the operations just defined, B. becomes the projectwe algebra of all subsets 
of the direct product SX 8. Every projective algebra of all subsets of a 
direct product of a set with itself is obtained in this way. 


This is clear without giving details. For the first sentence, compare [7]. 
For the second sentence, simply construct @, over the set in question. 


ANN ARBOR, MICHIGAN. 
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COHOMOLOGY CLASSES OF FINITE TYPE AND FINITE 
DIMENSIONAL KERNELS FOR LIE ALGEBRAS.* 


By G. HocHscHILp. 


Introduction. Our main concern is with the problem of finite dimen- 
sionality of the kernels in the interpretation of the 3-dimensional cohomology 
groups of a Lie algebra Z as groups of equivalence classes of L-kernels (as in 
[4]), in analogy with the theory of group kernels of Hilenberg-MacLane, [1]. 
This problem is connected with the theorem on the existence of a finite 
dimensional faithful representation of a finite dimensional Lie algebra and 
with the related problem of the effaceability of cohomology classes in finite 
dimensional modules. In fact, the solution of our problem depends on some 
of the methods and results of several investigations in these directions. 

In order to give a coherent and intelligible account of these and certain 
refinements of them which we need for our purpose, we take up some of their 
essential details in a more specifically adapted exposition. In particular 
(in Section 1) we give a new exposition of the relevant features of the use of 
the differential calculus of E. Cartan which was basic in [2] and [9] ([9] is 
not available in print), cutting out much of the computational detail which 
encumbered the original presentation. In Section 2, we give a slight refine- 
ment of a result on module enlargements due to Zassenhaus, [10], which is 
vital for our purpose. In Section 3, this is combined with the results of 
Section 1 to give a completion of the result of [8] and [9] on the effaceability 
of cohomology classes. This will absorb Iwasawa’s result, [6], on the efface- 
ability of 2-dimensional cohomology classes in characteristic 0. We also give a 
simplification of Iwasawa’s proof for the case of characteristic p by using 
Jacobson’s technique of [7]. In Section 4, we apply these results to the 
kernel problem for which we obtain a complete solution, in the case of 
characteristic 0. 


1, Cohomology and differential forms. Let ZL be a finite dimensional 
Lie algebra over a field F. Denote by A the exterior algebra built over the 
space dual to Z. A is a graded algebra over Ff, with the homogeneous com- 


* Received March 22, 1954. 
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ponents A” such that A” = (0), for m <0; A° =F; A*=the dual space 
of L; and, for m=1, A”™=the subspace of A which is spanned by the 
(exterior) products with m factors from A*. The (exterior) multiplication in 
A is anticommutative in the sense that, for ae A? and b e AY, ba = (—1) ab, 
Without dwelling on the well known details of the definition of A, we recall 
merely that A” may be identified with the space of all m-linear alternating 
functions from LZ to F. In particular, A” = (0) if m exceeds the dimension 
of Z, and otherwise the ordered products of m elements of an ordered basis 
for A‘ constitute a basis for A” over F. The coboundary operator 6 of A isa 
homogeneous antiderivation of degree 1, which means that 6(A™) C A”™** and 
8(ab) = 8(a)b + (—1)?a8(b), if ae A®. Furthermore, = (0), and 
*==(. Clearly, such a map 8 will be uniquely determined (assuming its 
existence) as soon as its values on A‘ have been prescribed. This is done 
most naturally by using the following further maps. For we L, define the 
map ¢c, of A‘ into F by c,(a) =a(u). Then c, can be extended in one and 
only one way to a homogeneous antiderivation of degree —1 of A, which we 
still denote by c,. It follows then at once by an induction on the degree that 
= 0. On the other hand, there is a unique homogeneous derivation t, 
of degree 0 on A which maps F into (0) and, on A’, is given by t,(a) = a, € A’, 
where a,(v) =a([v,u]). One verifies easily that t,t, — tyty = ttu,v}, so that 
A carries the structure of an Z-module. This verification is again based on 
an induction on the degree, noting that the two maps whose equality is to be 
established are both derivations of degree 0. Similarly, one observes that 
tyuCy — Cyt, and Cry»; are antiderivations of degree —1 which coincide on F 
and on A’, whence it follows that tc, — ¢yty = Ctu,v;. Now the conditions 
Cy (8a) = t, (a), for all we and all ae A? define a map §: A? A? uniquely. 
There is unique extension of this map to a homogeneous antiderivation of 
degree 1 of A which is 0 on F. This is the coboundary map 8 of A, and one 
verifies easily that c,6 + 8c, = t,, for all we Z, and that 8&0. This verifica- 
tion is again inductive, using the fact that c,8 + 8c, and 8 are derivations 
of A. 

Now let V be an L-module. We form the tensor product V® A with 
respect to the base field F. We may regard this as a right A-module in the 
natural fashion, and we shall “extend” the above maps cy, t, and 8 over 
V @A in such a way that their inter-relations are preserved, as well as the 
properties c,* — 0, = 0. We shall define c, and t, on V® A such that, for 


* These maps and, more generally, the technique of derivations and antiderivations 
in graded rings which we employ here have been used quite frequently in recent years, 
notably by H. Cartan, C. Chevalley, and J. L. Koszul. A certain familarity with their 
manipulation is presupposed here. 
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ve VandaeA, a) =v @e,(a), and t,(v @ a) = (u- v) @a+v@t,(a). 
The coboundary map & is extended so that 8(v@a) = (8v)a+v0® ia, 
where Sv is defined as an element of V @ A? by the relations c,(8v) = w- v, 
for all we L. In particular, the pair (V © A, 8) constitutes a cochain complex 
whose cohomology groups are the ordinary cohomology groups H"(L, V) for 
L in V. 

Next we single out certain facts concerning rings of power series whose 
consideration is basic for our proof of the effaceability theorem. Let P denote 
the ring of all integral formal power series with coefficients in the field F 
of characteristic 0 in n variables 2,,- - *, 2». We denote by D; the partial 
differentiation with respect to 2; in P. Correspondingly, we denote the formal 
quadrature with respect to 2; by D;* (the “constant of integration ” is taken 
to be 0). If U is an F-linear subspace of P, the calculus closure of U in P 
is defined to be the subspace of P which is spanned by all those elements of P 
that can be obtained from elements of U by applying a finite number of 
operators D; and D;*. The differentiations of P (over F’) constitute a free 
P-module with the D; as a free basis. An element pe P gives a P-linear map 
D— D(p) of this module into P which we denote by dp; this is the differential 
of p. The differentials generate (over P) a free P-module which is dual to 
the module of the differentiations of P; dx,,- - -, dz, being a free basis dual 
to the free basis D,,- - -,D, of the module of differentiations. The algebra 
B of the differential forms on P is the exterior algebra built (relative to P) 
over the module generated by the differentials, which is therefore B’. We have 
B° = P, and d maps B° (F-linearly) into B'. This map d can be extended 
uniquely to an antiderivation of degree 1 of B, still denoted d, such that d? = 0. 
As with every graded ring, we denote by B* the sum of all homogeneous com- 
ponents of positive degree of B. Then B* is an ideal of B, and B is the direct 
sum of B® and B+. The most important fact for us is the following well 
known result, a proof of which can be found, for instance, in Section 2 
of [2]. 


Lemma 1. Let b be an element of B+ such that db =0. Let Uy be the 
subspace of P which is spanned by the coefficients of b when expressed as a 
P-linear combination of the ordered products of the dx; Then there exists 
an element ce B such that dc =b and U, lies in the calculus closure of Uy». 


Now let Z be a solvable Lie algebra of finite dimension n over an alge- 
braically closed field F of characteristic 0. Let K be another Lie algebra 
over F which contains Z as an ideal. Then, since simple Z-modules must 
be 1-dimensional (as follows from the fact that LZ is solvable and F 


= 


766 G. HOCHSCHILD. 


algebraically closed), we can find a sequence (0) =L,nCLZyn.C:+-+-ClL, 
=[K,L] CL,.C::-CL,—=L, where each JZ; is an ideal in Z and has 


dimension n—i. We select a basis u;,° - -,U, for L over F such that each 
n 

is spanned by Un. Then, if we write uj] = with 


Ci, F, we have = 0, unless k = j, andk>r. 

Applying Lie’s theorem to the solvable Lie algebra spanned by Z and a 
single element x of K, we conclude that [2, Z] is nilpotent on every finite 
dimensional K-module. Hence [[2, LJ, C L; (j=1,- +--+, for every 
xe K, whence [[K, LZ], C L;. We have therefore cj; =0 if i> 

Next we consider a finite dimensional K-module V. Regarding V as an 
L-module, we can (with the same justification as was used above) find a 
basis ¥,,: * ,Um for V which is such that, for each k, L- vz is contained in 
the space spanned by v;,° - +,Um. It follows then as above that [K, L] - v, is 


m 
contained in the space spanned by Um. If we write vj = Dd 
k=1 


with ej, in F, we have therefore ejj,—0, unless k2=j; and, for 1>7, 
€ij; = 0. These facts concerning the coefficients and ci, will sometimes 
be used implicitly in what follows. 

Certain elements of our power series ring P will play a distinguished 


role below. These are the following exponential series: c; = exp( > Cpt’), 
r p=1 

and e; = exp( > ép;j%)). We denote by Q the subring of P which is generated 
p=1 


by F, the x, (k =1,---.m), the c; and their inverses (1 = 1,---,m), and the 
e; and their inverses (7 —1,:--,m). Correspondingly, we replace B by 
the subring C of B which is generated by Q and the differentials dz,. Evi- 
dently, Q coincides with its calculus closure. From Lemma 1, it follows that 
if b is an element of C* such that db =0 there is an element c in C such 
that dc=b. 

We shall construct an isomorphism of the complex (A,8) into the 
complex (C,d). Let a;,- --,a@, be the basis of A! which is dual to the 
basis +, Un of L, so that a;(uj;) = (Kronecker symbol). Then the 


definition of shows that 8a; = — cj,ia;a;. If an isomorphism of (A, 8) 
j<k 
into (C,d) leaves the elements of F' fixed and sends each a; into b;, we must 


therefore have dbij = — > cjxibjby. We can write these conditions in the 
i<k 


i-1 

form + = — cyibjby. For ir, these conditions become 
j=1 

simply db; 0. Accordingly, we shall define 6; dz;, for ir. Next we 

observe that cj; 0 if 7 and also if j Hence, if i > our conditions 
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on the may be written db; d( 0; =— Cinid Dx. These con- 
p=1 
ditions are equivalent to the conditions d(cjbi) =—c; 


Now suppose that i> v7, and that we have already found elements 
b,,: °°, 0:1 in C* which satisfy the above conditions and which are further- 


g-1 
more of the form b, = c,'dr; + where the are elements of Q 
t=1 


which do not involve Write Cjxibjby, and, correspon- 


dingly, Pp= Cjxiajax. Similarly, set = and Ry = Cpiidy. 
We have da; + Ria; + Pi = 0. Hence, applying 8, (dR;)a; — Ryda, + = 0. 
Substituting Ria; -+ P; for — 8a; and observing that R;?—0 (for Rye A’), 
we obtain (8R;)a; + 8P; =0. Since + dP; involves only the a,’s 
with 7 <i, this implies that we must have RjP;+6P;—0. Evidently, it 
follows from this that we have also rip; +dp;—0. But this means that 
d(cipi) = 0. By Lemma 1, we can find elements q; in Q such that dq, = — cipi, 


i-1 
and furthermore such that 9; = c; S gidx:, where the gi: are as in our induc- 
t=1 
tive assumption. (Note that we have 1 > and that - -,2, do not enter 
in this consideration.) Now we define b; = c¢;*(d7; + qi). Thus we can find 
b1,: + +,b, in C? which satisfy our above conditions and which are further- 


more of the form described in our inductive assumption. From this form of 
the 0,’s, it is clear that the dz;’s can be expressed as Q-linear combinations 
of the 0,’s, whence the 0;’s constitute a free basis for C1 over Q. The map 
which leaves the elements of F fixed and sends each a; into b; is therefore an 
F-linear isomorphism of A° + A? into C® + C? and extends uniquely to a 
ring isomorphism ¢ of A into C. By the construction of the b,’s, we have 
do = $5 on A° + A’, whence it follows that dé = ¢8 on all of A. Thus ¢ 
is an isomorphism of the complex (A,8) into the complex (C,d). Now we 
define, for each we LZ, an antiderivation C,, of degree —1 on C such that 
$Cu = Cyd. In fact, we define C,(¢(a)) = cy(a) =a(u), for ae At. Since 
the b,’s constitute a free basis for C! over Q, this map C, extends uniquely 
to an antiderivation of degree — 1 of C which clearly has the desired property. 
Now we define the derivations 7’, of degree 0 of C by setting T,, = C,d + dC,. 
We have then 7,¢ = ¢t,. Hence we find also (7,0,—CyTu)¢ = Ctuvid, 
for all w and v in Z. This shows that the antiderivation of degree —1, 
Cpu), vanishes on whence we may conclude that 
O,Ty = on all of C. Observing that 7,d—=4dT,, we derive 
from this, by writing 7, —=C,d + dC,, that —T Ty = 


r 
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In particular, this shows that Q becomes an L-module with the operators 
T,. It is easily seen from our definitions that, for +r, Ty,(vi) =, 
Hence T,, maps each exponential of a linear form in %,- - -,2, into an F- 
multiple of itself, which is zero whenever we [K, LZ]. Furthermore, it follows 
inductively from the form of the 6,’s that, for any 7, and any we L, T,(2) 
does not involve z;,- --*,2,. From these facts, it is not difficult to prove 


the following critical result. 


Lemma 2. Every finite dimensional subspace (relative to F) of Q is 
contained in a finite dimensional L-submodule of Q. Furthermore, [K, L] 
is nilpotent on every finite dimensional L-submodule of Q. 


Proof. Let M > 1 be an upper bound for the total degrees in the 2;’s 
of the elements T,,(2;). In counting degrees, the exponentials are ignored, 
i.e., they are regarded as elements of degree 0. The weight w(q) of a 
monomial is defined as w(q) = If q is an 

i 


arbitrary element of Q, w(q) is defined as the maximum of the weights of the 
monomials occurring in g. If q is an ordinary polynomial in F'[a,,- - -, 2] 
the weight of 7,(q) =X Di(q)Tu(zi) is at most equal to the maximum of 


the n numbers w(q) — + MM*(-), whence w(Tu(q)) < w(q), for all 
qe F[a,:-+,¢]. If q is a polynomial and e an exponential we have 
Ty(eq) = «eq + eT,(q), where ce F; and if we [K,L]. It is now 
clear how one can complete the proof of Lemma 2 by a simple accounting of 
the weights. 

Now we form the tensor product V@C relative to F and make 4 
complex (_V@C,d) by extending d over V@C in the same way as we 
extended 6 in our construction of the complex (V @ 4,8). Our isomorphism | 
¢ of (A,8) into (C,d) evidently yields an isomorphism of (V © A, 8) into 
(V®C,d) which sends v @a into v®@¢(a). On the other hand, we shall 
prove that the complex (V @C,d) is isomorphic with the complex (Cx, d), 
where C’;, is the direct sum of m copies of C, and d is applied coordinatewise 
on C;,. We shall treat C,, as a C-module in the natural fashion, and our isomor- 
phism will be C-linear. In V @ C, we have dv; = ~ » @ i. 


Hence a C-linear isomorphism of (V @ C, d) tat (Cn, d) sai map the t; 
into elements w; of =Qm which satisfy the relations dw; = > eij,wxbi. 
isk 


r n 
These relations may be written in the form dw;—w; = Dd ( Dd eine) di, 
n p=1 i=1 
or, equivalently, d(wje;*) => ( > eij.w;,)bie?. Let y; denote the element 
i=l k>j 
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of Qm whose j-th component is equal to 1, while all its other components are 


rs | 

inf equal to 0. For j =m, our above condition becomes simply d(Wmem*) = 0. 
PF. | Accordingly, we define wm —Ymém. Now suppose that we have already found 
elements Wm, Wm-1,° Such that the above relations are satisfied, and 
i) furthermore such that each w; is of the form wz = yxex + fr, where fr © YriiQ 


4 J 

and 1; = >! Then we have dvj = v;®@ 1; + p;. Applying d, we obtain 

Dp 

0 =d(v;)rj; + dp; Substituting vj @r; + p; for dv;, we get pyrj + dp; = 0. 


‘| From this it evidently follows that we have also Pjr;-+ dP;—0, whence 

=0. By Lemma 1, there is an element hj in yjuQ + + 
» + ymQ such that dh; = Pjez*. We set (yj + Hence we can 
successively find which satisfy our above relations and are of 


the form described above. From the form of these elements, it is clear that 


‘ they constitute a free basis for C,, over C. The C-linear map which sends 

each v; into w; is now seen to be an isomorphism of the complex (V @ C, d) 
@ | onto the complex (Cm,d). This gives the following main result of this 
section.” 


Lemma 8. Let K be a Lie algebra over a field F of characteristic 0, 
and let L be a solvable finite dimensional ideal of K. Let V be a finite 
dimensional K-module. Then there exists a finite dimensional L-module U 
on which [K, L] is nilpotent, which contains V as an L-submodule, and which 
is such that the canonical homomorphism H(L,V) > H(L,U) annihilates 
every cohomology class of positive dimension. 


Proof. It is easily seen that there is no loss of generality in supposing 
that F is algebraically closed. Then we can use the above imbedding of 
(V@A,8) in (Cn, d). Clearly, Lemma 1 extends to Cm. Hence every 
' cohomology class for L in V which is of positive dimension is represented in 


Cn by an element of the form dc. The coefficients in Q» of c when expressed 
» | in terms of the b,’s, together with the elements of V (identified with a sub- 


‘space of Qm), span a finite dimensional subspace of Qm which, by Lemma 2, 
is contained in a finite dimensional L-submodule of Qm. Since the vector 
space H(L, V) is finite dimensional, it is now clear that the Z-module U of 
Lemma 3 can be found as an L-submodule of Qm. 


* The result of Lemma 3, except for the fact that U can be chosen so that [K, L] 
is nilpotent on U, is proved by a very different method in [8]. The proof given here is 
essentially that of [9]. 
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2. Zassenhaus’ module enlargement. We shall require a slight refine. | 
ment of a result of Zassenhaus on enlargements of modules. In proving it, | 


we follow Zassenhaus quite closely. 


Let S be a finite dimensional Lie algebra over an arbitrary field F, and 


let Rs denote the universal enveloping algebra of 8. Rg is generated by 1 


and the elements of S. We define an increasing filtration of Rs by subspaces 
Rm such that RpRg C Rpg, as follows: for m <0, Rm = (0); Ro =F; for | 


m =1, R» is the subspace of Rg which is spanned by all products (in Rs) 


of no more than m elements of S. Then the associated graded ring 3 Rm/Rmn | 


is isomorphic with the symmetric algebra built over 8, i. e., with a polynomial 


algebra in m variables, where n is the dimension of S. From this one verifies | 


easily that the maximal condition holds for the ideals (left, right, or two 
sided) of Rs. On the other hand, one shows easily that, if R is any algebra 
over F and A and B are ideals of finite codimension in FR (i.e., R/A and 
R/B are finite dimensional) and if, furthermore, B is finitely generated as an 


R-module, AB is also of finite codimension in R. Hence one concludes that, | 


if A and B are ideals of finite codimension in Rg, so is AB. 


Now consider a finite dimensional Lie algebra K which is of the form I 
K =8S+L, where 8 is a subalgebra and L an ideal in K, and Sn L= (0). | 


Then we have Rg Rx, Rr, Rr, and Rr == = From now on, 
the word ideal will mean two sided ideal. If C is an ideal of Rg, we denote 


by C’ the set of all elements ce C for which the commutator space [Z, c] j 


is contained in C[S,L]. It is not difficult to verify that C’ is still an ideal | 


of Rs. Furthermore, if C is of finite codimension in Rg, so is C’. In fact, 
let be a basis for Z which contains a basis for [S, L]. 
Now observe that le Rs’, whence Rs’ = Rg, so that, for every re Rg and 


every index i—1,---,g, we have [a%,7] => si(r)2;, with Ry. 
g=1 


Since the z,’s belong to a free system of generators for Rx over Rg, the 
coefficients s;;(2) are unique, whence the s;; are linear maps of Rg into itself. 
Put Qi; = sy7(C). Then C” is evidently the intersection of C and all these 
Qi; Since Rs/Qi; is isomorphic (via s;) with a subspace of Rs/C, it is 
finite dimensional. It follows now by an elementary argument that Rs/C’ 
is also finite dimensional, which was our assertion. 

We are now in a position to prove the result alluded to above. 


Lemma 4. Let K be a finite dimensional Lie algebra over the field F, 
and suppose that K—S-+L, where S is a subalgebra, L an ideal, and 
SOL=(0). Let N be a finite dimensional L-module, and suppose that 
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[S, L] is contained in an ideal of L which is nilpotent on N. Let M bea 
K-module which, as an L-module, is contained in N. Then there exists a 
finite dimensional K-module Z which contains M as a K-submodule and N 
as an L-submodule. 


Proof. Make a composition series (0) =N)>CMC*°°C N,=WN, 
such that each factor module N;/N;, is L-simple. Let A be the set of all 
elements of Rz which annihilate each Ni/Ni.. Since [S, LZ] is contained in 
an ideal of Z which is nilpotent on N, we have [S,LZ] C A. Since A con- 
tains the annihilator of N in Ry, it is of finite codimension in Rz. As we 
have seen above, this implies that A” is also of finite codimension in Rz, and 
evidently A’ annihilates NV. Now let B, denote the annihilator of M in 
SRs = Rg*, and define a decreasing sequence of ideals in Rg by putting 
(in the notation introduced above) By. = By’, for each i= 1. B,, and hence 
also each B; is of finite codimension in Rs. Now let C—=B,R, + B1A 
+:-+-++B,AT?+ RsA". One verifies in a straightforward fashion that C 
is an ideal of Rx. Furthermore, we have a finite set 7,- - -,2p of elements 
of Rs which span Rs mod. B,, and a finite set y1,- - +, Yq of elements of Ry, 
which span R,mod.A’. The products xy, span Re = Rgk, mod. 
+ RsA" and a fortiori mod.C. Hence C is of finite codimension in Rg. 
Using that Rs is the direct sum of the subspaces F and Rg*, we see from the 
definition of C that CQ R, =A’, and therefore annihilates NV. 

Now put 7 = (Rx/C) @N, where the tensor product is taken with 
respect to Ry. T is an Rx-module in the natural fashion, and, since C is of 
finite codimension in Rx, T is finite dimensional. Let 7 be the tensor product 
(Rx/C) ® M with respect to Ry. Since C annihilates M, we may regard M 
as an Rx/C-module, so that there is a canonical homomorphism of 7, onto M. 
Let Hy be the kernel of this homomorphism, and let H be the canonical image: 
of H, in T. Clearly, H is an Rx-submodule of 7, and we define Z to be the 
Rx-module T/H. We claim that the map n>1®@n-+4 is an R,-iso- 
morphism of N into Z, and that its restriction to M is an Rg-isomorphism 
of M into Z. 

In order to prove this, we observe that we have (C + Rr) N (C + Rg*tRz) 
=C. Hence, as a right module over R;, Rx/C is the direct sum of the sub- 
modules (C + R,)/C and (C+ Rs*R,)/C. By a standard result on tensor 
products, (Rx/C) ® N is therefore canonically isomorphic with the direct 
sum of the partial tensor products corresponding to our decomposition of 
Rx/C. Since CM R;, annihilates N, the tensor product of (C + R,)/C by 
N may be identified with N, and our map of N into Z then reads simply 
n—>n-+H. The elements of H are precisely the elements of the form 
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mod. C of elements u; in Rs*Rz such that m+ Suj-m—0, in M. From 
4 


this it is clear that HM N = (0). Hence we conclude that our map of N into 
Z is an R,-isomorphism. Finally, if me JM and re Rx, write r=a++bd, 
with ae R, and be RstRy. Then we have, r-(m+H)=—=r®m+H 
=r-m-+H, whence the restric- 
tion of our map to M is an Rx-isomorphism. This completes the proof of 
Lemma 4. 


3. Effacing of cohomology classes. A cohomology class c in H(L, V) 
is called effaceable if and only if there exists a finite dimensional L-module 
U containing V such that the canonical homomorphism H(L, V) — H(L, U) 
annihilates c. If c¢ is effaceable, and if U is such a module, we shall say 
that c is effaced in U, or that U is an effacing module for c. Everything 
we have done up to now was aimed at the following result. 


THEOREM 1. Let K be a finite dimensional Lie algebra over a field F 
of characteristic 0, and let M be a finite dimensional K-module. Let 8 bea 
maximal semisimple subalgebra of K. Then a cohomology class for K in M 
as effaceable if and only if it is annihilated by the restriction homomorphism 


Proof. Since every finite dimensional S-module- is semisimple, the 
canonical homomorphism H(8,M) — H(S,N) is an isomorphism whenever 
N is finite dimensional and M C N. It follows that the condition of Theorem 
1 is necessary. 

Now let Z denote the radical of K. Then, as is well known, we have 
K=8+1L, and Sn L=(0). By Theorem 13 of [5], there is an iso- 
morphism ¢y of the tensor product (with respect to F) H(S, F) ® H(L, M)* 
onto H(K,M); where H(L, M)¥* is the subspace of H(L,M) which consists 
of all the cohomology classes that are annihilated by K in the canonical 
structure of H(L,M) as a K-module. Under this isomorphism ¢y, the sub- 
space > H?(8S,F) H%(L,M)* corresponds to H"(K,M). The isomor- 


ptg=n 
phism ¢y can be described explicitly in terms of cochains, and it is then 


easily seen (in [5]) that it has the following properties. If p denotes the 
restriction homomorphism H (K, M) H(S, then p¢y is an isomorphism 
on H*"(8,F)@H°(L,M)*. Furthermore, if N is a finite dimensional 
K-module containing M, and if y denotes the canonical homomorphisms 
H(K,M)—>H(K,N) and H(L,M)->H(L,N), we have ydy(u®v) 
= d¢y(u®@y(v)), for all we H(8,F) and all ve H(L, 


m + > uj’ ® m;, where m and the m; belong to M, and the w;’ are the cosets F 
i 
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Now suppose that ce H"(K, M) and p(c) =0. Writec—¢u( 


peg=n 
with wp qe H?(S,F) H(L,M)*. If q>0 we have pdu(wp,) by 
the definition of ¢@y. Hence p(c) implies that pdu(wWno) = 90, whence 
we conclude that Thus +: Won). Combining 
Lemmas 3 and 4, we see immediately that there exists a finite dimensional 
K-module N containing 1 such that y annihilates every H4(L, M) with q > 0. 
For this module N, we have y(c) 0, by what we have remarked above. 
This completes the proof of Theorem 1. 

It was shown by Iwasawa in [6] that every two dimensional cohomology 
class for a finite dimensional Lie algebra in a finite dimensional module is 
effaceable. In characteristic 0, this follows from Theorem 1 and the well 
known fact that H?(S,M) (0). In characteristic p, Iwasawa’s result is 
much easier to prove. For the sake of completeness, we shall give a simplified 
version of Iwasawa’s proof, using the technique of Jacobson, [7]. 

Let K be a finite dimensional Lie algebra over the field F' of characteristic 
p, let M be a finite dimensional K-module, and f a 2-cocycle for K in M. 
Regard M as an abelian Lie algebra, and consider the extension (£, ¢) 
of M by K which is determined by f. The space of H is the direct sum 
(K,M) of K and M, the commutation is given by [(21, m1), (2, m2) | 
= 22], +f(%,%2)), and For eek, 
denote by D, the inner derivation of EH which is effected by e, so that 
D.(e’) = [e,e’]. Then each p*-th power of D, is a derivation of £. 
Let (%1,°°+,2%n) be a basis for K and write Dj;= Diz,o). Then, for 
each 1, some non trivial F-linear combination 2 ane De must vanish. Put 


Ui = > dix (2,0). Then each u; belongs to the center of Re. Let U be 
k 


the ideal of Rg which is generated by +, Un. Put N = Re*/(U + 
It is easily seen that NV is finite dimensional. The left multiplications of Ry 
induce on WN the structure of an Rg-module. We claim that the map 
m—>m + U + MRp* is an Rx-isomorphism of M into N. Indeed, for re K, 
we have x-(m + U + MRz*) = (2, 0)m + U + = + 
whence our map is an Rx-homomorphism. On the other hand, we can write 
the elements of Rg uniquely as linear combinations of ordered monomials in 
the elements of a basis of 1 and the elements (2;,0). Each ordered monomial 
occurring in an element of U must have positive degree in at least one of 
the (2,0), and each ordered monomial occurring in an element of MRz* 
must have total degree at least 2. Hence MA (U + MRzg*) = (0), and 
therefore our map M — N is an isomorphism. 

Now put g(x) = (2,0) + U-+ MRp*, for xe K. Then g is a 1-cochain 
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for K in N, and one checks directly that 6g =f. Thus N effaces the | 


cohomology class of f, and Iwasawa’s result is proved also in characteristic p, 

Now we recall (cf. [3]) that the cohomology groups of a Lie algebra 
K can be computed also from the complex of cochains for Rg* in the module M. 
These cochains are the n-linear functions (n —0,1,---) from Rx* to M, 
and the coboundary operator 8 is defined by the formula 


89(To; ° ° =To° +B (— 1)*9 (ro, ° 


If Q is an ideal of Rx* which annihilates M, so that M may be regarded 
also as an Rx*/Q-module, there is an evident canonical homomorphism of 
H(Rx*/Q,M) into H(Rx*,M). We shall say that a cohomology class ¢ is 
of finite type if it belongs to the canonical image of H(Rx*/Q,M) in 
H(Rx*, M) = H(K, M), for some ideal Q which is of finite codimension in 
Rx* and annihilates M. We shall then say also that Q reduces c. We assume 
throughout in what follows that K and M are finite dimensional. Then it is 
almost evident that every 0-dimensional cohomology class for K in M is of 
finite type. Indeed, such a cohomology class is represented by an element of 
M which is annihilated by Rx*, and we may take for Q simply the annihilator 
of M in Rx*. Similarly, we see at once that every 1-dimensional cohomology 
class for K in M is of finite type. In fact, we may take for Q the square 
of the annihilator of M in Rx’. 

Next we observe that, in positive dimensions, every cohomology class 
which is of finite type is also effaceable. In fact, if Q is an ideal of Rx* 
which reduces the cohomology class ¢ of positive dimension, the module 
Homr(Rx/Q,M) of all F-linear maps of Rx/Q into M is an effacing 
module for c. 


THEOREM 2. Every cohomology class of dimension 0, 1, or 2 is of finite 
type, and every effaceable cohomology class of dimension 3 is of finite type. 


Proof. We have already disposed of the trivial cases of dimensions 9 
and 1. Now let c be a two dimensional cohomology class for K in M. 
We have seen above that there is a finite dimensional effacing module N 
for c. Let Q be the annihilator of N in Rx*, and let f be a 2-cocyle for Rx’ 
in M which represents c. There is a 1-cochain g for Rxg* in N such that 
f(r, 8) =r-g(s) —g(rs), for all r and s in Rg*. Taking r in Q, we see 
that g(QRx*) C M. Take a 1-cochain h for Rx* in M which coincides with 
g on QRx*. Then the cocycle f; = f— 6h still represents c. We have 
f.(Q, Rx*) = (0), and a fortiori f,(Q?, Rx*) = (0). Using also that f, is a 


| 

| 

| 
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cocycle, we find that f,(Rx*,Q°) = (0). Hence ¢ belongs to the canonical 
image of H(Rx*/Q*, M). Since @ is of finite codimension, so is Q*, and 
we have shown that c is of finite type. 

Finally, suppose that c is of dimension 3 and is effaced in the finite 
dimensional K-module N containing M. Then c is the image of a cohomology 
class e in H?(K, N/M), under the coboundary map of the exact sequence for 
the pair (V,M/). As we have just shown, e is of finite type. Let P be an 
ideal of finite codimension in Rx* which reduces e. Then we can find a 
2-cochain g for Rx* in N which vanishes on P © Rg* + Re* @ P and which, 
when taken mod. W/, represents e. Its coboundary 4g takes values in M and 
represents c. Let Q be the annihilator of N in Rx*. Then it is evident that 
8g(u, v,w) = 0 whenever one of u, v, or w lies in PO Q. Hence PNQ 
reduces c, and Theorem 2 is proved. 

Our last argument can easily be generalized to prove the following result. 


THEOREM 3. Let K be a finite dimensional solvable Lie algebra over a 
field of characteristic 0, and let M be a finite dimensional K-module. Then 
every cohomology class for K in M 1s of finite type. 


Proof. We merely have to observe that every cohomology class for K 
in M which is of positive dimension is effaceable, and then prove our result 
by induction on the dimension, the inductive step being exactly like our 
above proof for the case of dimension 3. 

Finally, we shall show by means of an example that not every effaceablz 
cohomology class is of finite type. Let S be a semisimple Lie algebra over 
field F of characteristic 0, and let Z be the one dimensional Lie algebra 
over F. Put L—S @Z (direct sum). We shall exhibit a 4-dimensional 
effaceable cohomology class for Z in F which is not of finite type. It is 
known that H*(S,F) ~ (0). Let f be a 3-cocycle for Rst in F which is 
not a coboundary. We can evidently extend f to a 3-cocycle for R,* in F, 
still denoted f, such that f(a, b,c) 0 whenever one of the arguments a, 
b, or c lies in ZR;. On the other hand, let g be the 1-cocycle for Rz* 
in F such that, for some fixed non-zero element z of Z, g(z) =1, while 
g(Rs* + ZRr*) = (0). Now define h(a, b, c,d) =g(a)f(b,c,d). Then h 
is clearly a 4-cocycle for Rz* in F. Since the restriction of h to Rg* is 0, 
it follows at once from Theorem 1 that the cohomology class of h is effaceable. 
We shall show that the cohomology class of h is not of finite type. In fact, 
suppose that it is of finite type, and let P be an ideal of finite codimension 
in R;* which reduces it. Then there exists a 3-cochain & for R;* in F such 
that (h — 8k) (a,b, c,d) —=0 whenever one of a, b, c, or d belongs to P. 
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Taking a =z, we find that we have 


f(b, c, d) + k(2b, c, d) —k(z, bc, d) + k(z, b, cd) =0 


whenever one of b, c, or d belongs to P. On the other hand, using that 
h(a, b, c, d) = 0 whenever one of b, c, or d is equal to z, and that z is in the [| 
center of Rz, we see that, whenever one of b, c, or d belongs to P, we have F 


k(2b, c, d) = k(bz, c, d) = k(b, zc, d) — k(b, z, cd) = k(b, cz, d) —k(b, z, cd) 
= k(be, z, d) + k(b, c, 2d) — k(b, z, cd) = k(be, z, d) + k(b, c, dz) —k(b, z, cd) 
= k(bc, z,d) + k(b, cd, z) —k(be, d, z) — k(b, z, cd). 


Substituting this above, we find that f(d,c,d)—m(bc, d) + m(b,cd)=0 | 
whenever one of b, c, or d belongs to P, where 


m(u, Vv) = k(z, u, v) —k(u, z, v) + k(u, v, 2). 


Now the restriction to Rs* of f + 8m represents the same cohomology class 
in H*(S,F) asf. The last result shows that this cohomology class is reduced 
by PM Rs* which is of finite codimension in Rs*. Hence this cohomology 
class is effaceable, which is a contradiction because it is not 0, and S is 
semisimple. 


4, Finite dimensionality of kernels. Let Jf be a Lie algebra over the 
field F. Denote by D(M) the Lie algebra consisting of the derivations of 
M, and by I(M) the ideal of D(M) consisting of the inner derivations. 
If K is another Lie algebra over F we say that we have the structure of a 
K-kernel on M if we are given a homomorphism y of K into D(M)/I(M). 
A linear map tt, of K into D(M) which becomes yw when reduced mod. 
I(M) is called an operator map for the kernel (M,). We can find a bilinear 
alternating map 7 from K to M such that [t,, ty] — tty; is the inner deriva- 
tion of M which is effected by 7(a,y), for all « and y in K. Such a mapt 
is called a hindrance belonging to the kernel (/,y). The formal coboundary 


(x,y, 2) = te(r(y, z)) 2)) + t2(r(a, y)) 
— r([z, y|,2) + y) —r([y, r) 


lies in the center N of M, on which there is an induced structure of a K- 
module (the operators being given by the t,).. Furthermore, 87 is actually | 
a 3-cocycle for K in N, and its cohomology class in H*(K,N) depends only | 
on the kernel (M/,y), and not on our particular choice of ¢ and 7. This 
cohomology class is called the obstruction of our kernel (M,y). The kernel 


| 
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(M,y) is said to be extendible if it arises in the natural fashion from a Lie 
algebra extension of A by K, which is the case if and only if its obstruction 
is 0. One can define an operation of addition of kernels with the same center 
N and then partition these kernels into equivalence classes, two kernels M, 
and M. being equivalent if and only if there are extendible kernels U, and U2 
such that M,-+ U,—M.-+ U.. The equivalence classes of kernels with a 
fixed center N can be shown to constitute a vector space over F’ (the zero 
element being the class of the extendible kernels) such that our above asso- 
ciation of an element of H*(K,N) with a kernel (M,y) induces an isomor- 
phism of the space of the kernel classes into H*(K,N). Moreover, we have 
shown in [4] that this isomorphism is onto by giving a construction which 
attaches to a given 3-cocycle for Rx* in N a kernel (M,y) with center NV 
whose obstruction coincides with the cohomology class of the given 3-cocycle. 
If the given 3-cocycle is the canonical image of a 3-cocycle for Rx*/Q in N, 
where Q is an ideal of finite codimension in Rx* and annihilating NV, we can 
carry out our standard construction with the algebra Rx/Q in the place of Rr. 
This yields (assuming that K and WN are finite dimensional) a kernel (J, p) 
which is finite dimensional and still has the same obstruction. Hence we 
have the following result. 


THeoreM 4. Let K be a finite dimensional Lie algebra over a field F, 
and let N be a finite dimensional K-module. Then every cohomology class 
in H®(K,N) which is of finite type, or, equivalently, which 1s effaceable, 
is the obstruction of a finite dimensional K-kernel with center N. 


It seems likely that the converse of this result is also true. However, 
we have succeeded in proving this only in the case of characteristic 0. In 
this case, we have actually the following result. 


THEeorEM 5. Let K be a finite dimensional Lie algebra over the field F 
of characteristic 0, and let N be a finite dimensional K-module. Let S be 
a maximal semisimple subalgebra of K. Then an element H*(K,N) is the 
obstruction of a finite dimensional K-kernel if and only if its restriction to 
S is 0. 


Proof. If ce H(K,N) and if its restriction to S is 0 then ¢ is 
effaceable, by Theorem 1. Hence, by Theorem 2, ¢ is of finite type. By 
Theorem 4, ¢ is therefore the obstruction of a finite dimensional K-kernel. 

Conversely, suppose that ¢ is the obstruction of a finite dimensional 
K-kernel (M,y). Then y(S) is a semisimple subalgebra of D(M)/I(M). 
There is a subalgebra FE of D(M) which contains 7(M) and for which 
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E/I(M) = (8S). Since every finite dimensional extension by a semisimple 
Lie algebra is trivial (which is essentially a consequence of the vanishing of 
the 2-dimensional cohomology groups of semisimple Lie algebras), we can 
find an isomorphism p of ¥(S) into # (and thus into D(M)) which is inverse 
to the canonical homomorphism of F onto y(S). Now we can evidently find 
an operator map x—>1?, which coincides with py on S. To this, we can find 
a hindrance + which vanishes on S@S. Then 47 vanishes on SOS@OS, 
whence it is clear that the restriction of c to S is 0. 
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SUMS OF DISTINCT DIVISORS.* 


By B. M. STEwarRrT. 


1. Introduction. For a given positive integer M, let «(M) indicate 
the number of integers n which can be written in the form 


(1) n= Dd 


where the d are distinct positive divisors of 17. Obviously «(M) So(M), 
the sum of all the divisors of WV. 

For a given set S of integers MM, define S* to be the subset of S con- 
taining those integers for which o(J/) —a(J/) is minimal. 

Thus for the set J of all positive integers, the set 7* contains all integers, 
such as MW = 2", for which o(J/) — «(M) =0. However, for the set O of all 
odd integers M = 5, the set O* contains all integers, such as M = 945, for 
which o( J) — «(J/) = 2, for certainly both 2 and o() —2 defy repre- 
sentation in the form (1). 

In this paper we give complete structure theorems for 7* and O* and 
indicate their application to problems of the Egyptian fraction type. We 
also show that the function «(M/)/o(M) is everywhere dense on the interval 
0 to 1. 


2. Structure theorems for I*. It is clear that 1/ = 2* belongs to I*, 
because any n in the range 1 =n So(M) = 2***—1 when written in the 
binary notation is in the form (1). 


THEOREM 1. If M belongs to I* and p is a prime with (p,M) =1, 
then a necessary and sufficient condition that M’ = p*M, k = 1, belong to I* 
is that pSo(M) +1. 


Proof. The smallest divisor of M’ not a divisor of M is p. If 
p>o(M) +1, then n =o(M) + 1 defies representation with respect to M’. 

Conversely, suppose pSo(M)+1. We show by induction that 
p* So(p*1M) +1. This is obvious for Using the induction hypo- 
thesis on k, we have 


* Received May 23, 1953, revised January 28, 1954. Presented to the American 
Matematical Society, April 25, 1953. 
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+ pS po(p*M) + o(M) +1 +1. 


We assume as an induction hypothesis that p*"M belongs to J*. To show 
that M’ = p*M belongs to I*, we consider the intervals ranging from Qp* 
to Op* + o(p**M) for Q@=—0,1,---,o(M). It follows that no integer n 
in the range 1=n=o(M’) is omitted from all these intervals. For on 
the one hand, because p* = o(p**M) + 1, we have 


(Q+ 1) pS + o(p'*M) +1, 


hence the intervals are overlapping or contiguous. On the other hand, the 
intervals include 1 and p*o(M) + o(p**M) =o(M’). Thus each n can 
be written 


n=Qpr+R, OSQSo(M), 


Since M and p**M are assumed in 7*, we can write Q = > d, where the d 
are distinct divisors of M, and we can write R—=>D, where the D are 
distinct divisors of p*"M. Then 


n=Sd+>DD 


where the d’ = p*d are distinct from the D because of involving the factor p* 
and both the d’ and the D are divisors of 1’. Hence M’ belongs to I*. 


CorotLary 1. M belongs to I* if and only if M is of the form M = 24, 
a= 0, or of the form ; 
where the primes p; satisfy the following conditions: 
< Pi So (27) + 1, and 
Pir So + pis) + 1, j=1,2,---,k—1. 


Proof. It is easy to see that 1 is the only odd number belonging to I*. 
We have noted that M = 2¢ belongs to J*. The corollary then follows from 
Theorem 1 by induction on k. 


3. Practical numbers. A. K. Srinivasan [1] has defined a practical 
number M (i.e., a number of advantage for use in weights, measures, or 
coinage), requiring that Jf have property (1) for all n with lLS=n=<M. 
But he failed to find a complete structure theorem for these numbers. 

It is obvious that every number belonging to J* is a practical number. 
We show below that the converse is true. It follows that Corollary 1 answers 
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Srinivasan’s question about the complete structure theorem for practical 


numbers. 
To establish that every practical number belongs to J*, we simply review 


the proof of Theorem 1, noting that if M is assumed practical, the criterion 
of Theorem 1 applies to decide whether M’ = p*M is practical. For if 
p>o(M) +1, then not only does n==o(M) +1 defy representation in 
the form (1) relative to M’, but also n< M’. Thus the condition of 
Theorem 1 is necessary for M’ to be practical. Since the condition is 
sufficient to make J’ belong to I*, it is sufficient to make M’ practical. 


4, Values of «(M)/e(M). We need the following lemma. 
Lema 1. If p ts a prime such that p>o(T), then 
a(pT) = a(T) {a(T) + 2}. 


Proof. Every sum of divisors of pT must have the form n = mp + nz 
where n, and nz are either 0 or a sum of divisors of JT. Furthermore, since 
the maximum value of n. is (7) and p >o(T), no two of these n can be 
equal unless they have the same n, and mz. Since there are a(7’) + 1 choices 
for each of nm, and n., except that the choice 0,0 must be omitted, we have 


a(pT’) = {a(T) +1}? —1—a(T) {a(T) + 2}. 
We define s(M) = «(M)/o(M). 


THEOREM 2. The values of s(M) are everywhere dense on the interval 
0 to 1. 


Proof. For given real numbers x and y such that O0< t< yl, we 
must produce an integer K such that r<s(K)<y. We try to find 
K = pM where M is in I* and p >o(M), for then by Lemma 1 


s(K) = s(pM) = o(M){o(M) + 2}/(p + 1)o(M) = {o(M) + 2}/(p + 1). 
If we set w= 1/y, u(1 +) =1/2, our problem is to find M and p so that 
u{o(M) +2} <p+1<ufo(M) +2}(1 +6). 


On the one hand we know from Theorem 1 that there exist arbitrarily large 
numbers M belonging to 7*. On the other hand we have the Cahen-Stieltjes 
theorem that for any « > 0 and for sufficiently large v = u{o(M) + 2} there 
must exist at least one prime p in the interval y—1< p< (v—1)(1+ 6), 
hence such that v< p+1<v(1+e). Since w=1, such a prime p has 
the required property p > o(M), which completes the proof. 
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It seems difficult to decide whether s(J/) will take on a specified rational | 
value. - Even s(J/) =1/3 requires some search before we find a solution 
such as M 12-89. For s(J/) =4/5 the simplest solution we know is 
M == 2°-73-47269. 


5. Structure theorem for O*. A necessary condition that M belongs to | 
O* is that M be a multiple of 15, for unless M has the factors 3 and 5, the 
numbers n = 3 and n =5 will be exceptions to (1). 

Arrange the + divisors d; of M in natural order 


d,=1<d.=—=3< -<d-—H, 
and let o; denote the sum of the first 7 of the dj. 


THEOREM 3. Necessary and sufficient conditions that M =15T belong 
to 0* are that for 1 = 3, 


either (a) S oi — 2, dis, ~ oi — 4; 
or (b) dis. = oi — 4, diss = oi — 2. 


Proof. For numbers of O, it is clear that 2 does not have the form (1). 
Because of the ordering of the d;, it follows that if dis >o;— 2, then 
2 does not have the form (1). If =o;,—4, then n = o;—? 
= +2 again defies representation unless 2. Therefore the 
conditions (a) or (b) are necessary when i = 3 if M is to belong to O*. 

Conversely, suppose (a) or (b) satisfied when i= 3. As an induction 
hypothesis on 7 assume when 3 =[1=7—1 and 2<2< o,—2 that either 
d using distinct d= d; or that r—o,,—2 This 
hypothesis is easily verified when 1 = 3. 

Consider y where 2< y < When y<o,—2 set and 
use the induction hypothesis to write y= Sd where d= d;,,. When 
o—2Sy< wet and note from (a) or (b) that 
0S=r<0,—2. Except when or r—d;,, —d;+ 2, it follows from 
the induction hypothesis that y= Sd with distinet dS dj,,. 

When = di,, = dj + 2 — 2 we find that y = dis + t= dj + 014 
=o; When the conditions o, and dy, 
imply that either y= oi, —1, or o, The first two cases obviously 
allow y= > d with d=d;,,._ The last case y dj,, + 2 =o; — 2 because 
of (b) has the representation y = d,s. 

Thus we have established for the case i+ 1 the exact counterparts 
of the induction hypothesis on i. It follows that every n_ satisfying 
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29<¢n<0(M) —2 has a representation (1); so do n=1, o(M) —1, and 
o(M), hence M belongs to O*. 


TueroreM 4. If M belongs to O* and p ts an odd prime, then M’ = pM 
belongs to O* if and only if 


2p<o(M)—2, 2p~o(M) —4. 


Proof. Let the divisors and partial sums of M’ be indicated by d,’ and 
oj, respectively, and note that the only divisors of M’ which are not divisors 
of M are multiples of p**? where M = p*A, a=0, (p,A) —1. 

Supose that the first ¢ divisors of M’ coincide with those of M, so that 
d;=dj/,i<t,3<¢%. Then the properties (a) or (b) of M as described in 
Theorem 3 carry over to J’ for 3S1< t. 

Next consider cases  < k where d, S dy’ < S dru. By Theorem 
3 applied to MZ we have 

S S of — 2 + —45 on — 4, 
hence case (a) holds for J/’ and k. 

Finally, consider cases where MS dj’ < SM’. Then diy’ = M’/d; 
where d; < p hence d; divides VW as the notation indicates. When M/d; > 5, 
it follows from Theorem 3 applied to WZ for 1= 3 that 


M/d; S o( 31) — M/d, — M/d, —: - -— M/d; — 2. 
Hence 
da! = pM/d; S po(M) — M’/d, — - -— M’/dy— 2p 
< + po(M) — M’/d, — M’/d, - M’/d; —4 = —4, 


so that case (a) holds for MZ’ and #. There remain only the possibilities 
that di.’ = p, 3p, or 5p with M < dy’. In case 3p < M < Sp we have 
p< M/3 <o(M) —4, hence dy.’ =5p <o(M) + p+3p—4—c0,' —4 
so that (a) holds for 1/’ and k. In case p << M < 3p or M < p, the condition 
dis’ = 8p —2 =o + p—2 is necessary for M’ to belong to O*. 
The contingency suggested by (b) is ruled out for it would require 
diss’ = 3p = dyyo’ — 2 = 5p — 2, hence case (a) must hold with 3p ~Ao(M) 
+p—4. Thus the conditions of Theorem 4 are necessary. Conversely, when 
these conditions are satisfied, then 


duis’ = 5p <o(M) + p+ 3p—4—o%' —4; 


and if M < p, the condition d,,.’ = p < o(M) —4=o,’ —4 holds. 
Therefore for all k =3 the conditions of Theorem 3 hold and J’ 
belongs to O*. 
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Corottary 4. If the prime p divides M and .M belongs to O*, then 
M’ = pM belongs to O*. 


Proof. We have noted that M157 hence when p> 5, we have 
o(M) > 15p > 2p + 4; and if p=—83, or 5, we have o(M) > 15 > 2p +4; 
so the conditions of Theorem 4 hold. 

We cannot give any simple catalog for O* like that for J* in Corollary 1, 
because there are infinitely many different types of “base” numbers in 0* 
instead of the single type 2* noted for 7*. However, a few examples will 
help fix ideas. 

Since J = 15T has o; = 9, it follows from Theorem 3 that if M is in 0%, 
then d, 7%. This implies o, 16 and hence that d; 9, 11, or 13. It 
happens that all of 3°-5-7, 3-5-7-11, and 3-5-7-13 fail to belong to 0%, 
but only because d, >o,;,— 2. It is then easy to check, for example, that 
M = 3°:+5-Tp is in O* for any odd prime p= 103. In particular, when p = 3, 
we find If = 945, the smallest integer in O*. For use in the sequel, we apply 
Theorem 4 to see that M = 3¢-5-7 is in O* for a= 3. 

Another special case of interest is that if M; is the product of the first k 
odd primes, then M;, is in O* when k = 5. 

The author is indebted to the referee for noting and insisting on possi- 
bility (b) in Theorem 3, despite the fact that the smaller numbers in 0* 
consistently satisfy (a). By considering the arguments of Theorems 3 and 4, 
the author has constructed the following example: M = 3°:5?-7?+229pq where 
prime p = 2,641,663 and prime g = 7,924,991, such that M is in O* and has 
ds, = 3p = 055 — 4 and ds; = g = 55 — 2, so that case (b) occurs for 1 = 58. 


6. Egyptian fraction problems. Let a positive rational number with 
numerator 1 be called a unit fraction. Then the usual Egyptian fraction 
problem is to show that any given positive rational number z may be written 
as a sum of a finite number of distinct unit fractions. A variation of this 
problem proposed by E. P. Starke [2] is to show that if x has an odd 
denominator, then « may be written as a sum of a finite number of distinct 
unit fractions with odd denominators. 

Whether u,—1/n or un—1/(2n-+1), the sequence where 
S:;=u, +u,.+----+ uu, is divergent, hence it is easy to replace the 
problem of representing an improper fraction x by that of representing a 
proper fraction A/B = «— 8S; where A/B < u;., is so small that unit frac- 
tions used in its representation cannot duplicate those used in the partial 
sum S;. 
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For the usual Egyptian fraction problem easy constructions are known, 
but an amusing variant is provided by our knowledge of numbers belonging 
to 1*. By Corollary 1 we can find C, one choice being C = 2° for a sufficiently 
large a, so that BC — M will belong to I*. Since AC < M, we may write 
AC = > d, using distinct divisors d of M. If we set dd’ = M, then 


A/B = AC/BC = 3S d/M=31/d 
solves the problem. 

For Starke’s problem, our knowledge of numbers belonging to O* pro- 
vides the same sort of solution. We can find C, one choice being C = 37(945) 
for a sufficiently large a, so that BC = M, by Theorem 4, will belong to O* 
and so that 2 << AC < M, and then proceed as explained above. 


MICHIGAN STATE COLLEGE. 
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PATHS AND CIRCUITS IN CRITICAL GRAPHS.* 


By JouNn B. Ketty and L. M. KELLy. 


1. Introduction. An admissible coloring of a graph is an assignment 


of colors to the nodes in such a way that no two nodes which are joined 
are assigned the same color. If, in coloring a graph G, & is the minimum 
number of colors which may be employed, then G is said to be k-chromatic. 
In analyzing k-chromatic graphs one is led naturally to a consideration of 
those graphs which ‘ just barely’ need & colors. Specifically, a graph is called 
critically k-chromatic or k-critical if any proper subgraph can be colored in 
less than & colors. 

Each k-critical graph of order n has associated with it a maximal circuit 
length. Let the minimum of all such maximal circuit lengths for a fixed n 
and k& be denoted by Z,(n).1_ A number of theorems and conjectures advanced 
by Dirac [1] may be conveniently formulated in terms of this function. 
Thus, for example, he has shown that L;,(n) = 2k—2 provided k => 3 and 
n = 2k—2. We shall be interested here in the behavior of L,(n) for large 
values of n. Dirac has conjectured that: 

(i) lim L,(n) = and (ii) lim inf L,(n)/n4t > 0, k = 3. 


n> co 


In section 3 of this paper we prove that (i) is true while in section 4 we show 
that (ii) is false when & = 4 and in fact that 


lim inf L,(n) /log?(n) S 3/log?(27/4). 


We believe that (ii) is false for all values of k but have been unable to 
establish this. The determination of the true order of magnitude of L;(n) 
even for k = 4-seems to be a difficult problem. 

Let S;,(”) denote the function dual to Z;,(n), that is, the maximum of 
the minimal circuit lengths contained in the critical k-chromatic graphs with 
n nodes, n and & fixed. About this function we know even less than we know 
about L,(n). We conjecture that lim sup S;(n) =o, k=3, but have 
been able to prove only that 1. u. b. S8,(n) = 6, k = 8 or in other words, that 
there exist for any integer hk, k => 3, k-chromatic critical graphs with no 
circuits of length less than 6. This is demonstrated in section 5. 


* Received March 2, 1954. 
1 Relatively simple examples show that the function Z,(n) is defined for all n and 
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CRITICAL GRAPHS. 


2. Preliminaries. Definitions and Notation. 


Definition 2.1. A graph G is a collection of elements N called variously 
nodes, vertices, points, etc., and a collection H of unordered pairs of these 
elements. If (v:,v2) is an element of EF, then we say that v, and v2 are 
joined and the pair is called an edge. 


Definition 2.2. A subgraph of G is a subset M of N together with those 
edges (a, y) of EZ such that ce M and ye M. We note that every subset of 
the nodes N determines a subgraph. 


Definition 2.3. A k-coloring of G, k a positive integer, is a mapping f 
of N into the integers 1, 2,- - -, such that if (v1, v2) e then f(v1) Af (v2). 
A graph G is called k-chromatic if it admits a k-coloring but no & — 1 coloring. 


Definition 2.4. A graph @ is called critically k-chromatic or simply 
k-critical if it is k-chromatic but each of its proper subgraphs is r-chromatic, 
r<k. It is easy to show that if a k-critical graph has n nodes, then any of 
the subgraphs with n —1 nodes is k —1 chromatic. 


Definition 2.5. The order of a graph is the number of its nodes. In 
this paper all graphs are of finite order. The degree of a node is the number 
of edges to which it belongs. 


Definition 2.6. A path in a graph G is a set of nodes v,02,° * *, Um 
such that (v,v2), (V2, Us),* * *> (Um-1; Um) are all edges, together with those 
edges. The number of nodes is the length of the path. 


Definition 2.7%. In the definition of a path it is understood that the 
nodes are distinct. If however we allow the first and last nodes to be 
coincident we then refer to the path as a circuit. The length of a circuit 
is the number of its nodes. 


Definition 2.8. A graph is connected if each two of its points may be 
joined by some path. A connected subgraph, not a proper subgraph of any 
connected subgraph of G, is a component of G. That is, a component is a 
maximal connected subgraph. 


Definition 2.9. A set S of nodes of a connected graph G is called a 
cut set of G, if G— 8S is not connected. Here G— S is the subgraph defined 
by the nodes of G which are not in 8. Throughout the paper we shall use 
this notation. Thus A + B will be the subgraph of G defined by the set of 
nodes A and the set of nodes B. The index of a cut set § relative to G is 
the number of components in the graph G— 8. 
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Definition 2.10. A graph G@ is Hamiltonian if there is a path in G 
including all the nodes of G. 


3. The proof of conjecture (i). 


Lemma 3.1. If f is a k-coloring of a graph G, and P 1s a subset of p 
nodes of G, then there exists a k-coloring F of G such that 1S F(z) Sp, 
zeP. 


Proof. If p=k there is nothing to prove. If p< k the lemma follows 
by a simple interchange of functional values. 


THEOREM 3.1. If P is a cut set of p points and of index r of a k-critical 
graph G, then r < p?. 


Proof. Suppose C,,C.,---,C, the components of G—P. Since 
G — C, is colorable in & —1 colors, it follows that there is a coloring func- 
tion F defined on G— C, with positive integral values less than &. Further- 
more this function may be so chosen that 1S F(z) Sp for zeP. This 
function F in turn induces coloring functions fio, f13, fis,: fir defined 
on respectively, such that l= k—1, 
1Sfu(z) Sp, weP and = fiis(@) -—fir(z), ve P. Similarly 


for G—C,, etc. The resulting functions may be conveniently exhibited thus: 


Clearly if r exceeds the maximum number of ways of coloring p points with 
p colors, namely p?, then the functional values of the functions in at least 
two of the rows, say 1 and 2, must coincide on P. That is = fis 
= fir(2) = for = fos(x) for(x) for all in P. Thus if we define | 


fa(v),veC, +P 
we 


then the result is a k — 1 coloring of G contrary to the hypothesis. Therefore 
r< p?. 

This result can easily be improved to r < p?*. From this we conclude 
that a k-critical graph has no cut points and that a cut pair in a critical 
graph is of index 2. 


for — fos * fer 
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THEOREM 3.2 (Dirac). If in a connected graph G, the mazimal degree 
of the nodes is v and the length of the longest path is 1, then the order of G 
cannot exceed v-+v?+ 


CoroLLaRY 3.2.1. If a graph G has c components, the maximal degree 
of the nodes is v, and the length of the longest path in G 1s l, then the order 
of G cannot exceed c(v-+0v?+---+0"). 


Definition 3.2. The extension of a path P = pipops: - * pr is the sub- 
graph defined by the collection of nodes p, of G such that pz is an element 
of a path pipops* Px but pz is not an element of P. We designate 
the extension of P by P*. 


Lemma 3.2. A component of P* is a component of G—P. 


Proof. This follows at once from the fact that a node of P* and a node 
of G— (P + P*) cannot be joined by a path which contains no points of P. 
It is of interest to observe that a path P will be a cut set of G@ if neither 
G— (P+ P*) nor P* is empty. 

THEOREM 3.3. lim L;,(n) = ©. 


Proof. Let G be a & critical graph with maximal path length / and 
classify the points of G as follows. A point p is of type 7 provided it is the 
i-th point of some path and 7 is the largest number for which this is true. 
Clearly every point of G is in some unique class 1, 2,3,---,l. First observe 
that points of type J have degree less than 7. This will serve as an anchor 
for a (decreasing) induction. 

Suppose now that nodes of type z, i = have maximal degree O,(1, 
where O; is a bounded function of J and /, and consider a point pi. of type 
t—1. Such a point is, by definition, an element of a path pipops-- + pin =P. 
Now the vertices of P* are all of type greater than i—1 and hence by the 
inductive assumption are all of degree less than or equal to O;. Since the 
number of components of P* is less than or equal to the number of com- 
ponents of G— P, and this in turn is, by Theorem 3.1, less than or equal 
to /' it follows from Corollary 3.2.1 that the order of P* cannot exceed 
027 +---+0;') +1=0,,(1,k). Thus the degree of a point of 
type i— 1 is a bounded function of J and k. It follows that if 7 and k are 
fixed then there is a finite upper bound to the degree of the vertices of G. 
Finally, since a k-critical graph is connected we conclude from Theorem 3. 2 
that the order of G is a bounded function of 7 and & or, what is the same 
thing, 7 must go to infinity with n. Now Dirac [1] has shown that a graph 
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without a cut node which contains a path of length 1 > 1 contains a circuit 
of length exceeding (21) from which fact our theorem follows. 


4, Four chromatic graphs. 


The following lemma asserts, in essence, that it is possible to color any 
circuit C in three colors subject even to the restriction that each node be 
denied one of the colors (the forbidden color can vary from node to node) 
provided that the excluded color is not the same for every node. The applica- 
tion of the lemma will be to 3-critical graphs which are easily seen to be odd 
circuits. 


LeMMA 4.1. Associated with each non-constant integer-valued function 
F(x), 1S F(x) S3 defined on the nodes of a circuit C there is a 3-coloring, 
f(x), of C such that F(x) ~f(x) for any reC. 


Proof. Choose the labelling a,a.a3° - -d» of the vertices of C such that 
F(a,) (az). Now define f(a2) —F(a,) and f(a) one of the 
integers 1,2,3, but tAf(a.) and +—=3,4,---,n. Finally 
define f(a,) =2,cAf(an) and eAF(a,) =f (az). 


THEOREM 4.1. For infinitely many values of n, 
L,4(n) S 8 log? n/log? (27/4). 


Proof. Let T be a set of 34 —2 pairwise disjoint nodes, where hh is an 
odd number. Let {H,} be the collection of pairwise different subsets of h 
elements of 7, and {C,} a collection of replicas of C, a circuit with h nodes, 


p==l,2,°°:, “~~ and connect each node of H, with each node of C, 


in a 1-1 fashion. Let G be the graph whose nodes are the elements of T and 
the nodes of the various replicas of C with the edges as indicated above. 
Then we claim (1) G is 4-chromatic and (2) G is critical. 


Proof of (1). We show first that G is not 3-chromatic. If G were 
3-chromatic then some one of the colors would have to be used at least h 
times in coloring 7’, since T contains 3h — 2 nodes. Call this color A, and 
suppose H, is a set of h nodes of T carrying the color A, in some 3-coloring 
of G. The replica C, each of whose nodes is joined with a node of Hu, is 
3-chromatic since h is odd, but A, cannot be used in its coloring. Hence 
three additional colors must be used which contradicts our assumption that 
G is 3-chromatic. 

To color G in four colors we have merely to use a single color to color 
the nodes of T and three additional colors to color the various replicas of C. 
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Proof of (2). The nodes of G are of two types: (i) those belonging to 
T, and (ii) those belonging to some C;. We show first that the removal of 
a node of type (i) renders G 3-chromatic. We color the remaining 3h —3 
nodes of 7’ with three colors using each color on exactly h—1 nodes. Then 
each of the C; may be colored with the same three colors by virtue of 
Lemma 4.1, for in no case is the same color excluded from all h nodes of 
any replica of C. 

Now we show that the removal of a node of type (ii) renders G 3- 
chromatic. There is no loss of generality in assuming that the removed node 
zis an element of C,;. We color the nodes of H, with color A, and distribute 
colors A, and A; equally among the remaining 2h —2 nodes of T. Ci—z 
may now be colored with A, and A, while, in view of Lemma 4.1, the 
remaining replicas may be colored with the three colors. This completes the 
proof of 2. 

We now examine the maximal circuit length in G,. In order to pass 
from one replica of C to another via a circuit it is necessary to pass through 
one of the 3h — 2 nodes in 7. It follows that the length of any circuit in G is 
at most (3h — 2)(h + 1). The number of nodes in G is ‘- 7 *) h+ 3h—2. 


Our theorem now follows from Stirling’s formula. 


Remark. If we could prove the analogue of Lemma 4.1 for critical 
k-chromatic graphs with & > 3, then we could easily show that 


lim inf L;,(n) /log? n <0. 


It would be necessary merely to parallel the steps of our proof of Theorem 
4.1 replacing C by a k-critical graph satisfying the conditions of the theorem, 
modifying 7 appropriately, and using induction on k. 

Dirac [1] conjectured that it might be true that every 4-critical graph 
is Hamiltonian (see definition 2.10). Theorem 4.1 shows that this con- 
jecture is incorrect. 


5. Minimal Circuit Length. 


THEOREM 5.1. For any k there exists a k-chromatic graph with no 
circuits of length < 5. 


Proof. We use induction on k; the theorem is obvious when k ~1. 
Let G;, be a k-chromatic graph with no circuits of length less than or equal 
to 5. Using G, we shall construct a k +1 chromatic graph Gy, with no 
circuits of length less than or equal to 5. 
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Suppose G, has « nodes. Let T be a set of ka nodes, pairwise disjoint. 


Let {H,}, r—=1,2,-°°-, (‘*) be the collection of pairwise different subsets 


of nodes of 7. Form io replicas {G;"} of G, and connect the node of H, 
with the nodes of G," in a 1-1 fashion. Let G’,,; be the graph consisting of 
T and the replicas of G;, the various nodes being joined as described above. 
Then we claim: (1) G’z,; is at least & +1 chromatic, and (2) G’x.1 con- 


tains no circuits of length less than or equal to 5. 


Proof of (1). Suppose G’,,, were k-chromatic. Consider any coloring 
of G’x.1 by & colors. Then at most & colors are used to ‘color 7, in this 
coloring. Since 7 has ka nodes some color, say A, must be used at least 2 
times in coloring T. Let H, be a set of « nodes of T all of which are colored 
by A. Since G;" is & chromatic and since A cannot be used in its coloring 
it follows that at least & colors in addition to A must be used to color G@’,,);. 
Thus G’;,,; is at least k + 1 chromatic. 


Proof of (2). By our inductive hypothesis, no circuit of length less 
than or equal to 5 in G;,, can be composed only of nodes of a single replica 
of G,. Such a circuit must therefore contain a node of 7’, since no two nodes 
in different replicas are joined. It is clear that the shortest possible circuit 
starting and ending at some node, a, of 7 must be constructed as follows: 
The circuit starts at a, then it proceeds to a point in ‘some replica of G,. 
The next node in the circuit must be in the same replica; then must come a 
node in T’,, say b, different from a. From 0 the circuit enters a second replica, 
passes through two nodes and then returns to a. This shortest possible circuit 
is of length 6. 

If G’x41 is not k + 1 chromatic it has a & + 1 chromatic subgraph which 
of course has no circuits of length less than or equal to 5. This subgraph is 
the desired G;.:. We may suppose if we wish that G,,, is critical. 


Added in the proof. It has recently come to our attention that the result 
of Theorem 5. 1 has been obtained by W. T. Tutte. Cf. American Mathematical 
Monthly, vol. 61 (May, 1954), p. 352. 


MICHIGAN STATE COLLEGE. 
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CONCERNING THE IMBEDDING OF UPPER SEMICONTINUOUS 
COLLECTIONS OF CONTINUA IN CONTINUOUS 
COLLECTIONS OF CONTINUA.* 


By Mary-E1izaABETH HAMSTROM. 


1. Definitions and notation. The purpose of this paper is to establish 
for certain upper semicontinuous collections G of mutually exclusive con- 
tinuous curves necessary and sufficient conditions that there should exist 
continuous collections W of mutually exclusive continuous curves such that 
G is imbedded in W. Throughout this paper the space considered will be 
the Euclidean plane, 


Definition 1. If G and W are collections of mutually exclusive point 
sets, G is said to be imbedded in W provided it is true that each element of G 
is contained in some element of W and each element of W contains one and 
only one element of G. 

If the collection G is imbedded in the collection W, then (1) if g is an 
element of G, W(g) will denote the element of W containing g and (2) if H 
is a subcollection of G, W(H) will denote the subcollection of W in which H 
is imbedded. In the remainder of this section G will denote an upper semi- 
continuous collection of mutually exclusive compact atriodic continuous 
curves which is an are with respect to its elements. The endelements of G 
will be denoted by a and b. 

If H is a collection of point sets the sum of the elements of H is denoted 
by H*. If H is an upper semicontinuous collection of point sets a sub- 
collection of H will be said to be closed (totally disconnected, connected, 
perfect, open) provided it is a closed (totally disconnected, connected, perfect, 
open) subset of G with respect to its elements. 


Definition 2. The are h which is an element of G is said to be a K-are 
or K-curve of G provided it is true that there exist two simple closed curves 
J, and J, with mutually exclusive interiors J, and J, such that J,:J» is an 
are which contains and is separated by h, J,:J.: G* =h, I, does not inter- 
sect any element of the interval ah of G and I, does not intersect any element 
of the interval hb. A simple closed curve which is an element of @ is a 


* Received October 5, 1953. 
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K-curve provided it either separates two points of G* in £ or is an endele- 
ment of G. A point which is an element of G is a K-point or K-curve of G 
provided it is not the sequential limiting set of any sequence of simple closed 
curves of G. 


Definition 3. If nm is a natural number and H is a closed subcollection 
of ares of G, H™ is the collection of all elements g of H such that if « isa 
positive number then every open interval of G containing g contains an element 
of H which has a subare the distance between whose endpoints is less than ¢ 
but whose diameter is not less than 1/n. 

Clearly H™ is closed. 

The terms “link” and “ simple chain” will be used'in the ordinary 
sense with the added conventions that each link of a simple chain shall be 
an open disc (i.e. the interior of a simple closed curve) and if x and y are 
links of C, €- ¥ (if it exists) is a closed disc (i.e. the closure of an open disc) 
and (2-7) - (y-y) contains only two points. A simple cyclic chain is a chain, 
C, such that if any link is removed from C the remainder is a simple chain. 
The least upper bound of the diameters of the links of a chain is called its 
link diameter. If C is a simple chain containing three or more links then 
[C] shall denote the chain obtained by omitting the endlinks from C. [C*] 
is the sum of the links of [C], and [C*] is the closure of the sum of the 
links of [C]. If the are or simple closed curve, t, is covered by the simple 
or simple cyclic chain, C, ¢ is said to be straight with respect to C provided 
that if z is a link of C intersecting ¢ the boundary of z does not contain 
three points of ¢ and if it does contain two points of ¢ these points lie in 
different links of C. If ¢ is not covered by C it is said to be straight with 
respect to C provided that ¢- C* is an are whose cut points lie in C*, every 
subare of ¢ lying in C* is straight with respect to C, and if (C*-C*) -¢ and 
the boundary of a link z of C intersect their common part is degenerate and 
x is an endlink of C. The simple or simple cyclic chain K is said to be 
straight with respect to the simple or simple cyclic chain C’ provided it is 
true that there is a subchain L of K such that (1) if « and y are links of K 
and C respectively then z- % is a closed disc if it exists and if z is also a link 
of L then Z is a subset of C*, (2) if y and z are consecutive links of C there 
are not two links of K intersecting z and y-7, and (3) if 2 is a link of K not 
in L, and [C*] are mutually exclusive and if « and L* are mutually 
exclusive so are Z and C*. If the arc or simple closed curve ¢ is covered by 
the simple or simple cyclic chain C, ¢ is straight with respect to C and for 
no proper subchain C’ of C does C’* contain ¢, then C is said to simply cover t. 
(These definitions are modifications of definitions found in [2].) 


t 
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2. Principal imbedding lemma. 


Lemma 1. Let G be an upper semicontinuous collection of compact 
atriodic continuous curves such that (1) G@ is an are with respect to its 
elements, (2) the endelements of G are simple closed curves of which one is 
interior to the other but every non-endelement of G is acyclic, (3) if K ws a 
closed subcollection of arcs of G then for no natural number n is tt true that 
K” = K, and (4) each element of G is a K-curve of G. If H 1s a closed 
totally disconnected subcollection of G containing the endelements of G then 
there is a continuous collection W of simple closed curves such that H 1s 
imbedded in W, W*- G* = H*, and of each pair of elements of W, one ts 


interior to the other. 


Proof. For the purposes of this lemma only, Definition 3 will be applied 
to closed subcollections K of G which contain a or b. For each natural 
number n there is a well-ordered sequence %, such that (1) the first element 
of a is H, (2) if the element A of a, has a successor the first element of %, 
following A is A“), (3) if a’, is a subsequence of ¢, which has no last element 
then the first element of «, following every element of 2’, is the common part 
of the elements of @’,, and (4) a has a last element, K, and K™ does not 
exist. 

If A is an element of a, and A™ exists, A™ is a closed proper sub- 
collection of A. Consequently 2, is countable. Let T,, denote the collection 
of all elements of «,. Then the collection SI; is countable and its elements 
can be arranged in a simple sequence Aj, As, 

Let a and b denote the endelements of G, suppose that a is exterior to b, 
and let I denote the set of points interior to a and exterior to b. An J set 
shall be an interval of G which meets H in a set which is both open and 
closed in H. 

If h is an element of H-a-b let C,(h) denote a simple chain with link 
diameter less than 1 simply covering h such that (1) 6;*(h) is? a subset of I 
and (2) if h is not in A, then A,* and @,*(h) are mutually exclusive. If h 
is a or b let C,(h) denote a simple cyclic chain with link diameter less than 1 
simply covering h such that if h is not in A, then A,* and @,*(h) are 
mutually exclusive. 

Each element h of H is contained in an J set each of whose elements lies 
in C,*(h). Consequently there exist a finite subcollection H, of H and a 


* For typographical convenience, C,*(h) will denote the closure of (,*(h). 
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finite collection S, of non-intersecting J sets covering H such that each element 
of H, belongs to some I set of S;, each I set of S, contains one and only one 
element of H,, and each continuum of the J set of S, containing the element 
h of H, is a subset of C,*(h). Let G, (h) denote the J set of S, containing 
the element h of H. 

Since every element of H, is a K-curve of G there is a collection W (H,) 
of mutually exclusive simple closed curves such that H, is imbedded in 
W(H,), H.i* = G*- W*(H,), and of each pair of elements of W(H,), one 
is interior to the other. If h is an element of H-a-b, let K,(h) denote a 
simple chain with link diameter less than 1 simply covering the closure of 
W(h)-h such that K,*(h) is a subset of J and if g is an element of H,-h 
then [K,*(h)] and G* + [K,*(g)] are mutually exclusive. If h is a or } 
let K,(h) be identical with C,(h). 

If h is an element of H — H,, let C2(h) denote a simple chain with link 
diameter less than 1/2 simply covering h such that (1) if A is not in 4; 
(j=1,2) then C.*(h) and A*; are mutually exclusive and (2) if h is in 
the J set G,(g) of S, then (i) if z is a link of C.(h) there is a link of C,(9) 
containing ~ but there are not three links of C,(g) whose closures intersect 7 
and (ii) if g’ is an element of H,—g then C.*(h) does not intersect 
[K.*(g)] + W*(Hi) + + 

If h and h’ are elements of H, let L.(h) be a simple cyclic chain with 
link diameter less than 1/2 simply covering W(h) such that (1) L.(h) is 
straight with respect to [Ki(h)], (2) £.*(h) and L,*(h’) are mutually 
exclusive, (3) if x is a link of Z.(h) there is a link of C,(h) or K,(h) 
which contains Z but neither C,(h) nor K,(h) contains three links whose 
closures intersect Z, (4) if h is not in A; (j=1,2), Z.*(h) and A;* are 
mutually exclusive and (5) if A is in H,—a-b there is a simple subchain 
C2(h) of L2(h) which simply covers h and has the property that no link of 
L.(h) not in C2(h) has a closure which intersects G*. Let C.(h) and 
K2(h) be L.(h) if h is a or b. Otherwise let K.(h) be the simple chain 
whose links are the links of Z.(h) not in C,(h). 

Each element h of H is contained in an J set each of whose elements 
lies in C,*(h). Consequently there exist a finite subcollection H, of H and a 
finite collection, S., of non-intersecting J sets covering H such that (1) H, is 
a subcollection of H., (2) each element of H, is an element of some J set of 
S. and each element of S, is a subcollection of some J set of S, and contains 
one and only one element of H», (3) each continuum of the J set of S. con- 
taining the element h of H, is a subset of O2*(h). Let G.(h) denote the / 


set of 8. containing the element h of H,. 
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There is a collection W(H-) of mutually exclusive simple closed curves 
which contains W(H,) such that (1) H. is imbedded in W(H2) and 
* = G*-W*(H.), (2) if h is an element of H, and the J set G,(h’) 
of S, then W(h) is straight with respect to [K.i(h’)] and intersects each 
link of C,(h’) and K,(h’), and (3) of each pair of elements of W(H,.), one 
is interior to the other. For each element h of H, let Ni,(h) be the number 
of ares x of W*(H.-G,(h)) for which there is a link ¢ of C,(h) such that 
zis a subset of c and is irreducible between the closures of two non-intersecting 
links of C,(h). Let Ni,(W) be the sum of the N,,(h) for h in H,. Then 
W(H.) is to have the further property that if W’(H.) is any other collection 
satisfying properties (1) and (2) above then N,,(W’) = Niui(W). 

If h is an element of H,—H, and the J set G,(h’) of S, let K2(h) 
denote a simple chain with link diameter less than 1/2 simply covering the 
closure of W(h) —h such that (1) if 2 is a link of K.(h) there is a link 
of C,(h’) or K,(h’) containing « but neither of these chains contains three 
links whose closures intersect @, (2) [K2(h)] is straight with respect to 
[K,(h’)] and (3) if g is an element of H.—h, [K.*(h)] does not inter- 
sect W*(H,) + G* + [K.*(g) ]. 

If we continue in this fashion it can be shown that for each natura] 
number 7? there exist collections H;, W(H;), and S; and that for each element 
h of H; there are an J set G;(h) and simple chains C;(h) and K;(h) and 
that these collections and chains satisfy the following conditions. 


[1] H;,, is a finite subcollection of H containing H;-+ a-+ b and it is 
imbedded in W(H;,,), a collection of mutually exclusive simple closed curves 
containing W(H;) such that W*(Hi.) - G* =H;j,,* and of each pair of 
elements of W(J/;,,), one is interior to the other. 

[2] Ifh isin Hi, (i) is an J set in the collection it contains 
h, it is subcollection of some J set of S;, each of its elements lies in C;,,*(h), 
and if h is not an element of A; (jSi+1) then Aj;* + ,.,.*%—h and 
are mutually exclusive. 

[3] Si. is the collection of all Gi.(h) for h is Hi, it covers H, it is 
finite, and its elements are mutually exclusive. 

[4] If h is an element of H;,,— Hi. Ci.:(h) denotes a simple chain 
with link diameter less than 1/i + 1 simply covering h such that if h is not 
in A; (j Si +1) then C;,,*(h) and A;* are mutually exclusive and if h is in 
the J set Gi(g) of S; then (i) if x is a link of Cj. (h) there is a link of Ci(g) 
containing @ but there are not three links of (;(y) whose closures intersect 
* and (ii) if g’ is an element of H,—g, Cian*(h) does not intersect 
[Ai*(g)] + + G*(9’) + 
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[5] lf h and h’ are elements of Hj, Lii(h) is a simple cyclic chain 

with link diameter less than 1/i1-+ 1 simply covering W(h) such that 
(i) Lis (h) is straight with respect to [Ki(h)], (ii) Lia*(h) and Lj,.* (I) 
are mutually exclusive, (iii) if 2 is a link of Lj,,(h) there is a link of C;(h) 


or K,(h) containing Z but neither of these chains has three links whose | 


closures intersect 7, (iv) if h is not in A; (7 Si+1) then A;* and Z;,,*(h) 


are mutually exclusive and (v) if h is not a or b there is a simple subchain | 
Cis(h) of Li,:(h) which simply covers h and has the property that no link | 
of Lis,h) not in C;,,(h) has a closure which intersects G*. Let Ci..(h) and | 


Kin(h) be Lis(h) if h is a or otherwise let Ki,,(h) denote the simple 
chain whose links are the links of Lj,,(h) not in Cj,,(h). 

[6] If h is in (Hi: — Hi) - Wh) is straight with 
respect to [K;(h;)] and intersects each link of C;(hj) and K;(h;). For each 


element h of H; let Ni;(h) be the number of ares z of W* (Hi: Gj(h)) for | 


which there exists a link ¢ of C;(h) such that x is a subset of c¢ and is 
irreducible between the closures of two non-intersecting links of C;(h) and 
let Ni;(W) be the sum of the Ni;(h) for h in H;. The collection W(H;j.:) 
is to have the further property that if W’(H;.,) is any other collection having 
properties [1] and [6] then NV;,(W) = N;,(W’) and if Ni(W) = Ni;(W’) 
then Nij(W) S Nij(W’). 

If is an element of H;,, — H; and the I set G,(h’) of Si, Kis.(h) 
denotes a simple chain with link diameter less than 1/; +1 simply covering 
the closure of W(h) —h such that (i) if z is a link of K;,,;(h) there is a link 
of Ci(h’) or K;i(h’) containing ~ but neither of these chains has three links 


whose closures intersect Z, (ii) [Ki.1(%)] is straight with respect to [K;(h’)] | 


and (iii) if g is an element of Hi,;—h, [Ki..*(h)] does not intersect 


W* (Hi) + G* + [Kin*(g)]- 
If h is an element of H there is only one sequence G(h), (91, 92, Jas***) 


such that for each 7, gi,, is an J set of S;,, containing h and is a subcollection 


of g; If, for each i, 9: = Gi(hi) let k; denote K;*(hi) + C;*(hi) and let 


K(h) denote the sequence k,,k.,k;,---. Since for each i, hi. +h isa | 


subset of k;, h is a subset of the common part of the elements of K(h). If 
h is an element of 3H;. W(h) is the common part of the elements of K(h); 
if h is an element of H —3H;j, let W(h) denote the common part of the 
elements of K(h). It readily follows from the construction that the collec- 


tion W of all W(h)’s is a continuous collection of mutually exclusive continua | 


no one of which intersects two elements of G. It remains to be shown that 
every element of W is a simple closed curve. 
Suppose h is an element of H — 3H; and that for some natural number 
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n and each positive number « there is a natural number n(e) such that Ane, 
contains an are of diameter not less than 1/n but the distance between whose 
endpoints is less than «. The are h is thus an element of H®™. Suppose that 
there is an element of %» which does not contain h. Let A be the first such 
element of a. Clearly A has an immediate predecessor, B, B contains h, 
and B”) =A. The collection B is an element of the sequence Aj, ; 
suppose B= Aj. If h; (j 2%) is not an element of B, g; does not intersect B. 
Since g; and B contain h this is impossible. Therefore if 7 = 1, B contains h;; 
hence h is an element of B™. Consequently every element of o» contains h. 
In particular, the last element, K, of a contains h and a repetition of the 
above argument shows that all but a finite number of the h,’s belong to K. 
This implies that h lies in K™, which is impossible since there is no K™. 
Thus, if n is a natural number there is a positive number « such that for each 7, 
if h; contains an arc the distance between whose endpoints is less than ¢ then 
the diameter of that arc is less than 1/n. 

Suppose h is an element of H —3H;. It follows from the construction 
of the chains [K,(h,)| and the methods used in the proof of Theorem 1. 
Chapter II of [7] that W(h) —h is topologically equivalent to an arc minus 
its endpoints. Consequently, if W(h) contains a point at which it is not 
locally connected that point is in h. 

Since h is a K-are there are simple closed curves J, and J, with interiors 
I, and I, such that J,- J. is an are containing and separated by h, J,-J2° G* 
=h, I, does not intersect any element of the interval ah of G and J, does not 
intersect any element of hb. 

If W(h) is not locally connected there exist a subare x of h and 4 
sequence of mutually exclusive subares of W(h) —h converging to x. Let 35 
denote the diameter of x. It follows from the arguments of the third para- 
graph above that there is a natural number N such that 1/N < 8/100 and 
(1) for each 7, if h; contains an arc the distance between whose endpoints is 
less than 1/N then that arc has diameter less than 8/100 and (2) Cy* (hy) 
is a subset of J, + 7, + J,-J. minus its boundary. There exist a subare y 
of « the distance between whose endpoints is 8, a sequence 8 of mutually 
exclusive subares of W(h) —h converging to y, and a minimal subchain C 
of Cy(hy) covering y such that B* + y is a subset of C*. If A and B are 
the endlinks of C the distance between them exceeds .988. Let D,, D2, and D, 
be consecutive links of C in that order no one of which intersects A + B and 
let D denote D, — (D, + D3) - Do. 

It follows from the definition of NV that if 7 > N there do not exist two 
subares of h; both irreducible between .J and B. Therefore, since the sequential 
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limiting set of W(h,), W(hz),- - - contains all the elements of 8 and all but ! 
a finite number of the elements of 8 intersect both A and B there exist for 
each natural number g and each positive number « a neighborhood U of h 
lying in an e-neighborhood of h and a natural number WV such that if} = M, | 
h, lies in U and W(hx) — hx contains qg ares each lying in C* —C*-U and 
irreducible between A and B. In particular, there are a natural number p 
and a collection A of three subares of W(h,) — h» such that (1) each element 
of A is a subset of D irreducible between D, and D3, (2) A* lies either in J, 
or in J,, (3) no two elements of A are separated in D by h-+d,-+ J2, any | 
element of W(Hy) or any element of W(H;), (j Sp), not in Gy(hy) and [ 
(4) no component of D—A* whose boundary contains two arcs of 2 inter- 
sects C,* (hp). 

The ares of A can be denoted by w;, us, and us in such a way that (1) 
in D, us separates u, minus its endpoints from uw; minus its endpoints and 
(2) if £, and F, denote the interior and exterior of W(h,) respectively then 
for i= 1,2 there is an are ¢; lying in D irreducible between u; and u;,, and 


lying except for its endpoints in Fj. 

Suppose that h is exterior to W(h,) and A* lies in J,; since w. and u; | 
lie in J, and are not separated by J; + J. in D, the sum of the elements of 
the interval ah of G does not separate u. from u; in D. Since ¢, lies except 
for its endpoints in the exterior of W(h,) the sum of the elements of the 
interval h,b does not separate u. from uz; in D. Since ¢, does not intersect 
C,*(hp,) and the elements of the interval hh, all lie in C,*(hp), us and u, 
are not separated in D by the sum of the elements of hhy. No element of 
W (Hy) separates u. from uz in D. Suppose there is a natural number m | 
between V and p+ 1 such that some element of W(H,,) separates u. from 4; 
in D or contains uw. + us, but no element of W(H,»_,) separates u. from u; | 
in D. There is an element w of W(H,,) containing two arcs w, and w, lying f 
in D such that each is irreducible between D, and D., neither lies entirely in 
G*, each separates uw. from uz in D and no element of W(Hy _,) separates w 
from w2. It follows from (3) above that w belongs to W(Gy(Hy)). There 
is an are ¢ lying in D which is irreducible between w, and w. and does not 
intersect G* or any element of W(H,,), other than w. This contradicts 
condition [6] required of W(H,,) since the existence of ¢ clearly shows that | 
(W) is not a minimum. 

If h is interior to W(h,) and A* lies in J, then the sum of the elements | 
of the interval ah does not separate uw. from u; in D and since the elements hi 


of the interval hb are interior to W(h,), the sum of these elements does not 
separate wu. from u; in D. It follows from an argument similar to the above 


that this leads to a contradiction. as does the assumption that A* lies in J. | 


t 
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it Fl Thus, in any case, the assumption that W(h) is not locally connected 

yr leads to a contradiction. It follows from the construction of C;(h;) and 

h K,(h;) and the argument used to show that W(h) —A is topologically equi- 

rt valent to an arc minus its endpoints that W(h) contains no triod and is a 

d simple closed curve. The proof of Lemma 1 is now complete. 

Pf 

it Be 3. Lemmas concerning upper semicontinuous collections. 

: Lemma 2. Suppose the upper semicontinuous collection G of mutuaily 

yy exclusive continua fills up the compact continuum M and ws a continuous 

d curve with respect to its elements. If D is an open dise (the interior of a 

simple closed curve) and N,, is a sequence of distinct components 
of M-D converging to a continuum N which intersects D but no element of 

°°, then N is a subset of some element of G. 

d 

i Proof. Suppose the lemma is false. With the aid of Theorem 8, 

d Chapter II of [7] and some arguments used in the proof of Theorem 2 of 


[6] it can be shown that there are a subcontinuum A’ of NV intersecting two 


elements of G, a simple closed curve J whose interior, 7, intersects K, and 2 


r ‘ such that for each natural number i, Kj,, separates K; from K in W-/. The 
‘ sequential limiting vet, Z, of the sequence J-K,,J- K.,- - - does not contain 
+ : three points. Consequently there is an element 2 of G which intersects K - J 
i but not There ts a connected open subcollection U of containing such 
‘ that (* and LZ do not intersect. Since U* is open and intersects K it inter- 
' sects all but a finite number of the AK; and since U* is connected and the A; 
; are components of ./-/, it intersects all but a finite number of the elements 
of This implies that (* intersects L. Thus the supposi- 
5 4 tion that Lemma 2 is false leads to a contradiction. 
g 
n LemMa 3. // G is an upper semicontinuous collection of mutually 
, exclusive arcs which is an arc with respect to its elements then if G is 
ey continuous? at an element g which contains a point at which G* is no 
t | locally connected then there is a natural number n such that g belongs to G, 
Ss 
.o Proof. It follows from Lemma 2, the proof of Theorem 2 of [6] and 
' Theorem 8, Chapter II, of [7] that there is an open dise D intersecting g 
‘ ' such that g- D is an are and that there is a sequence a of mutually exclusive 


*The upper semicontinuous collection @ of compact closed point sets filling up a 
compact closed point set is continuous at the element g provided that if a sequence of 
elements of G@ converges to a subset of g it converges to g. 
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‘components of D- G* converging to g- D none of whose elements intersects g. 
Let P be a point of g- D and let n be a natural number such that the distance 
from P to D—D exceeds 1/n. If € is a positive number let U be a domain 
containing P such that U is a subset of D with diameter less than « and the 
distance from U to D—D exceeds 1/n. Since G is continuous at g there is 
an interval H of G containing g as an endlement such that H* contains 


infinitely many elements of « and every element of H intersects U. Let kt 
be an element of « which intersects U and lies in H*. Since no connected 
subset of H*-D intersects both k and g, H*- D is the sum of two mutually 
separated point sets 1 and N, L containing g and N containing ’. The 
point set U- H* is the sum of the mutually separated point'sets L- ( - H* 
and N-U-H*. Since H* is connected there is an element h of H which 
intersects both L-U-H* and N-U-H*. The subare h’ of h irreducible 
between L-U-H* and N-U-H* is not a subset of D since L-H*-D 
and N-H*-D are mutually separated. Thus h’ intersects D—D and U 
and consequently its diameter exceeds 1/n. Since its endpoints le in U the 
distance between them is less than e. Since every segment of G containing g 


contains such an element, h, g belongs to G™. 


Lemma 4. If G satisfies the conditions required of it in Lemma 1 then 
if for the natural number n, K, 1s the set of all elements x of G for which 
there is a sequence a(x) of elements of G converging to a subset y of x such 
that x —y has a component whose diameter exceeds 1/n then Ky is nowhere 
dense in G. 


Proof. Suppose there is an interval V of acyclic elements of G in 
which K, is dense. Since for each natural number 1, G@ is closed and 
totally disconnected there is an element x, of K,- V which is not in G® and 
there is a subinterval V, of V containing z, as a non-endelement but con- 
taining no element of G™. Let y, be the sequential limiting set of «(2,). 
There is a domain D, containing y, each point of which is closer to y, than 1 
and such that there is a component z, of 7, — D,- x, whose diameter exceeds 
1/n. Since G is an upper semicontinuous collection and D contains all but 
a finite number of the elements of «(z,) there is an element x. of K,-V; 
which lies in D, but is not an element of G®) and there is a subinterval V. 
of V, containing x, as a non-endelement such that each of its elements lies in 
D, but none of them belongs to G°). Let y. be the sequential limiting set of 
a(x.). There is a domain D, containing y2 each point of which, is closer to 
y, than 1/2 and such that D, is a subset of D, and there is a component 2: 
of x,— D.: x. whose diameter exceeds 1/n. 


an 


rr, 
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In general, for each natural number 1 there are continua 2, Yi, and 4%, 
a domain D;, and an interval V; of V such that (1) 2; is an element of K,- V; 
but not an endelement of V;, (2) V; and G@ are mutually exclusive and Vi,, 
is a subset of Vi, (3) Viu* is a subset of Dj, (4) yi 1s the sequential limiting 
set of a(x;) and Dj,, is a domain containing yi. each point of which is closer 
tO Yi, than 1/1-+ 1 and such that Dj,, lies in D; and there is a component 
Of Visi — Vis Whose diameter exceeds 1/n. 

Let z be the sequential limiting set of an infinite subsequence 2’, 22’, ° 
of z:,22,: °°, and suppose z is a subset of the element x of G. Since for 
each i, is in Vi, x belongs to For each i, (j > 1), and con- 
sequently z; lies in D; and thus each point of z; is within distance 1/1 of yi. 
Therefore if P and Q are points of z and Dp and Dg are domains containing 
P and Q respectively such that Dp and Dg are mutually exclusive there is a 
natural number NV such that if 7 > N, Dp and Dg both intersect the mutually 
exclusive subcontinua y; and z/ of the are 2/. This is sufficient to show that 
for some natural number m, a is in G™, which is impossible. Thus the 
supposition that K, is not nowhere dense leads to a contradiction. 


Lemma 5. Jf W is a continuous collection of mutually exclusive simple 
closed curves filling up a compact continuous curve M such that W 1s an arc 
with respect to its elements and H is an upper semicontinuous collection * 
of mutually exclusive arcs which is imbedded in a closed subcollection of W 
then for no positive integer n is it true that H™ = H. 


Proof. If a and c¢ are elements of H the notation (ac) will be used to 
denote the collection consisting of a, c, and all elements b of H such that 
W(b) is between W(a) and W(c). Suppose that for some natural number n, 
H™ =H. Since every element of H is thus a limit element of H, W(H) 
is a perfect subcollection of W. 

Supopse that the element z of H is a limit element of I(x, y) and that 
there is a finite sequence 2,,22,° *-,2» of mutually exclusive subares of 
W(x) each having diameter greater than 1/4n. There is a simple cyclic 
chain, C, simply covering W(a) such that if i and j are integers not exceeding 
p and s; and s; are links of C intersecting 2; and 2; respectively then 5, and §, 
are mutually exclusive. For each 7 let C; denote the minimal subchain of C 
simply covering 2;. Since x is in H™ there is a subare 2’ of x whose diameter 
exceeds 1/4n such that (1) 2’ does not intersect three arcs of the sequence 
U,%2,° * +, and (2) if K is the simple subchain of C simply covering 2” 


*If, in Definition 3, the words “open interval” be replaced by “open set” the 
definition of H will apply here. 


| 
t 
| 


804 MARY-ELIZABETH TIAMSTROM. 


then if z is an element of I(2,y)—z there is an element of [(x,2) con- 


taining two mutually exclusive ares of diameter exceeding 1/4n which lie 
in K* and intersect both endlinks of A’. Since W is a continuous collection 
of simple closed curves it follows from [1] and [3] that W* is an annulus 
whose boundary simple closed curves are the endelements of W. Therefore 
there is an element z of J(2,y) such that (1) 2 is a limit element of [(2, 2), 
(2) 2 contains two mutually exclusive ares of diameter exceeding 1/4n which 
lie in KX and intersect both endlinks of K and (3) if 2’ is an element of J (.r, z) 
then W(z’) contains the elements of a sequence 22,° of mutually 
exclusive ares such that for each 7, z; lies in C;* and intersects both endlinks 
of C;. Since z itself has property (3), property (2) and the fact that W(z) 
is a simple closed curve can be used to show that W(z) contains p-+-1 
mutually exclusive arcs with diameter exceeding 1/4n. Since z is a limit 
element of I(z,z) the above argument can be repeated. 

Since each element of H is in H™ there is an element x) of H con- 
taining two mutually exclusive arcs of diameter exceeding 1/4n. It follows 
from this and the argument of the above paragraph that there is a sequence 
I,. Io, - + of intervals of W such that for each 7, J;,, is a subinterval of J, 
and each element of J; contains i mutually exclusive ares of diameter exceeding 
1/4n. There is an element w of W lying in the common part of the J;. Con- 
sequently, for each 7, w contains i mutualiy exclusive ares of diameter exceeding 
1/4n. This is impossible, since w is a simple closed curve. Hence the 
supposition that H = H™ leads to a contradiction. 


4. Imbedding theorems. 


THEOREM 1. Under the conditions of Lemma 1 there is a continuous 
collection W of simple closed curves such that G is imbedded in W. 


Indication of proof. Let H denote 3G +3K;. It follows from 
Lemmas 3 and 4 that if g is an element of G-H, G is continuous at g and G* 
is locally connected at every point of g. Since, for each i, G@ and K; are 
closed and totally disconnected, H is the sum of the elements of a countable 
collection H’ of mutually exclusive closed and totally disconnected sub- 
collections of G. Thus there is a sequence A:H,,H>,--~- of mutually 
exclusive closed and totally disconnected subcollections of G the sum of whose 
elements is dense in G and such that every element of H’ is an element of A 
and every element of A not in H’ consists of only one element ef G. Suppose 
that H, contains the endelements of G. 

There is a continuous collection W, of simple closed curves such that 


| 
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H, is imbedded in W,, W,*- G* = H,*, and of each pair of elements of W,, 


one is interior to the other. Suppose U is a component of J—IJ- W,* and 
is bounded by the simple closed curves w and w’ (where J is as defined in 
the proof of Lemma 1). Let uw, and w, be mutually exclusive arcs lying in 
[ — G* and irreducible between w and w’ such that if P is a point of the 
component of U—U-: (wu: + uz) containing G*-U then the distance from 
P to G*-U is less than 1. Let V be the component of U—U- (u, + wz) 
which does not intersect G*. There is a continuous collection Z of mutually 
exclusive ares which fills up V such that (1) Z is an are with respect to its 
elements, (2) the endelements, wu and wu’, of Z lie on w and w” respectively and 
(3) each are which is an element of Z is irreducible between w, and wo. 

If g and g’ are the elements of G imbedded in w and w’ respectively there 
is a reversibly continuous transformation, 7, of the interval gg’ of G onto 
Z such that T(g) =u and T(g’) =w’. Let T be the common part of the 
interval gg’ and the collection H, + H.. There is a continuous collection 
Wy of mutually exclusive simple closed curves such that (1) T and T(T) 
are imbedded in Wy, (2) Wy*- G* and Wy*-Z* =[T(T)]*, (3) w 
and w’ belong to W,, and (4) if h is an element of T, h and T(h) are 
imbedded in the same element of Wy. Let W. be the collection such that h 
is an element of it if and only if either there is a compoent U of J —I- W,* 
such that h belongs to Wy or h belongs to W,. 

Suppose U’ is a component of U—U-W,* and is bounded by the 
simple closed curves w, and w,’.. The point set U’-u, and U’- wus, are arcs 
irreducible between w, and w,’. Let wu,’ and ws’ be mutually exclusive ares 
lying in U’—U’-(G*-+V) irreducible between w, and w,’ such that 
(1) if P is a point of the component of U’ —U”’-: (u,’ + us’) containing 
G*-U’ then the distance from P to G*-U’ is less than 1/2 and (2) 
u,’ + ue.” separates U’- V from G*-U’ in U’. Let V’ be the component of 
U’—U’- (u’ + u.’) which contains U’-V. There is a continuous collec- 
tion Z’ of mutually exclusive ares which fills up V’ such that (1) Z’ is an 
are with respect to its elements, (2) the endelements, v and v’, of Z lie on 
w, and w,’, respectively, (3) each are of Z’ is irreducible between wu,’ and wu.’ 
and (4) the collection of elements of Z lying in V’ is imbedded in Z’. 

If g, and g,’ are the elements of G imbedded in w, and w,’ there is a 
reversibly continuous transformation 7” of the interval gg,’ of G into Z’ 
such that 7’(gi) =v, T’(g:’) =v’ and if h is in the interval g,g,’, T(h) 
is imbedded in 7’(h). Let I, be the common part of g,g,’ and H, + H. + Hs. 
There is a continuous collection Wy of mutually exclusive simple closed 
curves such that (1) I, and 7’(T,) are imbedded in Wy (2) Wyo-*- G* =T,* 
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and Wy,*- Z’* — [T’(T,)]*, (3) w, and w,’ belong to Wy, and (4) if h is 
an element of I, h and T(h) are imbedded in the same element of Wy. 
Let W; be the collection such that h is an element of it if and only if either 
there is a component U’ of J—J-W,.* such that h belongs to Wo: or h 
belongs to 

A continuation of the process indicated above will yield the desired 
collection W, which consists of SW; plus its limit elements. That if h is an 
element of G— SH; then W(h) is a simple closed curve follows from the 
construction of Z, Z’, etc. and the fact that G@ is continuous at h and G* is 


locally connected at h. 


TuroreM 2. If G is an upper semicontinuous collection of mutually 
exclusive atriodic continuous curves which is an are with respect to tts 
elements then there exists a continuous collection W of mutually exclusive 
simple closed curves such that G is imbedded in W if and only if (1) each 
element of G is a K-curve of G, (2) if H is a closed subcollection of arcs of 
G then for no natural number n ts tt true that H™ = H and (3) of each pair 
of simple closed curves which are elements of G, one is interior to the other. 


Proof. Suppose there is such a collection, W. It follows from Anderson’s 
result, [1] and Cohen’s result, [3], which state that W* is an annulus whose 
boundary simple closed curves are the endelements of W that conditions (1) 
and (3) hold true. The truth of condition (2) follows from Lemma 5. 

Suppose G satisfies conditions (1), (2), and (3). It follows from (1) 
and (3) that the collection, H, of all simple closed curves of G is closed and 
continuous. If g and g’ are simple closed curves which are elements of & 
such that no element of G between them is a simple closed curve, the elements 
of the interval gg’ satisfy the conditions of Theorem 1. That the conditions 
of 2 are sufficient follows from this and the fact that H is continuous. 


Lemna 6.-If G is an upper semicontinuous collection of compact atriodic 
continuous curves such that (1) G ts an arc with respect to tts elements, 
(2) the endelements of G are simple closed curves J, and J, with exteriors 
E, and E, such that J, and J, lie in E, and Ey respectively and contain 
points P, and P, which are respectively accessible from the same point of 
E,-E,— E,: E.- G*, (3) if K is a closed subcollection of arcs of G then for 
no natural number n is it true that K™ = K and (4) each element of Gs 
a K-curve of G, then there is a continuous collection W of simple closed curves 
and arcs such that G is imbedded in W and W* is a continuous eurve. 


Proof. It follows from (2) and (4) that every non-endelement of / is 
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acyclic and that there is a simple closed curve J whose interior J contains 
G* — (J, + J2)-J such that J-J, and J: J. are points P, and P, respec- 
tively. The curve J is the sum of two non-overlapping arcs P,XP, and 
P,X’Ps. There are reversibly continuous transformations T and T” throwing 
onto P,XP, and P,X’P. respectively such that T(J:) = P:—=T’(J,) and 
T(J.) =P2=T’(J.). The methods used in the proofs of Lemma 1 and 
Theorem 1 may now be used to construct the collection W, where W* =—J + I 
minus the sum of the interiors of J, and J, and if h is a non-endelement of 
G then W(h) is an are with endpoints T(h) and T’(h). 


TuroreM 3. Jf G is an upper semicontinuous collection of compact 
utriodic continuous curves which is an arc with respect to its elements then 
there is a continuous collection W of mutually exclusive compact atriodic 
continuous curves such that G is imbedded in W and W* is a continuous curve 
if and only if every non-endelement of G is a K-curve of G and tf H 1s a 
closed subcollection of arcs of G then for no natural number n is tt true that 
H® H, 


Proof. Suppose @ is imbedded in a collection W which satisfies the 
required conditions. If the non-endelement g of G is imbedded in the simple 
closed curve W(g) of W, it follows from the methods used in the proof of 
Lemma 1 of [4] that since W* is a continuous curve, W(g) separates two 
points of W* in F. Thus g is a K-curve of G. It follows from this property 
of the simple closed curves of W that no acyclic non-endelement of W is the 
sequential limiting set of a sequence of simple closed curves of W. Thus 
every point of G which is a point and non-endelement of W is a K-curve of G. 
If A is a non-endelement of G and W(h) is an are, W(h) is a non-endelement 
of an interval, H, of ares of W. It follows from Theorem 3 of [5] that H* 
is a closed dise and thus that h is a K-curve of G. In any case, every non- 
endelement of G is a K-curve. It follows from an argument similar to that 
in the proof of Lemma 5 that if H is a closed subcollection of ares of G then 
for no natural number n is it true that H™ —H. Thus the conditions of 
Theorem 3 are necessary. 

Suppose G satisfies the conditions required of it in the statement of 
Theorem 3. Since every non-endelement of G is a K-curve then any three 
simple closed curves of @ can be labeled ¢,, #2, and tz in such a way that either 
t, is interior to ¢, and ¢, is interior to ¢, or ¢, is interior to ¢, and t. and ft, 
are mutually exterior. Let a, and a2 be the endelements of G and suppose 
first that a, is a simple closed curve having a, in its interior. The continuum 
a, is thus interior to every simple closed curve of G—a,. From the above 
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remarks and the fact that the non-endelements of G are K-curves it follows 
that there is a sequence J;,J2,J3,- + of mutually exclusive simple closed 
curves converging to a, such that J, is a, and for each 7, a, is interior ty 
interior to J; and an element of G. That the collection 
W exists now follows by applying Theorem 2 to the intervals of G whose end- 
elements are J;- G* and G*. 

Suppose a, is not interior to a, and a, is not interior to a,. From the 
remarks at the beginning of the previous paragraph concerning the simple 
closed curves of G and the fact that every non-endelement of G is a K-curve 
it follows that there exist mutually exterior simple closed curves, A, and K, 
with exteriors F, and such that a; is interior to Ay, (v= 1,2), 
and K.- G* are acyclic elements of G and every element of G between them 
is acyclic and A, and K, contain points P, and P. respectively which are 
accessible from the same point of 2,-#,—G*-H,-H,. Lemma 6 can be 
applied to the interval of G whose endelements are K,-G* and K.- G* and 
the conclusion of the foregoing paragraph can be applied to the remaining 
two intervals of G. This completes the proof of the sufficiency of the con- 
ditions of Theorem 3. 


THEOREM 4. If G is an upper semicontinuous collection of mutually 
exclusive, compact, atriodic continuous curves which is an are with respect 
to its eleménts then there exists a continuous collection W of mutually 
exclusive continua filling up a continuous curve in sueh a way that G 1s 
imbedded in W and every element of G is topologically equivalent to the 
element of W in which it is imbedded if and only if (1) each non-endelement 
of Gis a K-curve of G, (2) tf H is a closed subcollection of arcs of G then 
for no natural number n is it true that H™ = H, (8) the collection of all 
degenerate elements of G is closed and (4) if the simple closed curve or 
endelement, h, of G is a subset of the boundary of a complementary domain, 
D, of the sum of the simple closed curves and endelements of G then some 
point of h is accessible from D. 


Proof. Suppose such a collection, W, exists. The truth of (3) is obvious 
and it follows from arguments similar to those used in the proof of Theorem 
3 that (1) and (2) are true. Since (1) is true, any three simple closed 
curves of G can be labeled ¢;, ¢2, ¢; in such a way that either ¢, is interior to 
t, and ¢, is interior to ¢, or ¢, is interior to ¢. and ¢. and ¢, are mutually 
exterior. Suppose X is the collection of simple closed curves and endelements 
of G. The collection K is closed. Suppose D is a complementary domain of 
K* and let x, and xr, be the elements of K whose sum is the boundary of D. 
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(If D is bounded by only one element of K, it does not intersect G* and thus 
(4) is obvious.) If x, (t—=1,2), is acyclic let xi = 2;; otherwise let 
denote x; plus its complementary domain which does not interest D. Let 
U be the collection consisting of 7,’, r.’ and all elements of W between 2, 
and x». The collection U is an arc with respect to its elements, its non- 
end-elements are acyclic and neither of its endelements separates the plane. 
Clearly U* is not the entire plane, it has only one complementary domain, V, 
and the boundary B(V), of V is a compact continuous curve. It follows from 
Theorem 3 of [5] that every non-endelement of U intersects B(V) and thus 
that x, and a, interesect B(V). Therefore x, and z, each contain a point 
accessible from V. But V is a subset of D— D- G*, a connected domain. 
Therefore 2, and x, contain points accessible from D—D-G*. Thus the 
condition of Theorem 4 is necessary. 

Suppose G@ is a collection satisfying all the conditions required of it in 
the statement of Theorem 4. Let ZL be the collection of degenerate elements 
of G. If K+ LG it follows from (3) and the fact that every non- 
endelement of G is a K-curve of G and consequently that K is continuous, 
(s¢e the proof of Theorem 3 and Theorem II of [1]) that @ is continuous 
and G* is a continuous curve. 

If G contains an are not in K+ JL there is a segment of ares of G 
whose endelements belong to K+ L. The collection of all such segments 


is countable. Denote them by U,,U.,U3,: -- and denote their respective 
endelements by x; and y; From condition (4) it follows that there is a 
sequence J,,J.,J3,- - - of simple closed curves every convergent subsequence 


of which converges to a subset of an element of G such that for each 1%, 
(1) Ji-G* is a subset of a+ y;, Ji-v; and J;:y; are degenerate and 
«4—ax,-J; (if it exists) is not separated in the plane from y;— y- J, (if it 
exists) by Ji, (2) U;* is a subset of the complementary domain of J; + a + y% 
whose boundary is J; +2,+y; and (8) if pAi and J, and J; intersect, 
their common part is a point, U; and U, have a common endelement and 
Jy-J; is a subset of this endelement. Because of the continuity of K + L, 
if the methods of Lemma 6 are now applied to the intervals xy; of G where 
vi, yi, Ji, and D; replace J;, Jo, J, and F,- FH, respectively of Lemma 6, the 
proof that the conditions of hTeorem 4 are sufficient is now complete. 


GOUCHER COLLEGE. 


(| 
VU 
| 
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MARY-ELIZABETH HAMSTROM. 
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ON ALGEBRAIC HOMOGENEOUS SPACES.* 


By Morixuni Goro. 


1. Introduction. For the sake of simplicity, we call a connected com- 
pact complex analytic homogeneous manifold a C-space, if the fundamental 
group is finite, where by homogeneity we mean the existence of a transitive 
group of biregular automorphisms. The problems on C-spaces, especially the 
classifications of C-spaces, were studied in detail by H. C. Wang in Closed 
manifolds with homogeneous complex structure, American Journal of Mathe- 
matics, vol. 75 (1953), which will be quoted as [Wang] in this paper; 
Wang defined a C-space as a simply connected C-space in our definition. 
Here we shall consider a special class of C-spaces. 

Let & be a connected compact Lie group, and U1 a closed subgroup of &. 
It is well-known that all maximal connected abelian subgroups of & are 
conjugate tori, and the Euler characteristic x(/11) of the coset space R/U 
does not vanish if and only if 11 contains a maximal torus T. We shall call 
a coset space R/U semi-simple if y(R/U) 0. Since any C-space is homeo- 
morphic to a coset space of a connected compact Lie group, we obtain a class 
of semi-simple C-spaces. 

Let us denote by Proj. (s) the complex projective group of dimension s, 
and by Grass. Gs) the Grassmann manifold, the manifold of n dimensional 
linear subspaces in the r dimensional complex vector space. By using Pliicker 
coordinates, Grass. ) is imbedded naturally in Proj. ((;) —1) as an 
algebraic subvariety. A complex analytic manifold V of complex dimension 
n is called algebraic (an algebraic variety) if there exists a biregular mapping 
from V onto an algebraic subvariety V, of a certain dimensional projective 
space. And if, in particular, we can find V, which is birationally equivalent 
to Proj. (n), we call V rational (a rational variety). 

The principal result in this paper is the following: 


THEOREM. A C-space is a rational variety if it is semi-simple, (and 
vice versa’), 


* Received January 4, 1954. 
* A. Borel in “Sur les espace homogénes Kihleriens compacts” (to appear) proved 
that a O-space is semi-simple if it is Kihlerian. 
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Let V be a semi-simple C-space of complex dimension n, homeomorphic 
to R/U, where & is a connected compact Lie group of dimension r. Then, 
more precisely, we can give a biregular mapping from V into Grass. (n)? in 
such a way that the image is a subvariety composed of an n-dimensional affine 


space and a proper subvariety. 

As a special case of the above theorem, the coset space R/T of a connected 
compact Lie group modulo a maximal torus has a complex structure of a 
rational variety. 

Now we shall call a semi-simple C-space simple if it is homeomorphic 
to a coset space of a compact simple Lie group. Then, as to the structure 
of semi-simple C-spaces, we have: ' 


I. A semi-simple C-space ts a direct product of a certain number of 
simple C-spaces. 


The classification of simple C-spaces was done almost completely in 
[Wang]. Also as a direct consequence of our theorem and [Wang], we have: 


II. A C-space is a complex analytic fibre bundle with a torus as the 
fibre over a rational variety. 


The subjects of this paper have connections with those of recent works 
of A. Borel and A. Weil (to appear). In particular, the fact that a semi- 
simple C-space is simply connected (§ 3, Proposition 1), was pointed out by 
Borel. The proof given here is a by-product of our methods and is different 
from the proof of Borel. 


2. Lemmas. 1. Let G&(r) be the group of all non-singular matrices 
of degree r over the field of complex numbers, and let M(r) and Y(r) be 
subgroups of G(r) composed of all elements of the form 


0 


and 


0 * | 
respectively. We call an element g = (gi;) in G&(r) singular if some of 
the determinants 


Ji1s 


Jo1 Jo22 J21 Joo Jo3 | Jor Joo *Qor-1 


va 
to 
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vanishes, and denote by © the set of all singular elements. Then it is easy 
to see that a non-singular element g of G&(r) can be decomposed into the 
product of an element n(g) in Nr) and p(g) in ¥(r). In fact, we have: 


Lemma 1. GL(r)—S = R(r)P(r),? Nr) N P(r) = 1 (the unit matrix). 
The decomposition g = n(g)p(g) is unique, and the coefficients of n(g) and 
p(g) are rational functions of gij. 


2. Let G be a complex semi-simple Lie algebra of dimension r. After 
H. Weyl we can choose a complex basis hi, €a; @-ay* * Cu, C-w Of G, 
so that [hi, hj] (1,7 
[ S Ashi, ea] = ea, (ar) = 
4=1 


i=1 
U 
= ahi, [ea, = Nageasg. 
g=1 


Here a = are real vectors, called root vectors, and each 
Nog is a real number, which does not vanish if «+ @ is a root vector. If 
a+ B is not a root vector, we may put Nog = easg = 0. 

We define a linear order in the 7 dimensional real vector space R#, so 
that R' becomes an ordered abelian group. We put namely 


if Ay = = Nin, > for some 7. Then we can decompose G 
into a direct module sum of two subalgebras defined by 


Cea, 


(2 >0,---,w>0) where C denotes the field of complex numbers. 
Let x be an element of G. We denote by adv the matrix representing 
the mapping adx- y= [x,y], ye G, with respect to the basis 


The correspondence x —> adz gives a faithful representation of G. We use a 
Latin capital * for a complex subalgebra of G@, and use the notation ad* 
= {adz|xe*}, and the corresponding German capital denotes the matric 
Lie group generated by ad*. For example, & is the Lie group, generated by 
adG, namely the adjoint group of G. 


* This decomposition was used by I. Gelfand in his theory of unitary representations 
of Lie groups. 
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LemMA 2. =N=— {n(g)| geG—Ss} 
GN P(r) = B= {p(g)| ge G6 —S} 
NF, NAP —1. 


Proof. Since G=N+P, NNO P=0, we have adG =adN + adP, 
adN M adP —0. Therefore we can take nuclei V; and P; of the Lie groups 
NM and , integrated by adN and adP respectively, so that G, = IM; 
Bi —1) is a nucleus of where by a nucleus we mean a neighborhood 
of the identity. Here, by our choice of the basis the matrices in adN and 
adP are of the forms 


0 0) * | 
0 * 


and 


0) 
respectively, whence we have J C MN(r), B C P(r). 

Now it is known that © coincides with the identity component of the 
group of all automorphisms of G, and therefore algebraic. In fact we can 
find polynomials ¢,, ¢2,- - with respect to r* variables &; (1,7 
so that 


$1 = $2 =" imply (&;) G. 
For g in &;, n(g) is contained in 9%. Hence 
o:(n(g)) = d2(n(g)) =" = 0, ge Gr. | 


On the other hand, since © M © is a proper subvariety of G, the complex 
dimension of © M © is less than r, whence © — © is a connected manifold. 
Therefore the equalities must be valid everywhere in G—©. Hence 
n(g) namely {n(g)| ge G—S} CGN The converse implica- 
tion is obvious. Thus we get GN M(r) = {n(g)| ge G6—S}. 

The left hand side is a closed subgroup, which is a connected set by the 
equality. Differentiating the closed connected subgroup N(r) N &, we get 
clearly the Lie algebra adN. 

By the same argument, we have the relations on §. 


3. Let A be the set of root vectors of G, and M a linear subspace of 
R', and let © be the set of root vectors in M:@—AMM. Denote by o1 
the complementary set of 6 in A:A—@=61l, Then, ® contains — PB with 
8, and if 8 and f’ are in ® and if 8 + B’ is a root vector, then B + f’e®. 
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Moreover, that ae 61 and Be® implies a+ Be, if it is a root vector. 
We may assume that 


where + a(t) e@1, + B(j) ©. In fact any basis of R’ such that the last 
part of which constitutes a basis of Mf gives a desired ordering of R’. 
Consider the following subalgebras of G: 


P, = + + Ceatn)s 


= Ceaay +: Ce-an), Ne = +° + Ce_pem) 
P=P,+ N=N,+N,, H=P+N.. 


Then we have G—=WN,+ H, H=0, and [H,H|] =P,+ [P.4+ 
P,+ Ns], where [P.+ No, is a semi-simple subalgebra corre- 
sponding to the root system ®, and P, the radical (maximal solvable ideal) 
of [H, H]. 

Since P, is an ideal of H, we have adH-P, C P;. Conversely, let x be 


an element of G so that ada: P,; C 


n 

P. We may putr=—y+ > ge —a(j), 
j=1 

ye H. Because @aci) | is a non-vanishing linear combination of h’s 


and = €ile-aciys + (a linear combination of ¢’s), & 
must be zero, namely ce H. Thus we have H = {a| ada: P, C P3}. 

Let $(r) be the subgroup of GL(r) defined by S(r) = {g| P: == Pi}. 
Then the Lie algebra of (7) M © must be adH. Therefore the Lie sub- 
group § corresponding to ad H is the identity component of (7) N ©. Now 
let n(S) be the normalizer of § in G, and g an element of n(). Since P, 
is the radical of [H,H], $8, is a characteristic subgroup of §. Therefore 
namely g:P,—P,, whence ge G(r). Since § is normal in 
§(r) N G, we have n(S) = S(r) NG. 


4. Since N—=WN,-+ and N, is an ideal of N, we have 2 
where N; (i = 1,2) is a Lie subgroup corresponding to ad N;. On the other 
hand, by Lemma 2 we know that G—G—=—N. Therefore we have 6 — S 
= PB = Mi (MB) C MG. 

Now, let N(r) be the matric Lie algebra of M(r). For z in N(r), 
xpr—1+a¢+2?/2!+--- is a polynomial of x of degree less than r. 


816 MORIKUNI GOTO. 


Conversely let g be in M(r). Then logg = (g—1)—(g—1)?/2+--. 
is also a polynomial of g degree less than r, and we have exp(logg) =g, 
log (expz) =z. Thus the mapping 


N(r) 32 = (aj) ~expxr—1-+ (ys) EN(r), t>j 


is one-to-one and onto, where each yj; is given as a polynomial of z’s and 
vice versa. 

Let g be an element in 9%, M n(). Since g is contained in Jt,, we have 
g = exp(adz), ze N,, where adz is a polynomial of g. Since g-P, =P,, 
we have ad x-P; whence te H. Since N,N H we have 
namely g=1. Thus we have n($) —1. 

Suppose that n($) is not connected, and let us decompose n(§) into 
cosets modulo $:n($) which yields 2.n($) 
+99: -+---, where + denotes the direct sum of sets. Since 9,9 con- 
tains an open set, so does 9ti9g:. On the other hand 


Ni CS’ = §— NG GN GS, 


where & M © has a lower complex dimension than ©. This is a contradiction. 
Therefore § must coincide with n(§). 


LemMsa 3. WNH—1, 


where ©’ 1s contained in a closed set of lower complex dimension. 


5. Lemma 4. The complex coset space @X(r)/S(r) is biregularly 
equivalent to Grass. ( n)* 


Proof. Consider the mapping f defined by 


39 = (9192° Gr) > Lg: A A+ A gn] © Grass. (n)? 
where 


Gri grr 


and [g:/\---A gn] denotes the element of Grass. (7)? spanned by 


915 « ) 
This mapping is onto, and two elements g and g’ = (9/19’2: - -g’r) 0 
to the same point if and only if 


namély g’=gh he&(r). Hence f is one-to-one on the left coset space 
®L(r)/H(r), and is clearly biregular. 
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3. Propositions. We shall start from a C-space V of complex dimension 
n, and first mention the results in [Wang] in a way which is convenient 
for us: 

There exists a connected complex semi-simple Lie group G of biregular 
automorphisms transitive on V. Therefore V is biregularly equivalent to a 
coset space @/R, where & is a closed complex subgroup of @. Let G be the 
Lie algebra of @, and L that of 2. Then we can find a subalgebra H which 
satisfies the conditions in § 2. 3., so that GD H DL and [H, H] = [L, L}. 
Here H coincides with Z if and only if V is semi-simple. 

Let D be the (finite) center of G. Then the factor group G/®D is 
isomorphic to the adjoint group &. Let § be the closed Lie subgroup 
corresponding to H in &. Since n($) = by Lemma 3, and since ®D is 
known to be contained in §, the normalizer n(9) of § coincides with §, 
and G/§ is biregularly equivalent to G/S. 

Let V be semi-simple. Then since 2 coincides with § locally, n($) = 3) 
implies 2—§. On the other hand G/§ is known to be simply connected. 
Thus we have: 


PROPOSITION 1. A semi-simple C-space is simply connected. (See the 
end of § 1.) 


Now we shall prove the following: 


PROPOSITION 2. A semi-simple C-space V is a rational variety, con- 
taining a complex affine space V, such that V—V, is a subvariety. More- 
over there exists a biregular mapping from V into a projective space which 
is rational in terms of a system of affine coordinates on Vo. 


Proof. We may assume V = @/S. By Lemma 4 we have a biregular 
mapping f from GX(r)/H(r) onto Grass. Since GN HS(r) by 
Lemma 3, f is one-to-one on @/§. Therefore f induces a one-to-one biregular 
imbedding of V into Grass. (1): 

Now by the decomposition 6 — SG’ = 9,4, V contains an open sub- 
manifold Vj =f(N,). Let &,---,& be the parameters of the Lie algebra 
N,. Then the Pliicker coordinates of f(Vo) are polynomials of é,- - -. én, 
say fo(€),- - +, fw(é), where VN = ( n) —1. Let us construct the irreducible 
variety with the generic point (fi(€)). It is clear that the variety coincides 
with f(V). 

Thus we obtained a rational mapping from Proj. (n) onto f(V). Since 
the algebraic correspondence given by £— (fi(€)) is one-to-one for generic 
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points é, the mapping is birational, whence f(V) is a rational variety. The 
image of the o-hyperplane in Proj. (nm) coincides with f(V) —f(V»). 


Proposition 3. <A C-space is a complex analytic fibre bundle over a 
rational variety with a torus as the fibre. 


Proof. We retain the above notations. Let V =@/& be a C-space, 
and g an element of &. Since P, (see § 2.3.) is the radical of the commutator 
subalgebra [H, H] =[L,L], the corresponding Lie subgroup is a charac- 
teristic subgroup of the identity component of Z. Therefore g*P.ig = X,, 
namely gP, = P,, whence g is contained in &. Thus, 2 is a subgroup of §. 
Since @ contains the commutator subgroup of §, & is a normal subgroup of 
§. and the factor group $/2 is abelian. Hence $/2 is considered as a 
complex toral group operating on V = @/, and we have: V /$/2= G/§, 
which establishes the proposition. 


PRoposITION 4. A semi-simple C-space ts a direct product of simple 


C-spaces. 


Proof. Let @/ be a semi-simple C-space, and let G=G,-+---+ 4; 
be the decomposition of the Lie algebra @ into simple ideals. Then clearly 
H=H,+---+H,, where H; is a subalgebra of G; (t1—1,2,---,8.) 
Let @; be the adjoint group of G;, and §; the Lie subgroup of G; corresponding 
to ad H;. By Proposition 1, § is connected. Since G=Gi X:-- XG, 


These propositions clearly imply the statements mentioned in § 1. 
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NUMBER OF POINTS OF VARIETIES IN FINITE FIELDS.* 


By Serce Lane and ANDRE WEIL. 


1. Statement and proof of the main theorem. Let V be a variety in 
a projective space P”. If V has dimension r and degree d, we shall indicate 
this by writing V = V,,,2,._ A point of V can be represented by homogeneous 
coordinates (x) = 2»), and we shall also say that (x) is a point 
of V. We use & to denote a finite field and g to denote the number of 
elements in k. Let V be defined over &. A point (2) of V is said to be 
rational over k, or more briefly to be in k, if its coordinate ratios 2/2; (%j~ 0) 
are in k. We intend to give an estimate for the number of points of V which 
are in k. Denoting this number by N, we prove 


THEOREM 1. There exists a constant A(n,d,r) depending only on 
n,d,r such that for any variety V=Vna,r defined over a finite field k 


we have 
| qr | S 8g" + A(n, d, gr 


where 8 = (d —1)(d—2). 


If r—1, i.e. if V is a curve, the above diophantine statement is a 
reformulation of the Riemann hypothesis in function fields [2]. Indeed, let 
V, be a non singular projective curve, defined over &, birationally equivalent 
to V over k. Let g be the genus of V and V,. Then we know that the 
number NV, of rational points of V, satisfies |1-+q—N, |S 2gq3. Each 
non singular point of V corresponds to exactly one point of V,. The number 
of singular points on V is bounded by a constant depending only on d, and 
each singular point of V corresponds at most to d points on V,;. Hence 


|\V,—N|SA where A is a constant depending only on d. This shows 
that | Using the fact that g S 4(d —1)(d — 2), 
we see that Theorem 1 is true for r—1. 

The proof of Theorem 1 will now be carried out by induction on the 
dimension r, and the arguments will be of an elementary nature. 


We begin by two lemmas and use throughout the terminology of [1], 


* Received November 30, 1953. 
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except that a bunch of varieties will be called an algebraic set. We shall say 
that an algebraic set is defined over & if it is defined by algebraic equations 
fa = 0 with coefficients in &. If the algebraic set is in projective space, then 
the f, can be chosen to be forms, and the algebraic set is identified with the 
rays of zeros of these forms. For example, a hyperplane H in P” is defined 
by an equation }wX;—0. If wiek, then H is defined over k, or as we 
shall say more briefly, H is in k. 

Let Z be a positive cycle in P", of degree d > 0, and dimension r. We 
can then write Z = >} a,V; as a formal sum of distinct varieties Vj, with 
integer coefficients a; >0. We then have d—degZ = a;- deg Vi, and 
dim V;=—~r for each 7. By a point of Z we shall mean a point of any of the 
varieties V;, The set of points of Z is an algebraic set denoted by | Z|. If 
Z is rational over k, then | Z| is defined over k. We let Nz be the number 
of points of Z in k. 


Lemma 1. There exists a constant A,(n,d,r) depending only on n, d,r 
such that for any positive cycle Z in P", of degree d, dimension r, and rational 
over k, we have A,q’. 


Proof. By induction on the dimension r. Suppose first r—0. Then 
| Z | consists of at most d points, and the lemma is trivial. Assume now r= 1. 
If we express Z as a sum of prime rational cycles over k, then there will be 
at most d such cycles in the sum. Hence it suffices to prove the result for a 
prime rational cycle. Let (Xo,- --,Xn) be the variables of P". For some 
pair of indices, say (0,1), Z (which we assume to be prime rational over k) 
intersects properly the hyperplanes H¢ defined by the equations X, — éX, = 0 
where € ranges over k, and the hyperplane H defined by X¥,—0. Every 
point of Z in & is contained in one of the cycles Z-H¢ or Z-H, each of 
which has degree d by Bezout’s Theorem ([{1], App. I) and is of dimen- 
sion r—1. According to the induction hypothesis, there exists a constant 
B(n, d,r—1) such that Nz.n~ and B(n,d,r—1)q**. As we have 
exactly ¢-++ 1 cycles, we see that there are at most B(g + 1)q"* points in Z, 
and this is certainly = 2Bg". The constant 2B is what we are looking for. 

Our second lemma will be concerned with the following situation. Let 
V be a variety in P”, of dimension r = 2, and not contained in a hyperplane. 
Let P’ be the projective space dual to P”, and let (w) = (wo,* + +, Wn) be 
the homogeneous coordinates of a point in P’. Let H, be the hyperplane 
defined by }wi:X;—0. Then the set R of points (w) of P” such that the 
cycle V-H, is not a variety is an algebraic set, defined over every field of 
definition of V, and RP’. (Cf. [4], §1, 6). The hyperplanes H such 
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that V- H is not a variety may therefore be viewed as forming an algebraic 
set R in P’. Lemma 2 gives an estimate for the number of such hyperplanes 
defined over a finite field k. 


Lemma 2. There exists a constant A.(n,d,r) depending only on n, d,r 
having the following property. If V=Vna,r is any variety defined over k, 
not contained in a hyperplane, and if Np is the number of hyperplanes H 
in k such that V-H is not a variety, then Np S Az2g"". 


Proof. Let Z be a positive cycle of dimension r and degree d in P*. 
Let F(U,- - +, U)) be the associated form of Z. It is of degree d in each 


set of variables U,- - -, U, and its coefficients are called the Chow coordi- 
nates of Z. These are viewed as the homogeneous coordinates of a point in 
a projective space P”. The coordinates (c) =(¢o,* of a cycle which 


is not a variety form an algebraic set C in P™” (cf. [4], §1, 6), and C is 
defined over the prime field. We let ¢,—0 be a finite system of equations 
with coefficients in the prime field, defining C. The forms ¢, depend only 
on n, r, and d. 

Let now V = V»,a,, be a variety defined over &, not contained in a hyper- 
plane. Let (w) be a point of P’, and let c(w) = (¢o(w),- - +, cu(w)) be the 
Chow coordinates of the cycle V-Hy. If F(U,---,U) is the associated 
form of V, then it can be verified that the associated form of V- Hy is 
F(w,U,--+-,U-%). Hence each c;(w) is a form of degree d in the 
quantities (w), with coefficients in k. 

Let # be the algebraic set of points (w) in P’ such that V- H, is not a 
variety. This algebraic set R is defined over /, and we have (w) «fF if and 
only if c(w) «C, i.e. if and only if da(c(w)) = da(Co(w),- +, =0 
for all a If (W) =(W.,:--,W,) are the variables of P’, at least one of 
the polynomials ¢,(c(W)) does not vanish identically. (If they all did, 
R would be the entire space P’, which is not true.) The degree of this 
¢a(c(W)) depends only on the degree of ¢q and on the degree d of each 
c(W). Since the ¢, depend only on n, d, and r we have proved that R is 
contained in a hypersurface ¢a(c(W)) —0 with coefficients in k, whose 
degree is a constant e depending only on n, d, r. This hypersurface defines 
a positive cycle Z in P”, of degree e and dimension n —1, rational over k. 
By Lemma 1, we have Np SNzS A,(n,e,n—1)q"* where A, is the 
constant of Lemma 1. Putting A.(n,d,r) =A,(n,e,n—1), we see that 
Lemma 2 is proved. 


Remark. All the constants depending on n, d, r which we are finding 
in this paper can easily be estimated, except for e in the proof of the preceding 
lemma, 
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We are now in a position to prove the main theorem. 

Let V = Vnar be defined over k. We may assume that V is not con- 
tained in a hyperplane. Indeed, if V is contained in a hyperplane, or in a 
linear variety, then it is easily verified that the smallest linear variety L, 
containing V is also defined over k. (The linear forms defining Ly are 
precisely the linear forms contained in the prime homogeneous ideal in k[X] 
defining V.) J, can then be viewed as a projective space of dimension no S n, 
and it would suffice to prove our theorem for V as a subvariety of Lp. 

If V is not contained in a hyperplane, then the cycle V-H is defined 
for every hyperplane H, and Lemma 2 will be applicable. 

We consider the pairs ((x), H) consisting of a point (x) of V in & and 
a hyperplane H in P’ defined by an equation } wjX;—0 with wiek and 
passing through (2), i.e. such that H(z) = Swyz,;—0. We shall count 
these pairs in two ways. 

We shall denote by «n,, the number of points in & in the projective space 
P", This is given by kai = 9". This is 
also the number of hyperplanes H of P’ in k. Similarly, the number of 
hyperplanes in & passing through a given point (x) in k& is xy. From these 
remarks, it follows that the number of pairs ((z),H) is given by the 
following two equal numbers: 


Nkn Ny. H 
H 


where N is the number of points of V in k, Ny. is the number of points of 
the cycle V-H in k, and the sum is taken over all hyperplanes H in fk. 
Solving for NV, and using the same notation as in Lemma 2, we see that NV 
is equal to 

1 1 

Kn HéR Kn HeR 
where £ is the algebraic set of hyperplanes H such that V - H is not a variety. 

If Nz is the number of terms in the second sum, i.e. the number of 
hyperplanes H in R, the number of terms in the first sum is kp, — Ne. 
We shall now estimate the two sums, and begin with the first one. 

For H¢R, V-H is a variety of degree d, dimension r—1, defined 
over k. It follows from the induction hypothesis that for H¢R, we have 
| | S + A,q"* where A; = A(n, d,r—1) is the constant 
determined inductively, depending only on n, d, and r—i1. Summing over 
H ¢R, and dividing by x, we get " 


1 n+1 — WN > 


Kn HeR 
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where Q = kn" (kn — We note that kn +1, and S kp. 
Also we know by Lemma 2 that Np A.q™*S A2kn. This shows that 
|S (1+ and that — Ne) S kn SQ+1. From 
this we conclude immediately that 


1 


Kn HéR 
with a suitable A,. 


We now turn to our second sum, } Ny.yg, and we shall prove that it 
HeR 


can be absorbed by the error term involving q’* only. 

If He R, then V-H isa cycle of degree d, dimension 7 — 1, and rational 
over k. By Lemma 1, it follows that Ny.y S Aig’ for a suitable constant 
A,. By Lemma 2, we know that Np S A2q"*. Hence 

1 n-1 


Kn HeR Ky 


A, S Asqr- 


because qo <= kn, as we have already remarked. 

Combining (1), (2), and (3) we see that there exists a constant A. 
(the desired constant A(n,d,7r)) such that | NV | S8q"4+ This 
concludes the proof of the theorem. 


2. Corollaries and applications. We list some of the immediate corol- 
laries to Theorem 1. 
In the first place, our theorem is of the nature of an asymptotic result. 
If k, is the extension of degree v over k, and N™ is the number of points of 
V in k, then 
NO = + for 


In particular, if v—>oo then N®- eo also. We shall now see that this 
asymptotic behavior can be stated more generally for abstract varieties. 

Let V = Vna,r be a variety defined over /, and let F' be a frontier on V 
(i.e. an algebraic set properly contained in V) also defined over k. Let 
N> » = Ny — Ns, be the number of points of V—F which are in k,. 
Then from Lemma 1 we see immediately that | V™y_p— Ny | S Bar 
where B is a constant depending on V and on F. 

If V and V’ are two varieties of dimension r in P”, defined over hk, and 
if T is a birational correspondence between them, also defined over k, then 
there exist frontiers F and F’ defined over & such that 7 is everywhere 
biregular in V-—F and V’—F’. T gives a 1—1 correspondence between 
the points of V — F and those of V’— F’, so = Nyp-_p. 
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Let now V = (Vq, Fa, T'ga) be an abstract variety, represented by varieties 
V, in projective space, with frontiers Fa, and birational correspondences Tg,, 
all of which are defined over k. If P is a point of V, then P has repre- 
sentatives P, in some of the Vg. If one of these P, is rational over k, then 
so are all the other representatives, and hence in this case we may say that 
P is rational over k. 

If K is a function field of dimension r over k, then we shall say that an 
abstract variety V is a model of K if V is defined over k, and if K —k(P) 
where P is a generic point of V over k. The following two corollaries of 
Theorem 1 are immediate consequences of the preceding discussion. 


CoroLuary 1. Let K be a function field of dimension r over a finite 
constant field k. If V is any abstract model of K, and N® the number of 
points of V in k,, then N® —q’", mod O(q’"-»), is a birational invariant 
(i. e. does not depend on the choice of the model V of K). 


Corotuary 2. Let K/k be as in Corollary 1. There exists a constant y 
such that for any abstract model V of K, we have 


| Nm — | + 


where B is a constant depending on V. If K has a projective model of 
degree d, then we can take y= (d—1)(d—?2). 


The smallest constant y that can be selected in Corollary 2 is obviously 
a birational invariant, and we shall return to discuss it below. For the 
moment, we note that an abstract model of the function field K can be 
projective or affine, and can also be chosen without singularities, because the 
singular points on a variety defined over /: are a frontier F, properly con- 
tained in V and defined over &. Hence our corollaries show that as v increases 
indefinitely, the number of simple points on V in k, also increases indefinitely. 


Corotuary 3. Let V be an abstract variety defined over k. Let m bea 
given integer. There exists a zero dimensional positive cycle on V, rational 
over k, and of degree prime to m. Hence there exist cycles on V which are 
zero dimensional, rational over k, and of degree 1. 


Proof. From the above remark and the corollaries, there exist non 
singular points of V in the field &, when y is sufficiently large and prime to m. 

The preceding result generalizes the well known theorem that a function 
field in one variable over a finite constant field always has a divisor of degree 
1. In this vein, we can state the following invariant result: 


VARIETIES IN FINITE FIELDS. 825 


Corotuary 4. Let K be a function field over a finite constant field k. 
Let t1,° * *,%n be a finite set of non zero elements of K. For all v sufficiently 
large, there exists a place p of K which is k,-valued, and such that $(%) 0, 
#o for any t. 


Proof. After enlarging our set (x) by a finite number of elements if 
necessary, we may assume that k(x) = K, and that for each 1, z;* appears 
in the set (x). Under these assumptions it suffices to prove that there exists 
a place ¢ of K which is &,-valued and such that $(2;) is finite for all x. 
We view (x) as the generic point of an affine variety V defined over k. By 
Corollary 1, for all sufficiently large v, there exists a point (2’) of V in k, 
which is simple on V. If 0 is the specialization ring of (2’) in K and p the 
maximal prime ideal, then o/p is isomorphic to the subfield k’ = k(z’) of k,. 
(The isomorphism is induced by the map f(x) >f(2’), fek[X]). Since 
k is perfect, the completion of o is isomorphic to a power series ring 
W{t} = k’{t,,- - -,¢-} and K can be identified with a subfield of the quotient 
field 2 of k’{t}. It is clear that there exists a k’-valued place @ of O mapping 
each ¢t; on 0. (For instance, view © as a subfield of the repeated power series 
field T = k’{t,}{t.} --- {t,}. By mapping successively each t; (7 =7,:++,1) 
on 0, we get a k’-valued place ¢ of T.) Since the quantities 2, are in o for 
all 1, the place ¢ is finite on the 2;. The restriction of ¢ to K gives the 
desired place. 

We now turn to the applications of Corollaries 1 and 2 to the zeta 
function. Let V be an abstract variety of dimension r, defined over k. We 
associate with V the analytic function Z(U) defined by 


dlog Z(U) /dU = 3 NOU, 


If V is complete and without multiple points, then Z(U) is the zeta function 
of V [5] and Weil’s conjectures state among other things that Z(U) is a 
rational function, satisfying a functional equation of the usual type. If V 
has singular points, or is not complete, then the function Z(U) is probably 
still a rational function, but it need not satisfy a functional equation. 

We shall not at this point go into a detailed analysis of the relation 
between the functions Z(U) associated with arbitrary varieties, and those 
associated with complete varieties without multiple points, because the results 
which we shall now prove as a consequence of Corollary 2 will be applicable 
to any variety. 


Corottary 5. Let V be an abstract variety of dimension r, defined 
over k. Then the associated function Z(U) has no pole or zero in the circle 
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|U | <q", and has exactly one pole of order 1 in the circle |U | < qr, 
namely at U=q". 


Proof. According to Corollary 2, we can write N® = q’" + A,qr-d, 
where A, is a constant, bounded in absolute value by a fixed constant A. 
This gives 


dlog Z(U)/dU = +S 
y=0 y=1 


The first of these series is the geometric series and defines the function 


1/(1— q'U). Since —q’/(1— =dlog (1— gU) /dU we see that 
dlog [Z2(U) (1— q'U)]/dU = A,(q3U)”. Hence Z(U) (1— has 


no pole or zero in the circle | U | < q--» because the latter series converges 
in that circle. Furthermore q-" is a pole, and is the only pole inside the 
circle, as was to be shown. 


Corottary 6. Let V and V’ be two varieties which are birationally 
equivalent, and let Z(U) and Z’(U) be the associated functions. Then the 
function Z(U)/Z’(U) has no zero or pole in the circle|U| <q». Hence 
the zeros and poles of Z(U) in this circle are birational invariants. 


Proof. It is clear from Corollaries 1 and 2 that the series for 
dlog Z(U) /dU —d log Z’(U)/dU = d log (Z(U) /Z’(U)) /dU 


converges in the circle | U | < qg-*-», and hence that Z(U)/Z’(U) has no 
zero or pole in the circle. . 

Concerning the behavior of Z(U) for | U | = q--» we can only make 
the following conjectural statements, which complement the conjectures of 
Weil [5]. 

Let P be the Picard variety of V. P is an abelian variety, whose dimen- 
sion g is the irregularity of V. Let. be the endomorphism induced on P by 
the automorphism (2-27) of the universal domain, and let 


F(U) (1 — 


be the characteristic polynomial of « as defined in [3], §67. Then the 
series for 


dlog Z(U) /dU + dlog (1— q’'U)/dU — dlog F(U)/dU 


converges in the circle | U | < q-*-, and the function z(U)\(1— q’U)/F(U) 
has no zero and no pole inside this circle, and has at least one pole on the 
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circle |U | = q--», provided of course that V is complete and without 
multiple points. 

Furthermore, if K/k is a function field as in Corollary 2, and if y is 
the smallest constant for which Corollary 2 holds, then y= 2g. (This gives 
the explicit determination of y as a birational invariant.) 

If V is a curve, then P = J is the Jacobian variety, and g is the genus 
of the curve. For this case, all the preceding statements are well known and 
are contained in [2], except for the last, i.e. that y = 2g is actually the best 
possible constant for the inequality | — q’ | Syq’/? + B of Corollary 2. 
As to this, it obviously suffices to prove it in the case that V is complete and 
without multiple points, and then it amounts to proving that y = 2g is the 
best possible constant for the inequality | 1+ q”?—N®™|=yq’*. Actually, 
this last statement is implicit in [2], as the following argument shows. 


We have ([2], § 22, p. 71) + g?—N | = Sav and 
i=1 


3, so that we can write (27i6;) where 6; is a real 


number between 0 and 1. Hence we ean write 
. 
= gS exp(2riv6;). 
j=1 


29 
For infinitely many v, >} exp(2ziv6;) will be as close to 2g as we please, and 
i=1 


hence 2g is indeed the best possible constant. 
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REMARKS ON PERIODIC SEQUENCES AND THE RIEMANN 
ZETA-FUNCTION.* 


By C. R. Putnam. 


1. This paper is essentially a continuation of [1]. The following 
theorem concerning the imaginary part of the Riemann Zeta-Function {(z), 
z=o-+ it, in the half-plane o > 0, will be proved: 


(*) Let d0 denote a fixed real number and, for a fixed o > 0, let 
ti, to, ts, - - denote any sequence of t-values for which 


(1) &(£(o4+ ttn)) — A(E(o + ind)) = 0(1), as 
Then 

(2) limsup| &(E(o + ttn))| > 0, as noo. 

Furthermore, by an application of the van der Corput method for esti- 
mating certain summations, it will be shown that 


(**) If tu —=nd+8n, then relation (1) (hence, by (*), also (2)) 
holds, for <1, tf 

(3) 8, log-? n), as n> 0, 
where M = 2", o=1—m/(M—2), m=3; and, for o=1, if 

(4) 8, = o(log log n/log? n), as no. 

The gist of (*) is that the imaginary part of {(o-+ it) on any fixed 
line ¢ = const. > 0 fails to tend to the limit zero as t—=t,—>00 (or —©), 
whenever the sequence 1,, to, ts,- - - approximates “closely enough ” a periodic 
sequence d,2d,3d,---. As a consequence of (*), one obtains the result 
that the sequence of the consecutive zeros of £(4-+ it) does not contain any 
periodic subsequence d, 2d, 3d,- - - (ef. [1]), nor even, according to (3) for 


M = 8, contain any approximately periodic sequence of the type d+ 4, 
2d + 82,3d + 83,- --, where log? n), as n—00. 


Theorem (**) is a supplement to (*) when a is restricted to } =o S1. 


* Received December 3, 1953; revised January 28, 1954. 
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Moreover, estimates other than those of (3) and (4) could be obtained by 
other means, for instance, by the Weyl-Hardy-Littlewood method; cf. [2], 
pp. 84 ff. 


2. Proof of (*). The proof of (*) will follow as a consequence of the 
results of [1]. Thus, in order to prove (*), it is sufficient to show that if 
(2) does not hold for some d > 0, then 


(5) J “9 (2) cos (dnx)dz = 0(n"), asn—>o 
0 
(which leads to a contradiction), where g(x) = (e*—n)e for lognSz 
<log (n +1) and n—1,2,---, and 
(6) £(z) =2(z—1)7?*—z Jf o> 0. 
0 


(It should be noted that there is no loss of generality in supposing d > 0. 
Moreover, although [1] dealt with the case o = 4, it is clear that any o > 0 
could be allowed.) 


According to (6), for z =o + it, 


J (9 (2) (tx) dz = (0 —1)/((o 1)? + #) — 
0 
A simple calculation shows that 
(2-*E(z)) = (oR (£) + t2(£))/ (0? + #). 
Since £(o + it) = O(t4) as t 00, for any fixed ¢ > 0 (cf., e. g., [2], p. 82), 
it is clear that 
Hence, if (2) fails to hold, so that 
(8) &(L(o + itn) ) —0(1), as 
then (1) and (7%) imply (5). This completes the proof of (*). 
3. Proof of (**). It is supposed that =o < 1, and it will be shown 
that (3) implies (1). Since £(o-+ it) can be expressed as 
f(o + it) = 4+ 0(1), as 
nSt 


(cf., e. g., [2], p. 6%), an application of the mean value theorem of differ- 
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ential calculus shows that, for | 8 | < const. and ¢{—>00, there exist values 7, 
lying between ¢ and ¢ + 8, such that 


(9) + i[t + 8])) —A(L(o + tt)) 


=— logneos (T log n) + 0(1). 


The summation occurring on the right side of (9) is the imaginary part of 

(10) log n, 

nSt 
and so it will be sufficient to estimate the latter. Since the terms n~ logn 
constitute a monotone decreasing sequence (for n sufficiently large), the 
summation (10) is amenable to the van der Corput estimation procedure, 
just as is the summation >} n-*##), It is easily seen that this process yields 
nst 

for (10), when ¢ is large, an estimate which is O(log #) times the estimate 
for £(o + it) ; and hence, in the notations following (3), the summation (10) 
is equal to O(t?/™-*) log? t). In this connection, see [2], pp. 94 ff. Relations 
(3) and (9) now imply (1). 

In case o = 1, methods similar to those described above (cf. [2], p. 97) 
show that (10) is equal to O(log’ t/log log ¢#) and (1) readily follows from 
(4) and (9). This completes the proof of (**). 


PURDUE UNIVERSITY. 
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ON THE ESSENTIAL SPECTRA OF ORDINARY DIFFERENTIAL 
OPERATORS.* 


By Puitie HARTMAN. 


In the differential equation 


(1) + (A+ Q)y=9, 


let A be a real parameter and g = q(t) a real-valued continuous function on 
0<t<o. When (1) is of the limit-point type (in the sense of Weyl [10]), 
let S’ denote its essential spectrum, that is, the set of cluster points A of the 
spectrum of the self-adjoint operator associated with (1) and a homogeneous 
boundary condition at {= 0. The A-set S’ is independent of the particular 
houndary condition determining it ([10], pp. 251-252). This note is con- 
cerned with the essential spectrum of (1) when q satisfies either the condition 
that 


(2) q(t) tends monotonously to «© as t->2, 
or the condition that, for some constant C, 


(3) q(t) <C for 0St <a. 


1. If (2) holds, then (1) can be either of the limit-point or of the 
limit-circle type. Under the assumption (2), a sufficient ([6], p. 297), but 
not necessary ([6], pp. 306-308), condition that (1) be of the limit-point 
type is 


(4) 


In [7], p. 646, there was raised the question whether or not (2) implies that 
(5) S’ is the entire A-axis 


Whenever (1) is of the limit-point type. It was shown ([4], pp. 123-125) 
that the answer is in the negative and, in fact, that S’ can be empty; but 
it was pointed out that there remained the question as to whether or not (2) 
and (4) imply (5), as is the case when q is of “ regular growth ” (for example, 


* Received November 17, 1953. 
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when q is a logarithmico-exponential function ; cf. [9], p. 106). The following 
theorem can be considered a partial answer to this question. 


(*) Let q(t), where OSt<o, be a positive, continuous (non- 
decreasing) function satisfying (2). If, in addition, q(t) satisfies either the 
condition 


(6) q(t) =o0(t?) as 
or the condition 
(7) f dt/q?# =o for some > 1, 


then (1) ts of the limit-point type and (5) holds. 


This theorem is known for the case when » > 1 in (7) is replaced by 
p= 2; [6], p. 303. It will be clear from the proof of (*) that (7) can be 
weakened to any of the following conditions, with » > 1, 


On the other hand, if “some » >1” is replaced by “every »< 1” in (7) 
or (7 bis), then the assertion (5) need not hold. This is seen by a suitable 
modification of the example in [4], pp. 123-125 (thus, if vz =n? is replaced 
by vn =n log? n in that example, then the resulting differential equation (1) 
remains of the limit-point type and the corresponding 8’ is empty; but 
q(t) = O(# log* t), so that the integral in (7) is divergent for every » < 1). 

This theorem does not settle the question whether or not (2) and (4) 
imply (5). It also leaves undecided the related question whether or not (2) 
and the assumption that S’ is not empty imply (5). 

It will be clear from the proof of (*) that if the condition (6) is 
weakened to 


(6 bis) q(t) =C#? for large ¢ (C = Constant), 


then the assertion (5) of (*) is weakened to the statement that there exists 
a positive constant c (depending only on C and not on q) with the property 
that every A-interval [A,A +c] contains at least one point of 9’. 


2. In contrast to (2), the condition (3) implies that (1) is of the 
limit-point ([10], p. 238) and that S’ is bounded from below ([10], pp. 252- 
253). Under the condition (3), S’ can be empty (this is the case, for 
example, if q(t) »~—o, as to; [10], p. 252). The following theorem 
concerning S’, for the case that S’ is not empty, was proved in [8]: 
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(**) Let the continuous function q= q(t), where satisfy 
(3) and let S’ contain the point A=0. Then there exists a constant c 
(depending only on C and not on q) with the property that for all sufficiently 
large X>O0, the interval [A,A-+ cA4] contains at least one point of the 
essential spectrum 8’. 


It will be shown below that this result is the “best possible” in the 
following sense: The length cA} of the interval [A, A + cdA4] cannot be replaced 
by a length which (can depend on q but) is 0(A#), as Ao. This will be 
proved by showing that 

(***) There exist continuous functions q(t) satisfying (3) for which 


the essential spectrum S’ of (1) consists of the points X\=n°, where 
nom °° 


In this case, there exist arbitrarily large values of A for which the 
intervals [A, A + 2A4] contain no points of 8’. 

For the sake of completeness, a simple proof of (**) will also be given 
below. 


3. Proof of (*) when (6) is assumed. That (1) is of the limit-point 
type when (6), hence (4) holds, follows from [6], p. 297. Let N(T,U;A) 
denote the number of zeros on (OS) JT <tSU of a (non-trivial) solution 
y=y(t,A) of (1). The assertion (5) is equivalent to the statement that 
(8) N(T,U;A\ —N(T,U;A) as 
for every « > 0. For, in view of the theorem in [1], the assumption (8) is 
sufficient in order that the closed interval [A, A + «], and is necessary in order 
that the open interval (A,A +), contains a point of S’. 

If A+ q(t) >0 for ¢=T, then a lemma of Wintner and the author 
states that, for U > T, 


U U 
(2) [aN (7,030) — (a+ | | 
[3], p. 658. Hence for fixed 7, the difference in (8) exceeds 
U 
— f{log(A + q(U)) + log(A + « + q(U))} + const. 


. . . U 
This is minorized by (A q)4dt— log(A+¢-+ q(U)) + const. 
JT 


The assumption (6) means that if M is an arbitrary positive constant, then 
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the last expression is, in turn, minorized by Vlog U — log U + Const., if U 
is sufficiently large. This shows that (8) holds for every « > 0 and proves 
(*), when (6) is assumed. 


4, Proof of (*) when (7) is assumed. Without loss of generality, it 
can be supposed that g(0) =1(>0). It will be shown that the assumption 
(7) implies that if U = U(T) is the largest t-value satisfying g(t) = 2q(T), 
so that q(U) =2q(T), then 


U 
(10) lim sup f dt/qi(T) =o. 
T> © 


U 
For if (10) does not hold, then J dt/q3(t) = const. Hence, for the sequence 
T 


of t-values (0O—=) where so that 
q(Tn) = 2", one has 


dt < > f dt/qi where u=Ty,, v = 
0 n=0 u 


Since this contradicts (7), the relation (10) is proved. 
If U is chosen so that g(U) = 2q(T), then the monotony of qg shows 
that (9) becomes 


| (T, U2) —{ (A + q)4dt | S const. 
T 


U 
for large T. Hence the difference in (8) exceeds te f dt/qs — Const. for 
T 


large T and U=U(T). In view of (10), the limit relation (8) holds if 
U =U(T) and T tends to o through some sequence of ¢-values. Hence, 
the Sturm separation theorem shows that (8) holds (if T is fixed and U tends 
continuously to «). This proves (*) under the assumption (7). 


5. Proof of (**). The assertion of (**) will be proved if it is verified 
that there exists a constant c (depending only on (’) such that (8) holds 
for « = cA}, whenever X is sufficiently large. 

In view of the characterization [5] of the least point of 8’, the assump- 
tion that A = 0 is in S’ means that (1) is oscillatory if \ > 0. Let const. > 0 
and let g + const. and C + const. be denoted by g and C, respectively. Then 
(3) holds and (1) is oscillatory for 4 > —const.; in particular, for \=0. 
(Incidentally, this implies C > 0.) — 

Let y = yo(t) #0 be a solution of (1) when AO and let th, ti,° °° 
be the sequence of successive zeros of y(t). On an interval t, << t < tea, 
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the ratio —y(t,A)/yo(t) satisfies the differential equation 
(yo2a”)’ + = 0 and has the same zeros as y(t,A). If a change of 
independent variables ¢ — s, defined by ds = dt/y,”, is made, then the differ- 
ential equation for x becomes d*x/ds* + Ayotx =0. Accordingly, the lemma 
of [3], mentioned in connection with (9), implies that if A>0 and if 
ustSv is a subinterval of t, << t < th, then 


3(v) ev 
| (u, v3a)— f My'ds|Sa+4 f | d log (Ayo*) |. 
s(u) u 


This can be written as | rV(u,v;A) — (v—u)| Sa +f | yo /Yo | dt, 
since ds = dt/yo”. Consequently, 
n(N(u,v3;A + €) —N(u,v3;A)) 2 ((A+ AI) 


| yo’ /Yo | at. 


It will be shown that the numbers vu = ux, v = vz can be chosen so that 
ustSv is a subinterval of t, << ¢ < ty,, and satisfies 


(11) and | ats 


(the point in these inequalities is that the minorant $C-4 and the majorant 7 
are independent of /). 

Let (11) be granted for a moment. Then the last two formula lines 
give 

a(N(u,v3A + —N(u,v3dA)) 2 4C4( (A+ A) — 2a — 2a; 
so that 
(12) N(u,0;A+¢) —N(u, 03a) 22, 


provided that (A + «)3—AI2Z12xrC4. Clearly, there exists a number 
c=c(C) >0 such that this proviso holds if is sufficiently large and 
«= cat; in which case (12) holds. Since there is at least one zero of 
y(t,A-+ €) between every pair of zeros of y(t, A), it follows from (12) that. 
N(0,U;A —N(0,U;A) =k if U = vx; and so (8) holds (with T = 0). 
Consequently (**) will be proved if the estimates (11) are established for 
suitably chosen u, v satisfying th <u<v < thar. 
In the proof of (11), it can be supposed that 


(13) yo(t) > 0 and max yo(t) =1 for th << t < tha; 
in fact, (11) and the differential equation (yo22’)’ + Ayo2x = 0 for x show 
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that, on t << <ty.1, the function y(t) can be replaced by const. y(t), 
where const. can depend on k. By virtue of the lemma of [2], p. 650, it 
follows from (3), (13), and = Yo(tis1) = 0 that 


(14) | yo’ (t)| S 2C4 for Ct < tea. 
Choose u =u, and v =v; in such a way that 

(15) $S y(t) S1 foruX<tSv 
and 

(16) Yo(u) = Yo(v) and y.(w) 


for some w satisfying u< w < v. 


From (14) and (16), =| yo(w) —yo(u) |< "| yo’(t)| dt, hence 


$52C3}(w—u). Similarly, 4=2C4(v—w), and so, by addition, 
1S 2C3(v—u). This is the first inequality in (11). Let [u,v] be replaced 
by a subinterval (to be denoted again by [u,v]) such that v —u= 304, 
so that (15) holds (but (16) need not). By (14) and (15), 


| yo /Yo | dt S —u) = 2. 


Hence, the second inequality in (11) holds. This proves (**). 


6. Proof of (***). There will be constructed a step-function q(t) 
which has discontinuities at t= 7,2z7,--~-, which satisfies (3) and for 
which the essential spectrum of (1) is the sequence of points A = 1, 2?, 37,---. 
The passage from such a q to a continuous g with the same properties causes 
no difficulty; see [4], Section 11, for the treatment of a similar problem. 

By a solution of (1), when q is a step-function, is meant a function 
y=y/(t) possessing, for 0=¢t<o, a continuous first derivative and, at 
every continuity point of qg, a continuous second derivative satisfying (1). 

For k =0,1,-- -, put 


q(t) =0 for < (2k 


(17) 

q(t) =—k for (2k+1)rSt < (2k+ 2)z. 
Then 
(18) q(t) S0; 


so that (1) is non-oscillatory for A 0 (and, correspondingly, no negative 
value of A is in 8’). 


), 
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Let A > 0 be fixed and K = K(A) so large that K—A>O0. Then, for 
{= 2K-z, a solution y= y(t,A) is of the form 
= cos — + By sin —- 
(19) y= ay exp{(k — — (2k + 1)x)} 
+ by exp{— ((k — A(t — (2k + 1)z))}, 


on St < (2k + (2k + St < (2k + respectively. The solu- 
tion is determined for / = (2K + 1), for example, by assigning the constants 


ax, be and determining Bru, Aku, recursively so as to 
saitsfy y(t —0) = y(t +0) and y’(t—0) =y'(t+ 0) at f= (2K 4-1)z, 
(2K + 


At (2k + 1)z, there result the conditions 
(20,) ax cos-+ B, sin, a, — by = — sin + COs), 
where sin = sin A4x and cos = cos Ad; and, at t = (2k + 2)z, the conditions 
= Ay exp* + = (k — exp” — b; exp), 
where exp* = exp((/—A)4r) and exp*=—1/exp". The equations (20;) 
give, as k 
(22) dy = $%, (cos + 0(1)) + 38x(sin + 0(1)). 
On substituting the formulae (21,) for a,, 8; into (22), one has 
(23) = + 0(1)) — 1 — exp**) + 0(| |), 
if 
(24) sin = sin Abr 4 0. 
From (21;,) and the second equation in (20;), 

— = (1 + 0(1) )cos exp*? + 0(| by. |), 


or, in view of (23), 


(25) by = ax (1+ 0(1)) + 0(| |). 
In (23) and (25), the 0(1)-factors depend only on & and 4, and not on 
the numbers ax, bx, dx, °°. Thus, if K = K(X) is sufficiently large and 


the numbers ax and bx, where 0 < bx S 2ax, determine the solution y = y(t), 
then it follows from (23) and (25), since (k —.)3exp*—>o as k +0, that 
|o.| <2 | a; | and that a, fork—K,K-+1,:--. Consequently, (23) 
and (25) give 


(26) ay by and ay ~ — 1 — A) 4 sin exp*. 
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For large k, the two numbers a, and b, are of the same sign; so that 
by the second formula in (19), y= y(t) does not vanish on (2k 
< (2k +2). Furthermore, the first formula in (19) shows that the number 
of zeros on (2k + <t < (2k + 3)ais either or + 1, where 
is the greatest integer not exceeding A4. According to (26), y((2k + 1)z) 
=a, + by and y((2k + = + are of the same or of opposite 
sign according as sin A’x > 0 or sinA’r < 0. Thus the number of zeros on 
(2k + StS (2k +4 3)z, that is, on (2k +2)r<t < (2k +3)z, is of 
the same parity as [A#] and is, therefore, equal to [A’]. Hence N(0,U;,) 
= U[A4]/22 + 0(1), as V0. 

Thus, if neither A+ nor A is the square of an integer, then (8) 
holds (with T—0) if and only if there is an integer n >0 satisfying 
A<n?<A+e. In view of the characterization [1] of the points of 8’, 
the assertion (***) follows. 
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A MAXIMUM PROBLEM IN HARMONIC ANALYSIS.* 


By Epwin Hewitt and IstporeE HirscHMAN, JR.* 


1. Introduction. 1.1 Mise en scéne. The classical theories of Fourier 
series [18] and integrals [15] have undergone a far-reaching and conceivably 
definitive generalization in the theory of harmonic analysis on locally compact. 
Abelian groups. The first step in this generalization was the enunciation in 
1934 of Pontryagin’s duality theorem for compact Abelian groups ([10] and 
[17], pp. 99-109) ; thereafter followed rapidly the general duality theorem, 
Weil’s generalization of the Plancherel theorem and of the Young-Hausdorff 
theorem, and so on ([17], pp. 111-113). A number of the more refined results. 
of harmonic analysis on the circle and the line have nevertheless escaped 
attention until quite recently (see, however, [4], [5], [6], [7], and [12]). 

The present paper is devoted to the extension to general locally compact. 
Abelian groups of a classical theorem of Hardy and Littlewood dealing with 
Fourier series. Let p be a number such that 1 < p < 2, and let f be a function 
in &, on the circle group (equivalently, let f be a measurable function on the 


us 
line of period 27 such that f | f(x) |?de <<. Writing c, for the Fourier 


coefficient f f(x)e-*dz, n =0, +1, + 2,- one has the Hausdorff 
wv 


inequality 
= 

n=- 00 


where p-? + p’1=1. (See for example [18], p. 190.) In 1926, Hardy and 
Littlewood [2] published a memcir in which the possibilities for equality in 
1.1.1 are determined. They show (loc. cit. Theorem 1, p. 167) that equality 
occurs in 1.1.1 if and only if f(x) is equal to ae‘ for some complex number 
a and some integer n. Thus, the functions in 2,(— 7,7) which obviously 
produce equality in 1.1.1 are the only functions which can do so. 

The analogue of 1.1.1 for functions in &,(—, ©) is naturally a bit 
more subtle, since the integral 
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1.1.2 F(x) etude 


is undefined for functions not in 2,(—o, 0). The analogue of Plancherel’s 
theorem, nevertheless, works perfectly for all fe X,(—o, 0)(1<p< 2). 
For every such f, there exists a function f* in 2,.(—o, o) such that 


B 
1.1.3 lim (y) — \"dy = 0 
A>-@ A 


Bow 


and such that 


This theorem is due to E. C. Titchmarsh ([14]; see also [15], pp. 96-105). 
A proof much simpler than those offered by Titchmarsh can be obtained from 
M. Riesz’s convexity theorem (see [13], § 26, p. 484, or [18], p. 316, Theorem 
12. 41). 

Diligent search and inquiry by the writers have disclosed no treatment 
of the case of equality in 1.1.4. It is to be conjectured, of course, that 
equality can occur in 1.1.4 only if f 0, and we shall, in fact, show that 
this is so. We study here the analogue of the theorem of Hardy and Littlewood 
for functions which are defined on a locally compact Abelian group G and 
are in &,(G@) for some p, 1< p< 2. We are able to identify all functions 
for which equality occurs in the analogue of 1.1.1 and 1.1.4. It is a pleasure 
to record our indebtedness to Professor H. S. Zuckerman for a number of 
valuable conversations on the subject matter of this paper; and in particular 
for collaboration in the construction presented in 6. 5. 


1.2 Notation. We use the following symbols throughout the present 
paper. £& denotes the real number system, K the complex number system. 
T the multiplicative group of all complex numbers of absolute value 1, and 
N the set of all integers. For any set X and A C X, ¢u is defined as the 
function with domain X such that ¢4(z) = 1 or 0 as ve A or A. 

For a topological space X, we denote by ©,,(X) the set of all continuous 
complex functions f on X such that for every « > 0, there exists a compact 
subset F of X such that | f(x)| << for all e¢F. The set of all bounded 
continuous complex functions on XY will be denoted by €(X). 

For a measure-space X with a measure-function » defined on some o- 
algebra of subsets of XY, we have the usual complex function-spaces &,(X), 
here considered only forr=1. For feX,(X), we write, as usual, 


840 
t] 
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x 
For a topological space ¥ and fe €(X), we write sup | f(x)| as || f lle 


We reserve the symbol p for numbers such that 1< p< 2. For every 
number r 1, the number 7” is defined as r/(r—1). 


1.3 Hausdorff’s inequality for groups. The symbol G will be used 
throughout the present paper to denote an arbitrary locally compact Abelian 
group. The character group of G@ will be denoted by the symbol G*, a 
character of G being taken here as a continuous homomorphism of G into 
or onto the group 7. We write (x,y) for the value of the character y e G* 
at xe (i, or, dually, the value of the character ze G at ye G*. The annihilator 
of a subset X of G is the set N(X) of all ye G* such that (2, y) =1 for all 
ceX. We select any Haar measure A on G, and choose Haar measure p 
on G@* so that equality is obtained in the Plancherel theorem. For all group- 
theoretic terms not explained here, the reader is referred to the monographs 
of A. Weil [17] or of L. H. Loomis [9]. For f e¢ 2:(G), we write the Fourier 
transform 


1.3.1 (x, f(x) da(a) 


as f*(y). For functions h in 2,(G@*), the Fourier transform h*(z) of h is a 
function on G given by the formula 


1.3.2 duly). 


Let p be a number such that 1<p<2. For fe&,(G@), the Fourier 
transform f* of f is defined as follows. Let = be the family of simple 
measurable complex functions on G, that is, the measurable functions g which 
are in 2,(G@) and take on only a finite number of different values. It is 
clear that C 2,(G) N&(G). For we have || g* || g and 
| g* |e = || g ||. by the Weil-Plancherel theorem. It is clear too that g e &,(@). 
By applying M. Riesz’s convexity theorem (see for example [18], p. 192 et seq.), 
one can prove without difficulty that g* « 2,-(G*) and that || g* lp S || g |lp. 
Since § is a dense subspace of 2,(G), it is clear that the mapping g— g* 
can be extended in exactly one way over &,(G) so as to be continuous and 
linear. Thus we can ascribe to every f ¢ &p(G) a Fourier transform f* which 
is a function in &,(G*). It is clear that 


1.3.3 lo SUF 
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for all f e 2,(G@), since 1. 3.3 holds for all f in the dense subspace & of 2,((). 
We call 1. 3. 3 Hausdorff’s inequality, since the proof of 1.1.1 for general p 
is due to Hausdorff. We note in passing that the image of 2,(@) in &-(@*) 
is a proper subspace of 2,-(G*) for all infinite groups G (see [5]). 


2. Statement of results. 2.1 Definition. A function f in 2,(G@) for 
which equality holds in 1.3.3 is said to be a maximal function in &. A 
function f in 2,(G@) such that || f |], = || f* ||, is said to be maximal in &, (@). 


2.2 Definition. Let A be a compact and open subgroup of G. A func- 
tion f defined on @ such that 


2.2.1 f(r) y)oba(2), 


where ye G* and ae K, is said to be a subcharacter of the group G. 

In our search for maximal functions in 2,(@), we shall first show that 
every subcharacter and every translate of a subcharacter are maximal functions 
in 2,(G). We make the following introductory remarks. 


2.3 THeorem. Let f be a function on G which ts either a subcharacter 
or the translate of a subcharacter. Then there exist ae K, xe G, and ye G* 
such that 


2.3.1 f(r) = a(2, Yo) 
where A is a compact open subgroup of G. 
This assertion is obvious. 


2.4 TuHerorem. Let f have the form 2.3.1. Then f* has the same 
form on G*, and in fact 


2.4.1 f*(y) =A(A)a(Lo, Yo) Y) 
N(A) being a compact open subgroup of G*. 


We prove first that N(A) is a compact open subgroup of G*. It is 
trivial that N(A) is a subgroup of G*. Since A is compact and open, 
we have 


2.4.2 0<A(A) 


The Fourier transform of ¢4 is given by the equalities 


2.4.3 oa*(y) (x, = (x, y*)dr(z). 


If ye N(A), it is clear that da*(y) =A(A) > 0. If yf N(A), let aeA 
be an element for which (a,y) #1. Then we have 


( 
4 
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9,4. 4 oa*(y) (ax, ba (ax) dr(z) 


= (a,y*) Y*) ara (x) dr(x) = (a, (y). 


The first and last terms of 2.4.4 can be equal only if da*(y) =0, and 
therefore we find 


2.4.5 =2X(A) 


Now ¢a* is continuous, and hence N(A) is open and closed. Furthermore, 
o4* is in ©.(G*), and this implies that V(A) is compact. Returning to our 
original f, we can now easily compute f*. In fact: 


24.6 f(y) —a f (0,97) (2, 40) 
== (2a, (Lor, Yo) ba 


= 2 (7,90) yoy) 


By 2.4.5, the integral appearing in the preceding line is equal to 
by (ay (Yoy) =A(A) and this completes the proof. 
2.5 THrorem. Lvery subcharacter and every translate of a sub- 


character of G are elements of 2,(@) for all p, 1< p< 2, and are maximal 
functions in every 2p(G@). 


Let f be a function of the form 2.3.1. We lose no generality by taking 
@=1 in 2.3.1. Then we have 


2.5.1 Cf 1 da(a) = a(A)*”, 
and 
2.5.2 ll f le = [A(A) J. 


For every r => 1, we have 


By the Weil-Plancherel theorem and our choice of the Haar measure p» on G*, 
we have 


2.5.4 [A(A) = f le = = A(A) J, 
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upon equating 2. 5.2 and 2.5.3 with r—2. 2.5.4 implies that 
2.5.5 p(N(A)) = [A(A)]7 


Hence || f* = A(A) [A(A)]-/” = [A(A)]*? = | f 

Theorem 2.5 indicates that subcharacters of G and their translates play 
the same role for general locally compact Abelian groups that characters of 
T do for the theorem of Hardy and Littlewood. That is, they are functions 
in £2,(@) which are obviously maximal. As in the case of the theorem of 
Hardy and Littlewood, we now prove that subcharacters and their translates 
are the only maximal functions. Our results are comprised in the following 
theorem. 


2.6 THeroreM. Let G be any locally compact Abelian group and let p 
be a number such that 1 < p< 2. Then if f is a maximal function in 2(G), 
f is either a subcharacter or a translate of a subcharacter. 


2.7 Remark. Theorem 2.6 may be regarded as a direct generalization 
of the theorem of Hardy and Littlewood, since the group T obviously has no 
compact open subgroups other than itself. Since R has no compact open 
subgroups at all, Theorem 2.6 implies that no function 40 in &,(R) can 
be maximal. 


2.8 Method of proof. The fundamental inequality 1.3.3 is proved by 
using the Riesz convexity theorem. Our analysis of the case of equality, 
which occupies us here, depends upon obtaining conditions for equality in 
Riesz’s theorem, and exploiting the consequences of these conditions. In this 
process, the conditions for equality in Hélder’s inequality play an important 
role. 


3. Equality in Holder’s inequality. Throughout this Section, @ is an 
arbitrary locally compact Abelian group. The following theorem, while a 
consequence of well-known facts, does not seem to appear in the literature. 
We take 1’ oo and &,,(G) as usual the space of essentially bounded measur- 
able functions on G with || g ||. ess sup | g |. 


38.1 THroreM. Let lSr<o. Let fe&(G) and ge&(G) and 
suppose that f is different from 0 on a set of positive measure. The relation 


3.1.1 J. f(x)g(x)dr(z) = ||f lg 


holds if and only tf 
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3.1.2 g(t) =| 9 |e |r sgn f(x) ae. on G(1<r<o); 
3.1.3 g(x) = || 9 sgn f(x) a.e. on the set where f 40 (r=—1). 


(For ze K, sgn z is defined as z/|z| if zA0, and as 0 ifz—0.) For f(z) 
Ay and g(x) real and for 1 <r <0, this result is demonstrated in [8, p. 140]. 


of Only very minor changes are needed to extend the proof given there so that 
ns it applies to complex functions and to the case r= 1. 

of 

es 4, Some preliminary results. Let G be, as usual, a locally compact 
1g Abelian group. It follows immediately from Theorem 3.1 that a function 


fe2,(G@) is maximal if and only if f(z) =c(y,z)p(z) a.e. in G, where c 
is a complex constant, y is an element of G*, and p is a non-negative function 
), in 2,(G). Further information concerning this case is contained in the 
following assertion. 


4.1 TuHerorem. Let f be a maximal function in 21(G). Let E* be the 
set of points y for which | f*(y)|=||f Then where 


10 
- is a compact subgroup of G* and ye G*. 
- Let A be the set of points x for which f(z) 40. Then ye H* if and 


only if sgn [(y, x) f(x) ] is constant almost everywhere in A, as Theorem 3.1 
shows. Let so that sgn [(yo,7)f(%)] | co |—=1 ae. in A. 
We set Let y2€ Ho*; then sgn [ (2, yo) (2, 1) f (2) ] = 
|e: |==1a.e.in A, and sgn [(2, yo) (2, y2) f(r) ] = ce, | co| =1 a.e. in A. 
is A simple computation now shows that sgn [ (2, yo) y1¥2)f(@) ] = Co 
at ae.in A, Again, if y, then sgn [ (2, yo) (2, f(z) ] 
a.e. in A, Thus is a subgroup of G*. If is 
any function in 2,(G@), if a > 0, and if K* is the set of points y in G* for 
which |[k*(y)]] =a, then K* is compact, since k* e €(G*). In particular 


n 

E*, and therefore H,*, are compact. 

. We shall also require the following result. 

4.2 Let f be a maximal function in &,(G). Then the 
function h = | f* |?’-*f* is maximal in &,(G*), and its Fourier transform 

(1.3.2) in is equal to 

mf 42.1 lf | 


this function being taken as 0 wherever f =0. 


As f is maximal, we have by definition || f |p = || f* ||». The function 
h=—=| f* is in &,(G*), since | h |p =| f* | f* We have 
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furthermore |j h = || f* = || f = f For each g &,(G), 


consider the number 
H(g) 
It is clear that H is a bounded linear functional on 2,(G) and that 


4.2.2 || H || | H(g)|S lo lo =I 
g\|\p=1 
Taking g = || f “f, we find 


Combining 4.2.2 and 4.2.3, we find that || H || —|| f |p/”@. It follows 


from the representation theorem for linear functionals on & (see for 
example [9], p. 41) that there is a function 7 in &(G@) such that 


H(g) — g(x)n(z)da(x) for all ge2,(G); ie, 


We know too that | 
To identify the function 7, put g=—f in 4.2.4; this produces the 
inequalities 


Comparing the initial and final terms in 4. 2.5 and applying Theorem 3.1, 
we infer that 


4.2.6 = || f lle | F(x) |? = | F(x) |? F(z) 


for almost all xe G (the last function being taken as 0 wherever f(x) = 0). 
Finally, the relations 4.2.4 suffice to show that » is the Fourier transform 
of h (see [17], p. 118). This establishes the present theorem, as || 7 ||» and 
|| are evidently equal. 


5. Proof of Theorem 2.6. 5.1 If fe,(G) or ge%(G*), we set 
T(w)f=|f sonf and T(w)g =| sgng. Note that if 
q(w) =2/(u+1), where w=u-+iv, then || T(w)f = | f 


Tr) 


It 


| 
| 
B 
a 
5. 
5. 
un 
sec 
5 
un 
thi 
the 
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= || g Furthermore, q(0 + iv) = 2 and q(1 + = 1, 
—oc<u<o, and 7(—1-+ 2/p)f=f, T(—1+2/p)g=—g. 


Let J,,: - +, Zy be disjoint measurable sets in G, each of finite measure. 
Let 
exp(A, +%iBn) An, Bre R; (n==1,---,N) 
F(z) = 
li 
n=1 
Similarly, let J*,,- - -,.7*y be disjoint measurable sets in G*, each of finite 


measure, and let 
exp(Cm +71Dm) yed*n; Cn, Dne R; 
A(y) = 
0 gt 


m=1 


Function of the type of F and H are called simple functions. Consider 


By direct computation, we have 
N M 


¥(w) =X exp[Anp(w + + + Cmp(w + 


ff 
J 1, J*m 


It follows that &(w) is an entire function. 
Let f be a maximal function in &,(@). Let f* = | f* |?" sgn f*. Choose 


a sequence of simple functions Fy (a =1,2,- - -) on such that 
| (w) Fae = || T (w)f (0S Rew =1), 
5.1.2 lim || T(w)Fa —T(w)f = 9, 

a> o 


uniformly for w in any compact subset of 0= Rew =1. Similarly choose a 


sequence of simple functions H, on G* (# —1,2,- - -) such that 
5.1.3 | T(w) Ha S | T(w)f* (0S Rew S1), 
5.1.4 lim || T(w)Ha— T(w)f* = 9 


uniformly for w in any compact subset of Rew = 1. It is evident that 
this can be done. If 


va(w) =f [P(w)Fal*(y) [2(w) 


then for each a, Wa(w) is an entire function of w. Moreover, it follows from 


n=1 m=1 


848 EDWIN HEWITT AND ISIDORE HIRSCHMAN, JR. 


5.1.2 and 5.1.4 that lim = y(w), as uniformly for w in any 
compact subset of 0 = Rew = 1, where 
Thus we see that 

a. y(w) is analytic for 0 << Rew < 1 and is continuous for 0 = Rew <1, 

It follows from Holder’s inequality that 

| Ya(w)| (w)Fa]* T (io) Ha 
From 5.1.4 and Hausdorff’s inequality 1.3.3, we have 
From 5.1.2 and 1.3.3, we obtain 
(w)Fa]* S | T(w) Fa Sl T(w)f — f 


Thus | Ya(w)| S || f(x) (a= 1,2,- -; OSRewX 1). Let us 
suppose, for the sake of simplicity, that || f ||p = 1. Then we have | ¥,(w)| <1 
and thus 


b. |y(w)|S1 0S Rew = 1. 
Finally, since f is maximal in &2,(G@), 
y(—14+2/p) =1. 


The statements a., b. and ec. together with the maximum modulus principle 
imply that 
5.1.6 y(w) =1 0< Rew <1. 


If we estimate | y(w)| using Hélder’s inequality just as we previously esti- 
mated | ¥a(w)|, we find that this estimate is 1. Thus we may apply Theorem 
3. 1 to conclude that 


5.1.7 [T(w)f]*(y) =| Tw) fF (y) sgn [T(w) ] (0S Rew <1) 


for a.a. yeG*. (The exceptional set can of course depend on w). For 
Rew —1, we find that if #,,* is defined as the set of ye G* for which 
|[7(w) f]*(y)| = 1, then 


5.1.8 T (w)f*(y) =0 (Rew = 1) 


for almost all y ¢ F',,* (again the exceptional set may depend on w). 


| 

z 
d 


ny 


us 


le 
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In Theorem 4. 2, it was shown that f* is maximal in 2,(G*) and that 


(y, 7) du(y) =| f(z) sgn f(x). 


From this it follows that f plays the same réle with respect to f* that f* 
does with respect to f. Consequently, we may conclude that the Fourier 
transform on G of 7'(w)f* is given by the expression 


[T (w) f(x) sen [T (w)f (x) ] (0S Rew < 1) 
almost everywhere. For Siew —1, we find that if D, C G is defined as the 
set of points where the Fourier transform on G of T(w)f* has absolute 


value 1, then 
5.1. 8’ | T(w)f(x)| =0 (Rew — 1) 


for almost all +f Dy. 


5.2 Taking w= 1 in 5.1.8 and 5.1.8’ and using Theorem 4. 1, we see 
that f vanishes almost everywhere except on the compact set D, and f*(z) 
vanishes almost everywhere except on the compact set #,*. We immediately 
infer from this that f and f* may be taken as continuous and that 5.1.7 and 
5.1.7” then obtain everywhere. Consequently, we may now permit w to 
approach 1 + ww in 5.1.7 and 5.1.7’ to obtain the following equalities: 


5.21  [T(1+ iv) f]*(y) ye 

5.2.1’ [T (1+ iv) f#]* (2) weG. 

Taking v = 0, we see that 

5. 2. 2 =0 for E,*; f#(y) 40 for ye 

5. 2. 2’ f(x) =0 for f(z) 40 for reD,. 

Since the complement of /,* is exactly the set where f* vanishes, it is closed, 

and H,* is open. Thus = where is an open compact subgroup 

of G. Similarly, D, = 2D , where Dy is an open compact subgroup of G. 
It follows from 5. 2. 1 that for all <u |[T(1 + iv) f]*(y)! 

=1, ye Taking an arbitrary y, we must have 

5.2.3 sgn[7'(1 + iv) f(z) (2, y:7)] =d,, weD,, 


where d, is a constant of absolute value 1, independent of 2. Rewriting 
5.2.3, we have | f(x) |**/? sen[f(a) (2, =d», ce D,. Let v0; we 
find that sen f(x) = (2, y:)do, xe Dy. Consequently, if —o< v <<, then 
| f(a) |#e-/? — d,/do, xe D;. This is possible only if | f(a)| is a constant, say 
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k, throughout D,. Thus we have proved that f has the form f(z) = kd)(a, y,) 


or f(z) = 0 according as rea Dy or t.e., f is a subcharacter or a 
translate of a subcharacter. 


6. Examples. 6.1 If G is any compact connected Abelian group, then 
the only maximal functions in &,(G@), for any p such thot 1 < p < 2, are 


multiples of characters. 


This proves the original theorem of Hardy and 


Littlewood (GT). Theorem 2.6 shows this to be the case, since G can 
have no proper open and closed subgroups. 


6.2 If G is any connected, locally, compact, and non-compact Abelian 
group (for example, R or R" (n=2,3,---)), then 2%,(G@) contains no 
maximal functions at all save 0, since G has no compact open subgroups. 


6.3 A discrete Abelian group G always admits maximal functions: any 
function vanishing except at a single point in G is maximal in every &%,(G) 
(1< p< 2). Non-trivial maximal functions exist if and only if @ has finite 
proper subgroups, and then they are completely identified by Theorem 2. 6. 


6.4 There exist, of course, non-trivial examples of locally compact 


Abelian groups having compact open subgroups. 


As Pontryagin has proved 


({11], p. 77, Theorem 17), if G is locally compact, Abelian, 0-dimensional, 
and has a countable open basis, then it has arbitrarily small (hence compact) 
open-and-closed subgroups. All such groups admit non-trivial maximal fune- 


tions in every &,(G), as Theorem 2.5 shows. 


6.5 An intriguing example of the situation described in 6. 4 is presented 


by the p-adic numbers Q, under addition. 


all formal power series 


a=> axp*, 
k=n 


We represent Q, as the group of 


where n is an arbitrary integer and a; can be any of the numbers 0, 1, 2,- °°, 
p—1. Addition of two p-adic numbers a and b is carried out by adding 
the terms a, and b, and carrying remainders modulo p in the usual way. 
A generic neighborhood [,, of zero consists of all @ such that a, 0 for 


k < _m; here m is an arbitrary integer. 


Every I, is clearly a compact open 


subgroup of Q,. Haar measure on Q, is completely determined by specifying 


the measure J for all of the sets I,,. 


(It is easy to see that this fixes ) for all 


compact subsets of Q, and thence, via the usual construction, for all Borel 
sets in Q,.) As Ty is the compact group of p-adic integers, we deem it appro- 


priate to take == 1. Then a simple calculation shows that = p™ 


tate tet — 
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for all integers m. The character group Q,* can be constructed as follows 
(see [8], pp. 564-565). A generic character of Q, is a function yn, such that 


6.5.1 ynn(@) yak ( 2 axp*) —= exp [2xip*h( 2 axp*) | 


= exp[2rip"h ( )]. 


Here n and h are integers such that pfh. It is not hard to see that a 
complete family of neighborhoods of the identity yoo in Q,* are the subgroups 
N(Im) (m=0, +1, + 2,-- -); note that is the set of all yna such 
that n < m, and that as m decreases, I, increases and N(I,) decreases. As 
shown in 2.5.5, we have »(NV(T,)) = p™ for all integers m. 


6.6 Finally, we consider the group D which is the direct product of a 
countably infinite number of groups each consisting of exactly two elements. 
Each factor group may be taken as the numbers — 1, 1 under multiplication, 
and thus D is representable as the group of all sequences x = {@p}n-1” in 
which each z, is either —1 or 1 and the group operation is co-ordinate-wise 
multiplication. The neighborhood Um(e) (m = 1, 2, 3,- - -) consists of all x 
such that 7, = = 7; =* ‘= 2» 1, and these neighborhoods are a com- 
plete family of neighborhoods of the identity. Each U,, is an open-and-closed 
subgroup of D. There are of course other such subgroups: they are all 
identified as the sets of xe D for which a certain specified finite set of co- 
ordinates are all equal to one. It is furthermore easy to see that a translate 
of an open-and-closed subgroup of D consists of all xe D for which 2 has 
prescribed values at a fixed, finite, number of co-ordinates. Finally, one verifies 
without difficulty that a maximal function in 2,(D) is any linear combination 
of characters of D which has constant absolute value on one of these trans- 
lates and is zero elsewhere. (Characters of D are just the functions 
* * Where t2,° is any finite set of positive integers. ) 
It was a close study of certain maximal functions in 2,(D) that led to the. 
formulation of Theorem 2. 6. 
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ALGEBRAS WITH TWO GENERATORS.* 


By JoHN WERMER. 


Introduction. Let C denote the algebra of all continuous complex- 
valued functions defined on the unit circle |A|=1. With the norm 
\ f || sup |f(A)|, € is a Banach algebra. 

|A[=1 


For any ¢,y in C, let [¢,~] be the closed subalgebra of C generated by 
¢,y and the constant 1. [¢,y] then consists of those functions in C which 
can be uniformly approximated by polynomials in ¢ and y. 

In [1] the author determined the algebras [¢,y] with } one-one on 
|A|==1. In this paper we shall give explicitly all algebras [¢,~] where 
$(A) =A? and ¢ and y together separate points on |A|—1. In §2 we 
discuss the general problem: when is [¢,y~] equal to C? 


1. We assume \” and y separate points on |A|—1. We shall prove: 


THEOREM 1. Jf [A*,~] #C, then there exist n distinct points 
in |A| <1, where n is odd, and functions E, analytic in |A| <1 


and continuous in |X| <1, such that y(A) = + { Il (A? — 
Conversely, if y has this form, then [d?, y] AC. = 


Definition. Let R be any Banach algebra, M a closed subalgebra. We 
say M is a maximal subalgebra of R if for any closed subalgebra M’ of R 
with MC WM’ we either have M’ = M or M’ = R. 


THEOREM 2. Every subalgebra [A*, p], where p(rA) = (II (A? — 2,7) 3, 
i=1 


the 4 are distinct points in |A| <1 and n is odd, is a maximal subalgebra 
of 


We. shall use the following notations: If h(A) is any function defined 
on |A| = 1, then ho(A) = 4(h(A) —h(—A)), he(A) =F (h(A) + h(—A)). 
Clearly h = he + ho. 

If § is any set on |A| = 1, —S= S$}. 

If » is any complex-valued measure on | A | = 1, then 
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Ho(S) = we(S) = 3(#(8) + 


Clearly and again are measures on | A | = 1, and p= pe + po. 

% denotes the class of functions f(A) in C with f(A) analytic in |A| <1, 
continuous in |A| <1. 

§’ denotes the class of function h(A) on |A|—=1 such that there 


2r 
exists H(A) analytic in |A| <1 with sup f | and 
rc 0 
h(e*) =lim H(re®) If he’, #0 ae. (See [3].) Also 
r>1 


| da a H(z) —2 
Sine! (A)| | da | and H(z) 2)*h(A) 
Lemma 1. If [A*,g] AC, go(A)KO0, |A| —1, then g.(A) = E(A’), 
E in &. 


Proof of Lemma 1. Let o be a measure on |A|=1 with | A?"do(A) =0, 
JA|=1 Al=1 


JA]=1 


A"doe(A) 0, = 0. 
|A|=1 


By a known theorem, [2], this implies that o, is absolutely continuous, 
doe(A) =h(A)dd, and h(A) 

By hypothesis now [A*,g] ~C. A well-known property of the space C 
then yields a measure » with »=£0 and 


(1) (Adda = 0, n,m >= 0. 


Setting m—0,-we conclude from the preceding that dy,.(A) =a(A)d, 
a(A) in ©’. Then a(A) =—a(—A). 

But the measure dv(A) =g(A)du(A) also annihilates all 
as is given by (1) for m=1. Hence dy,(A) =m(A)dd, m(A) in 
Now dve(X) = go(A)dpo(A) + ge(A)dpe(A) = Jo(A)dpo(A) + ge(A)a(A) da. 
Hence dyo(A) = {m(A) — ge(A)a(A)}(go(A)) = B(A)dA. Now go(A) 
all |A 1, by hypothesis. Hence is summable on |A|=—1. Also 
b{A) =b(—A). Thus = (a(A) + )dA =f (A)dA, f summable, 
<0. We can then rewrite (1): 


(1’) f(A) dA = 0, 
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The above considerations, applied to the measures g”(A)f(A)dA, 
m=0,1,2,- - -, yield at once: (g™(A)f(A)dA)¢ = ym(A) dA, ym in $’. Hence 


(2) (A) F(A) )o = ym(A) a.e. on |A| =1. 


An elementary computation then gives for n odd and setting A(A) = y1? — yoyz 
= (A)f(—A): 


(n-1)/2 


= C,* — Yn-vY Yn+2-vyv) = An 
y=0 


ae.on |A|=1. Thus g.(A) = $A,/A(A) = ae. on |A|—1. Here 
® is meromorphic in |A| <1. It must be shown, of course, that A540, 
and we shall do this below. Now 2"@"A(A) =A,(A) a.e. on |A|=—1. On 
both sides of the equation are functions analytic in |X| <1 and with non- 
tangential boundary values existing a.e. on |A|=—1. By a theorem of 
Privaloff, [3], we hence get 2"6"(A)A(A) =A,(A), |A| <1. Since A is a 
fixed function and n arbitrary, ® can have no poles. Further, 


Since = (g/f)o(A), we get | S$ K(1—|2|)7I] where K is 
a constant. Hence 


(n-1)/2 
2" | n(z)| | A(z)| =| An(z)|S 4K?(1—|2 |)? 
= K’(1—|z|)?- 2" 


Taking n-th roots and letting we have | (z)|= || g ||, provided 

A(z) 40. But the zeros of A are isolated in | z| <1 and so ® is bounded 

in|z| <1. Since g-(A) =lim ®(ra) for a.a. A, |A| —1, and since g, is 


continuous, we conclude that ®(A) is continuous in |A|S1. Thus 
ge(A) = E(A?), E analytic in |A| <1, continuous in 1, ie. is 
in 

It remains to show that As£0. Suppose the contrary. Since A(A) 
= (A) f(A) f(—A) and go? #0, we get f(A)f(—A) on |A| —1. 
Let D= {A|f is defined at A and —A, f.(A) 40 and f(A)f(—A) = 0}. 
Now f, 0, for else f(A) =f(—A) and so f?(A) =0a.e. Since f is defined 
a.e., f(A) f(—A) = 0 a.e. and fy by (2), with m =0, and so f.(A) #0 
a.e., D has measure 27. 

Let 8 = {Ain D| f(A) 40}. Then SU—S =D and Sn —S is empty, 
as is easily verified. Hence the Lebesgue measure of S, m(S), = m(— S) =z. 
By (2), (g™f)o(A) =ym(A) a.e. on |A|—1, ym in §. For A in S, 
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(y"f)o(A) = 3g" (A) F(A) =g"(A)fo(A). Hence g"(A) = ym(A)/fo(A) ace. 
on 8S. Set G(A) =y:(A)/fo(A). Then G is meromorphic in |A| <i and 
G(A) =g(A) on a subset S, of S with m(S,;) = m(S) and G(A) = g(—A) 
on S2, 8, C —S,m(S2) =m(—S8). Then fo(A)G"(A) = ym(A) on 8. 
Since m(S) > 0, it follows that this relation is true for |A| <1 and ae. 
on |A|=1. Hence G has no poles in |A| <1, and also 


fo(2)G@™(z) = 2 — G™(A) da. 


Now G(A) = g(A) a.e. on S, G(A) = g(—A) a.e. on — 8, and m(S U — 
= 2m. Hence | G(A)|S|\g | ae. on |A!=—1. Thus 


fo(2) || @™(2)| S 2 


whence | G(z)| S| g || if fo(z) 40. Thus G is bounded in | z| <1. 

Choose now A, in S,* M S.*, (the asterisk meaning closure). There exists 
such a Xo, since S,* and S.,* are closed sets on the circle, each of measure 7. 

Set g(Ao) = % g9(—Ao) = B. We shall prove «=~. To this end set 
H(z) = (G(z) —«)(G(z) —B). H(z) is then a bounded analytic func- 
tion in |z| <1. Given « >0, choose § > 0, so that | g(A) —a| <e and 
|g(—A) —B| <e provided |A—2A | < 8. 

Now a.e. on |A | —1 either G(A) =g(A) or G(A) =g(—A). Hence 
a.e. on |A|=—1, |A—A| <8 | H(A)| It follows from the 
Poisson integral representation for H in | z| <1 that! H(z)| <eif|z| <1 
and |z—dA.| < 8. 

Now S,* and also Aye S8,*. Hence the neighborhood | 
contains z,, 22 with | G(z,) —a|<é, | G@(z.) —B|<e. An arc in this 
neighborhood which joins z, and 2, then contains some z where both 
| G(z) —a| < dand | G(z)—B| < Hence 8. Thus g(Ay) = g(— A»). 
This contradicts go(Ao) #0. Hence AS0. Lemma 1 is thus established. 


Proof of Theorem 1. [A*?,~] by assumption, and also 
since A*,y separate points on |A|—1. Lemma 1 then gives that 
We(A) = (A?) with in %. Now any function in may be uniformly 
approximated by polynomials in | A |= 1 and so y, is approximable in the 
norm of C by polynomials in A?. Hence ye [dA7,y]. It follows that 
wo [A*, y]. 


Seth + y. Then ho = ~00n]rA|—1. Also [r2,h] C [A’, ¥] 
AC. By Lemma 1, then, he(A) = Wo?(A) + ¥-(A) = E(A2), in and so 
Yo°(A) = F(A’), F in Now yo(A) 40 if |A | and so F(z) has only 
finitely many zeros in |z| <1. Thus we can write 


h 


Cf 


W! 

Yo 

si 

Si 
| 
t 

A 

Pp 

§ 

b 

St 
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F(z) (2 — Ai?) (£2 (z) )*, 
i= 
where E,e% and where the A? are distinct points in |A| <1. Hence 


= (a2 — a2) (a2), where {II is one of the two 
i=1 


single valued branches of this multiple-valued function defined on | A | —1. 
Since Yo(A) = — o(—A), m must be odd. Thus y = ye + yo has the desired 
representation. 


Conversely, suppose is of the form y(A) = H(A?) + p(A)£2(A?), 
where E,, Be %, = {II (A2—a2)}%. Then [22,9] C p]. Now it 
i=1 


is easily seen, using the fact that p(A) = — p(— A), that [A*, p] consists of 
those and only those functions having the form: f(A) = A(A”) + p(A)B(A?’), 
A,B in 2%. Hence clearly [A?, p] and so [A*, y] AC. Theorem 1 is thus 
proved. 


Proof of Theorem 2. Suppose [A*, p] C M’, where M’ is a proper closed 
subalgebra of C. We must show M’ = [)?, p]. 


Take any f in M’. Let h =f -+- rp where r is a constant chosen so that 
ho(A) #0 on |A|=—1. Since [A?,h] CM’, Theorem 1 then gives 


h(a) = A(a*) + (II BQ’), A,B in 
Hence 


ho: p(A) = a (A? — }4B(A*) 


Set k=hop+p. Then k)(A) =p(A) ~0 and [A*,k] CM’. Hence by 
Theorem 1, = hop(A) = K(A?), K in Hence the é; and the 4 must 
be equal in pairs, whence h(A) = A(A?) + p(A)B(A?). Thus he [A?, p] and 
so fe [A?, p]. Thus M’ = [2?, p]. 


2. Let now ¢,w be any pair of functions in C separating points on 
|A| 1. Let © be the curve in R,, the space of two complex variables, which 
is given parametrically by: 2, = 22=y(A), |A|—1. is then a 
simple closed Jordan curve. By a piece of an analytic surface we mean a 
bounded subset ¥ of R, such that if (z,°,2.°) e F, there exist functions 
%(€),22(€) analytic in a neighborhood of & in the complex é-plane such 
that z, = = represent parametrically in a neighborhood of 
(21°, 22°) with z,(é)) = 21°, = 22°. We now have condition: 


JOHN WERMER. 


(1) There exists no piece of an analytic surface bounded by YL. 
THEOREM 3. Condition (I) is necessary in order that [¢,¥] =C. 


Proof. Suppose # is a piece of an analytic surface bounded by [. Let 
P(z,,22) be any polynomial. Restricted to ¥, P(2:, 22) is an analytic fune- 
tion on ¥ and hence | P(2;,z2)| attains its maximum on the boundary. 


Fix now 2,°, 22° in ¥. For all polynomials P set m(P(¢, w)) = P(z,°, 22°). 

Then by the above, 
| m(P(¢, ¥))| P(%, 22)| = || P(¢, 9) 

Thus m is a bounded multiplicative functional defined on a dense subset of 
[¢, ¥] and hence may be extended to be a multiplicative functional on all of 
[¢,y]. If now [¢,y~] —C, every such multiplicative functional has the 
form: f > f (Ao), Ao fixed, | Ao | = 1. Thus 2,° = (Ao), 22° = 
= (Ao). But (21°, 22°) ¢T and ($(Ao), ¥(Ao)) This is a contradiction, 
and so our assertion holds. 


Turorem 4. Let (A) be one-one on |A|=1. Then (I) is also 
sufficient in order that [¢,~] = C. 


Proof. Let y be the curve on which ¢(A) maps the unit circle. Then y 
is a simple closed curve in the plane. Suppose now [¢,y~] ~C. The author 
showed in [1] that then y belongs to the algebra generated by ¢ and hence 
that (A) = F(¢(A)), where F is continuous inside and on y and analytic 
inside y. Then the set of points (2:, 22) in Ry with 2; =, z. = F(é) where 
é ranges over the interior of y is a piece of an analytic surface F and F is 
bounded by T since (é, F(é)) with € on y is the general point on I. 


~ 


THEOREM 5. Let A*,y separate points on |A|=1. Then (I) is also 
sufficient in order that [d*, y] = C. 


Proof. Suppose [A?,y] ~C. By Theorem 1, 


y(A) = (2) + {11 (a2 —a2) (a2), Bee n odd. 
i=1 


Let & be the Riemann surface of the function { Il (z — »,7)}4, represented as 
4=1 


a two-sheeted covering of the z-plane with branch-points over d,°, i=—1,?, 

-+,n. Let &’ be the region of & lying over the region | z| <1, and y 
its boundary. Then y is a simple closed Jordan curve on &. Let Z be the 
function on §’ whose value at qg, where q lies over z, =z, and W the function 
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on &’ whose value at qg is Ei(z) + {II Z and W are 
i=1 


analytic on 9’ and together separate points on 9’. 

The set of points (Z(q), W(q)) in Rs, q in Y’, is a piece of an analytic 
surface #. Its boundary is the set of points (Z(q),W(q)) with qin y. But 
for q in y, q lying over A”, Z(q) W(q) = and so (Z(q), W(q)) eT. 
Conversely all points in Fr are obtainable in this way. Hence I is the boundary 
of ¥. Thus (I) fails if [A2,y] AC. ged. 

Theorems 4 and 5 suggest the conjecture that [¢,y~] —C if and only 
if there exists no analytic surface bounded by I. 

We hope to return to this question at some later date. 
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ON MONOTONE SOLUTIONS OF SYSTEMS OF NON-LINEAR 
DIFFERENTIAL EQUATIONS.* 


By Puitip HartMAN and AUREL WINTNER. 


A theorem of Kneser [3], pp. 183-191, on a non-linear differential 
equation of second order, 


(1) y’ = F(t,y), 


implies that if g = q(t) is a non-negative, continuous function on 0 St <a, 
then the linear equation 


(2) y’=a(t)y 
possesses a solution y= y(t) satisfying y(0) > 0, 
(3) y(t) y(t)S0, y(t) 20 


on 
Kneser’s theorem dealing with (1) has been generalized in [1] so as to 
apply to an equation of the type 


(4) = F(t, y’) 3 


cf. [1], where references are given to earlier treatments of (4). The theorem 
concerning (2) has been generalized in [2] to the case in which (2) is replaced 
by a linear system of differential equations of first order, 


(5) =—A(t)a, 


where z is a vector with n components z',- - -,2" and A(t) = || aj,(¢)|| isa 
continuous by n matrix. 

The proof in [2] for the general theorem on (5) is considerably simpler 
than older proofs for the particular case dealing with (2). The object of the 
present note is to carry over the ideas of that proof to a non-linear system 


(6) = — f(t, 


where zx and f are vectors; so that (4) is a particular case of (6). 
The following terminology will be used: A vector z is called non-negative 


* Received June 24, 1954. 
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(c= 0) or positive (c >0) if all of its components are non-negative or 
positive. Similar definitions apply to r=0, x < 0, and to 14 = 2 if 2%, Le 
are two vectors. 


(*) Let f=(f',---,f") =f (t,7) be a continuous (vector) function 
of (t,2) = --,a*) for 


(7) t=0, 
satisfying 
(81) f(t, 0) =0; (82) f(t, 2) 20. 


For every constant T > 0, let there exist a positive, continuous function 
¢=—¢r(r), OS r<o, satisfying 


(9) f dr/p(r) =00 
and 
(10) (0S) if OStST. 


Then, for any constant c> 0, there exists at least one (vector) function 
t=a2(t) which satisfies 

(11) | 

and which is a solution of (6) for Ot <o (so that 

(12) a(t)=0O and (t)S0 

for 0=t<o). 

In (10) and (11), the symbol | 2| denotes the Euclidean length of the 
vector z and the dot denotes scalar multiplication. 

(*) contains the result of [2] concerning (5) ; in fact, if f(t, 7) = A(t)z, 
then ¢r(r) in (10) can be chosen to be Crr, where Cr =const., and (82) 
holds on the set (7) if the elements aj,(¢) of A(t) are non-negative. 

If x = (x1, x*) is identified with (y,— y’), then (4) becomes the system 


== — = — F(t, — 2’). 


If this system is identified with (6), so that f= (f*,f?) becomes 
(1, F(t, x1, 2?)), then (*) requires that F(t,y,y’) be defined and con- 
tinuous for 0, y=0, y’ S0; that 


(13) F(t,0,0)=0 and F(t,y,y’) 


finally that, for every 7’, there should exist a positive continuous function 
¢=¢r(r) satisfying (9) and 
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(14) if S50. 


Under these conditions (*) implies that, for every c > 0, there is a solution 
of (4) satisfying y?(0) + y’7(0) =c and (3) for 0 St<o. 

This assertion is in some respect similar to that of (1) in [1] which, 
however, places more emphasis on the unequal standing of the components 
of the vector x= (y,—y’). In [1], the function F(é,y, y’) is continuous 
fort =0, y= 0, y = 0 and satisfies (13), but (9) and (14) are replaced by 
the requirement that, for every T > 0, there exists a function yy = y7(r), 
0=r<o, satisfying 


(15) rdr/y 


and 
16) F(t,y,y) Sv(—y) if OStsST, 0SyST, ¥ S0. 
y y y 


Correspondingly, (I) in [1] permits the assignment of y(0) > 0, rather than 
that of y?(0) + (0). (Note that if y is fixed (and y’~0), then (14) 
implies the inequality in (16) with y(— y’) =— ¢(y?+ y”)/y’. With this 
choice of y, the relations (9) and (15) are equivalent.) 


Proof of (*). Let the definition of f(¢, x) be extended over the (n + 1)- 
dimensional half-space (0 x arbitrary) by the assignment f(t, x) = 0 
if at least one component of x is negative. It follows from (8,) that f(t, 7) 
is continuous on this half-space. Furthermore, (8,) and (10) hold for 
arbitrary x (whether or not 7= 0). 


The condition (9) and the inequality (10) (or just | f(t, )| S ¢(| 
imply that if ¢ = x(t) is a solution of (6) on some interval (0S) fStS?, 
then all continuations of z(t) exist on Ot <0. This follows from [4], 
pp. 176-178, if (10) is replaced by | f(t, 7)| = (||). In order to use (10), 
let r =| | in [4], p. 177, be replaced by r= | a |*; cf. [5], p. 557. Then, 
by (10), | =| S2¢(r) and so the argument 
applied in [4], p. 177, becomes applicable. 

Note that the formulation above in terms of “all continuations” is 
needed since the conditions imposed on (6) do not imply the local uniqueness 
of solutions of (6). The assertion concerning the continuability of « = 2(t) 
over the range 0 = ¢ <o is not true if f(t, x) is defined only on the set (7). 

Let T > 0 and let 2) > 0 be a given positive vector. Consider a solution 
x(t) of (6) determined by the initial condition = 2. 
This solution can be considered to exist for 0 t<o and, in particular, 
for OXSt=T. In view of (6) and (8.), 2’(t) 0. Consequently, 
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a(t) >0if OStST. (In particular, x(t) on 0 
a solution of (6) even before the extension of the definition of f.) 

For a given T > 0 anda given c > 0, there exist at least one x) = 2)(T’, c) 
and a corresponding solution «=.a(t;T) =a(t,T, 2) of (6) satisfying 


and, for t = 0, 
(18) | T)| 


If this fact is granted for a moment, the proof of (*) can be completed as 
follows : 

Since «= satisfies (6), hence a’(t:7) S0 by (82), it is seen 
from (17) and (18) that 


In view of (19) and the fact that e—-a(t;T) is a solution of (6), the 
sequence of functions x —«a(t;m) of t, where m =1,2,:--, is uniformly 
bounded and equicontinuous on any finite ¢-interval. Hence there exists a 
subsequence which is uniformly convergent on every finite ¢-interval. Let 
t= x(t) denote the limit of such a subsequence. 

By (18), satisfies (11) att =0. Also, is a solution 
of (6), since each x—«a(t;m) is, and satisfies x(t) 20 on OSt<om., 
since 20 on OStST if m=T; cf. (17). Finally, S90, 
by (6) and (82). 

Thus, in order to complete the proof of (*), all that remains to be proved 
is the assertion, granted above, concerning (17) and (18). To this end, 
suppose first that the solutions of (6) are uniquely determined by initial 
conditions. Thus is uniquely determined by (7,2) and is 
a continuous function of (¢, 7,2). In particular, | )| is a con- 
tinuous function of 2, for fixed T. Since (6) and (8,) show that x(t, T, 0) = 0, 
it is seen that | (0, T,0)| =0. Also, a > 0 implies 7(0, T, 2) = 2, hence 
= Thus, if (> 0) is fixed, continuity considerations 
supply the existence of an 2) > 0 satisfying | 7,2 )| c. This proves 
the statement concerning (17) and (18) when the solutions of (6) are 
uniquely determined by initial conditions. 

If (6) does not have this (local) property, let 7 be fixed and, on the 
bounded (t, a)-set defined by 


(20) r= 0, |x| 2¢, 
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let 2), fo(t,%),- be a sequence of smooth functions satisfying 
(211) fj(t, 0) =0; (212) fi(t,7) 20 

and, as j 00, 

(22) 2) > f(t, 2) 


uniformly on (20). (The existence of such functions f; is clear, for example, 
from the theory of Fourier series.) 

The uniformity of (22) and the fact that ¢ = r(r) is a positive func- 
tion show that, on (20), 


(23) (0S) < 2br(| |*) 


if j is sufficiently large. Consequently, if j is sufficiently large, the function 
f;(t, x) can be extended to the set (0 ¢=T,2=0) so as to be continuous 
and satisfy (21.) and (23). 

The considerations above, dealing with (6), show that, if 7 is sufficiently 
large, then 


(24) a — f;(t, 2) 
has a solution x = 2;(t) satisfying 
(25) |2;(0)|—=c and a(t) Z20,2/(t) on OStST. 


In particular, | 27;(¢)} Sc on OStST. 

The sequence of functions 2,(¢),22(¢),- - - is uniformly bounded and 
equicontinuous on 0 tT. Consequently, there exists a subsequence which 
is uniformly convergent on OS¢XT. Let x—<2x(t;T) denote the limit 
function of such a subsequence. It follows from the uniformity of the limit 
relation (22) on (20), that x(t) —2(t;T) is a solution of (6). By (25), 
z(t; 7) satisfies (17) and (18). This completes the proof of (*). 

(*) and the procedure applied in [6] supply existence theorems for 
solutions of (6) which are representable, for 0 = ¢t <2, as Laplace integrals 


(26) z(t) = e-t*da(s), where da(s) =0 

0 
(the last inequality means that a a(t) is a vector function the components 
of which are non-decreasing on <0). 


(**) Let f—f(t,v) in (6) satisfy the conditions of (*) and, m 


addition, possess continuous partial derivatives of every order satisfying 
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on (7), where D.' =0"/dz". Then any solution of (6) satisfying (12) on 
0St <oo is representable in the form (26). 

Proof. The theorem of Hausdorff-Bernstein states that the existence 
of an a= satisfying (26) is equivalent to 


(281) for 0<St<o, 


where h =0,1,---. The relations (12) are equivalent to (28,) and (28,). 
Differentiation of (6) gives 


a’ (t) = — df —3 


Thus it is seen from (27) and (28,) that (28,.) holds. A simple induction 
proves (28,) for h —0,1,-- - and, therefore, (**). 


The well known “enveloping” property of the partial sums of the series 


et == (—t)™/m!}, 
m=0 
where x e-* is the solution « —-—-2, is the simplest illustration of the 
following theorem: 
(***) Let f(t, x) in (6) satisfy the conditions of (*) and, in addition, 
let f(t, 2) be defined and continuous on the half-space (0 St<o,|ax| <<) 
and be a non-decreasing function of 2, 


(29) if mS 
Let r= x(t) be a solution of (6) satisfying (12) forOSt<o. Then the 
successive approximations to x= x(t), defined by 
(30) z(t) =0(0); = — f(s, 


where m =1,2,: +, satisfy 


(31m) (—1)"(am(t) —x(t)) 20 for OSt <o, 
where m=0,1,° °°. 


Although f(¢, 2) is defined and continuous on the half-space (0S t <a, 
and x arbitrary) and (29) is required on this half-space, the inequalities (82) 
and (10) are assumed only on (7). Accordingly, it is neither stated nor 
assumed that the successive approximations satisfy z(t) 20. Neither is 
it stated or assumed that the successive approximations converge. 
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The conditions of (*) and the additional condition (29) of (***) are 
satisfied if (6) is the linear system (5) and every element aj(¢) of the 
matrix A(t) is continuous and non-negative. 


Proof. It is clear that 


tmn(t) —2(t) = 2(8)) — 


Hence (31m) holds, by virtue of (29), if (31n+) holds. But 2’(t) <0 
implies that z(t) S2z(0) for OS that is, that holds. Thus 
(***) follows. ; 
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THE SPECTRA OF TOEPLITZ’S MATRICES.* 


By Puitiep HARTMAN and AUREL WINTNER. 


1. Introduction. The function classes L? will refer below to functions 
of a real variable ¢ on the interval —7r< ¢ < z. Two functions which differ 
only on a zero set will be considered to be identical. 

If f=(---, fu, fo +) is a vector with complex components and 
a finite length, 


(1) 3 Ife [2)4<0, 


00 


let f(@) denote the function of class L? having the Fourier series 


(2) 


Let f = (f-, f+), where f* and f- are the respective sequences f* = (fo, f1,° °°), 
f=(---+,f-s,f4+), and let the corresponding functions ft(¢), f-(¢) of class 
L? be called the interior and exterior functions of f(¢), 


n=0 n=1 
Finally, let 2 and T denote the Laurent and Toeplitz matrices determined by 
f, that is, let 2 = (fnm), where n,m —=0,+1,---, and © = (fam), where 

Functions of class LZ’ having Fourier series of the form (3) will also be 
referred to as interior and exterior functions. 

If (a, 2*) and u* have finite lengths, then @(a-, x+) = y*) and 
Tut = vt are defined (without necessarily having a finite length). The 
relationship 2(0, 2*) = y*) implies that = y*. In addition, y*) 
= X(2-, z+) means that f(¢)2(¢) = y(¢), where y(¢) is a function of class: 
L* with the sequence of Fourier coefficients (y-, y*). In particular, if y* = Ta*, 
then y*(#) is the interior function of f(¢)2*(¢), 


Let f(¢) be real-valued, so that 
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(5) fn = 


Correspondingly, 2 and Z are Hermitian matrices. @ can be considered 
as a self-adjoint operator on the space L’. A function z(¢) of class L? is 
in the domain of if y(¢) =f (¢)z(¢) is of class L*; in which 
case y(?) = Lr(¢). According to Toeplitz ([10]; cf. [6], pp. 152-155), 
A is in the spectrum, S(2), of 2 if and only if the set of ¢-values 
{p|A—« < f(¢) <A+<} has a positive measure for every « > 0, while 
is in the point spectrum, P(&), of 2 if and only if meas{¢|f(¢) =A} > 0. 
In particular, 2 is bounded if and only if 


(6) f() is bounded 


(that is, if and only if | f(¢)| const. for almost all ¢). 

According to Toeplitz (cf., e.g., [3], p. 360), when (5) is assumed, 
& is bounded if and only if 2 is, that is, if and only if (6) holds. In this 
case, { can be considered as a bounded self-adjoint operator on the Hilbert 
space of the interior functions 2*(¢) of class L?, where y*(¢) = T2*(¢) is 
defined by (4). Furthermore, the least value, m, and greatest value, M, of 
the spectrum S(Z) of ZT coincide with the corresponding values belonging 
to S(2). 

Because of the problem of the possibility of associating a self-adjoint 
operator with the matrix { when (6) does not hold, the case of a bounded 
& will be considered first (Part I); the discussion of an unbounded & will 
be deferred to Part IV. 


Part I. Bounded T-matrices. 


2. The spectrum S(2). The following theorem, conjectured in [3] 
but verified there only in certain particular cases, will first be proved. 


Let (1), (2), (5) and (6) hold (so that f(¢) is bounded and real-valued). 
Let m and M be the (essential) lower and upper bounds of f(¢). 


(i) The spectrum S(X) of X is the closed interval [m, M]. 


(ii) If f() ts not a constant, that is, if m < M, then the point spectrum 
P(X) of is empty. 


These assertions imply that, wnder the assumptions (1), (2), (5), (6) 
and f ~const., T has a continuous spectrum only. This contrasts with the 
situation for Laurent matrices 2 which can have a pure point spectrum (or 
a pure continuous spectrum or a mixture of continuous and point spectra). 
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The above assertions are somewhat surprising in view of Szegd’s 
results [8] on the asymptotic distribution of the characteristic numbers 
Ace Of the k-th section (frm), where n,m 1, 
-++,k—1, of ©. Those results imply, for example, that, in the case of 
the function defined by f(¢) = + 1 according as—_7r< ¢<0or0<¢<7, 
one has, as k oo, 


{number of Ay, << —1-+ «}/k—> 5, {number of Aj, > 1—e}/k > 8, 


for every « >0; while, according to Toeplitz, —1lAj;z1. Thus, for 
large &, about a half of the numbers A,x,° - *,Axx, are near —1 and about a 
half are near +1. On the other hand, the spectrum of © consists of the 
entire interval —1=A=1; in particular, the numbers Aj, are dense in 
—1SA=1. 

It will be seen in Section 7 below that (ii) is false if f(@) is not bounded. 
In other words, there exist real-valued functions f(¢) of class LZ? for which 
the corresponding (Hermitian) Toeplitz matrix & satisfies Tz* — 0 for some 
vector 2*(40) of finite length. On the other hand, (i) can be formulated 
so as to remain valid for all (bounded or unbounded) Hermitian Toeplitz 
matrices; see Section 12. 

(i) will be deduced from (iii) below in Section 4; (ii) will be proved 
in Section 3. 


3. Proof of (ii). Suppose that P(Z) is not empty, so that there is a 
point in P(Z). If is the identity operator, then AX is the 
Toeplitz matrix belonging to f(¢) —Ao. Hence, it can be supposed that 
Ayo = 0 and that there exists an x%*(¢) +0 of class LZ? such that Tao*(¢) = 0. 
According to (4), this means that 


Since m < M, it follows that f(¢) 540. Furthermore, by a theorem of 
F. and M. Riesz, z)*(¢) +£0 implies that 2 *(¢) vanishes at most on a set of 
measure 0. Hence y-(¢) +0 and so there is a least (positive) integer n — NV 
satisfying y, 740. It can be supposed that y_y —1, so that 


If the relation (7) is multiplied by e#¥%, it is seen that Tzt(¢) —1 if 
is the inner function e‘V%z,*(¢). Similarly, it is seen that 
if z*() is the inner function (e+) — y (y,,)eN%)a,+(@). An induction 
shows that the range of & contains 1, e*%, e?#¢,- - - and is therefore dense in 
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the Hilbert space of interior functions. But this implies that the self-adjoint 
operator Z cannot annihilate x*(¢) +0. This contradiction proves the 
assertion (ii). 


Part II. Riemann’s problem. 


4. Equation (8). The problem of the existence of interior and exterior 
functions 2*(¢), (¢) satisfying the equation 


(8) =1+ yo (4); 


when f(¢) is given, goes back to Riemann; cf. [4], pp. 309-321 and [5], 
p- 210. The answer to this problem when f(¢) is real-valued, bounded and 
measurable and solutions 2)*(¢) of class L? are desired is contained in the 
following assertion : 


(iii) Let f(¢) be a real-valued, bounded, measurable function on 
(— 7,7). Necessary for the equation (8) to have a solution x *(¢) of 
class L* ts that f(¢) have the following properties: 


(9) sgn f(¢) ts a constant (1 or —1); 
(10) 1/f(¢) is of class 


It will follow from (iv) that the converse of (iii) is true, that is, that 
(9) and (10) are sufficient in order that (8) have solution 2,+(@) of class L*. 


Remark 1. It will be seen below (Section 7) that (9) does not remain 
a necessary condition if the hypothesis that f(¢@) is bounded is replaced, for 
example, by the assumption that f(¢) is of class L?. 


Remark 2. Proof of (i). The fact that (9) is a necessary condition 
for the solvability of (8), with an 2 *(¢) of class L*, implies (i). 


In order to see this, lett m<A< M. If A is not in the spectrum 
S(Z) of ZT, then the equation (I —AY)z*(¢) —1, that is, the equation 
(f(¢) —A)a*(?) =1+y(¢), has a solution 2*(¢) of class L*. But since 
sgn(f(¢) —A) is not a constant (almost everywhere), this leads to a con- 
tradiction. Hence 4 is in S(Z). Since S(Z) is a closed A-set, (i) follows. 


(iv) Let f(¢) be a function of class Lt. Sufficient for (8) to have a 
solution x,*(¢) of class L? is that f(¢) have the properties (9) and (10) 
(in which case yo(¢) is of class L?). 


It may be remarked that if (8) holds for functions x*(¢), yo"(¢) of 
class L?; p > 0, then 


at 
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(11) log | f(#)| is of class L*. 


In fact, log | and log |1-+ are of class ([9], p. 233 for 
the case p= 2 and [7], pp. 91-92, for the case p > 0). 

Theorem (iv) will be deduced from results of Szegé [8], pp. 176-177; 
its proof will only be sketched. Actually, (iv) can be deduced from a theorem 
of Szegé [9], p. 235, which implies that if 1/f(@) is of class L* (where f is 
not assumed to be of class Z'), then 1/f(¢) has a factorization of the form 
1/f(¢) = + | with an a *() of class L?, if and only if f(¢) has 
the properties (9) and (11). This theorem of Szegd does not imply (iii), 
which concerns (8), that is, the more general factorization 1o*(¢)/(1 + yo-(¢)) 
for 1/f(¢) ; on the other hand, it turns out that this more general factoriza- 
tion is necessarily of the form 1/f(¢) = + | 2o*()|? if f is bounded. 


5. Proof of (iii). Let f(¢@) be bounded and measurable and let (8) 
have a solution zo*(¢) of class L?; so that yo-(¢) is an exterior function of 
class L*. In the Fourier expansion of 2 *(¢), the 0-th (constant) term is 
not 0, for otherwise multiplication of (8) by e-'% gives a relation which shows 
that A= 0 is in the point spectrum P(Z) of and that is a 
corresponding eigenfunction. By (ii), this means that f(¢) =0 almost 
everywhere, which contradicts (8). Therefore, it is possible to write 


(12) ~ CS anei"®, where c0, ap — 1. 


n=0 
The function 1+ y."(¢) is of class Z? and has, therefore, a Fourier 
expansion, 


(13) + ($) ~3 yo= 1. 
Let 1 = Bo, B:,- be the sequence of numbers defined by the relation 
(14) ( 3 3 
for large |z|. Thus, formally, 
(15) (1+ ~3 Bo—1. 
Let X, 9), Yo denote the three Toeplitz matrices 

(1 00.) 

l. 
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belonging to the functions 2*(—¢)/c, (1+ 1+ y0(¢), 
respectively. 
Let (8) be multiplied by e‘"? and the result written as 


(17) = + Yn ($), 


where pna(d) = ei”? + y, is an interior function which is 
a polynomial of degree n in e*? and y,~(#) is an exterior function. Then 
(16) implies that, in terms of matrix-vector multiplication, 


The matrix 9)) has a unique formal inverse, 9)'* which is the transpose of 9), 


that is, 


(18) cind 3 Bu =1; 


ef., e. g., [12], p. 280. It follows from (17) and (18) that 


j=0 


where y(n)~(¢) is an exterior function. 

Let z* = (2%, 2:,: * +) be a vector having only a finite number of com- 
ponents different from 0 and let the last relation be multiplied by z, and the 
result summed with respect to n. There results the equation 


j =0 n=0 


in which the double series is a finite sum and y-(¢) is an exterior function. 

Since the case f(¢) +0 is excluded, A = 0 is not in the point spectrum 
of &. Hence T-* exists and is a self-adjoint operator, which may or may not 
be bounded. Equation (19) implies 


j=0 n=0 
It follows therefore from (12) and (14) that 


(21) == cX (Yat) — c(YX)2*. 


In fact, the legitimacy of the necessary rearrangement of summations is trivial, 
because of the triangular forms of X and Y) and the fact that the vector 2* 
has only a finite number of non-vanishing components. Since &-? is self- 
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adjoint (in particular, the scalar product relation (Z-*x,*, = (a@,*, 
holds if z,* and z.* are vectors with only a finite number of non-vanishing 
components), it follows that the matrix cX¥) is Hermitian. Since the first 
row of is 1. B,,- - - and its first column is 1,a,,°- -, 


(22) c is real and By = Gy, n=0,1,-°- 


Since is of class (12) and (18) show that yo-(¢) )7 is 
of class Z? and, in fact, that (1+ yo(¢))7? is the complex conjugate of 
ao*(¢)/c. Hence (8) implies that f(¢) can vanish only on a zero set and 
that 


(23) 1/f(¢) =| |?/e. 


Thus, according as c > 0 orc < 0, f(¢) > 0 or f(¢) < 0 almost everywhere. 
Finally, 1/f(¢) is of class L*, since 2o*(¢) is of class L*. This proves (iii). 


6. Proof of (iv). Let f(¢) > 0 almost everywhere and let — log f(¢) 
be of class Z*. In particular, — log f(¢) has a Fourier expansion, 


where 
(25) Jn = J-n- 


For |z| <1, put 
(26) G(z) = = f (— flog ) + 2) —z) 


so that G(z) = 49. + S and put 
n=1 


(27) H+ (z) = exp(490 G(2)). 
According to {8], pp. 176-177, the fact that 1/f(#) is of class Z* implies that 


lim f | H+ (ret?) |? dp 
r>1 > 
Consequently, 


(28) Xo*(p) = exp 9*($) 


is the interior function H*(e‘*) and is of class L*. Since f(¢) is of class L', 


-T 
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it is seen in the same way that the function y.(#) defined by 1+ yo"(¢) 
= exp(—g°(¢)) is an exterior function of class On writing f(¢) as 
exp(— g*(¢) —g°(¢) ). it follows that (8) holds. This proves (iv). 


Remark 1. The construction used in this proof goes back to Plemelj [5], 
p. 210. This becomes clear if it is noted that the regular function H*(z), 
satisfying = H*(e®), is given by 


(29*) H*(z) =exp (— log ) — z) "de®, at, 


and that 1+ yo (6) = H-(e!’), where H-{z) is the regular function 


(29-) H-(z) = exp (— log f(d) ) — 


Remark 2. Let f(¢) be bounded and positive almost everywhere and, 
in addition, let f(¢) satisfy (10). Then (8) has a solution x)*(¢@) of class 
[? and 1/f(¢) possesses the factorization (23), where 


c=exp(— 4 log f(¢)d¢) ; 


2r 


ef. (12) and (28). Correspondingly, the factorization (21) for ZT can be 
written as 


(30) cY*9), 


where 9) = X* is the complex conjugate transpose of the matrix X defined 
by (12) and (16). 


The Toeplitz matrix 9) is bounded if and only if the function H*(z), 
given by (26) and (27), is bounded for | z | < 1 (ef., e.g., Appendix below; 
for references, see [12], p. 278). This is the case if and only if the real 
part of G(z) is bounded from above for | z! <1. Since the real part of @(z) 
is the Poisson integral of — Slog f(¢), it follows that 9) is bounded if and 
only if —4Jlogf(¢) const. almost everywhere, that is, if and only if 
f(¢) = Const. > 0 almost everywhere. This gives another proof of the fact 
that if A is not on the interval [m, 17]. then A is not in the spectrum of ©. 
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Part III. A counter-example. 


7. A pathological case. An example proving the following assertions 
will be given: 

Let f(¢) be real-valued and of class L? and let & be the corresponding 
(Hermitian) Toeplitz matriz. 

(ii*) It is possible that equation (7%) has a solution x)*() of class L*, 
that is, that X = 0 ts in the point spectrum of &. 

(iii*) It is possible that equation (8) has a solution xo*(@) of class L?, 
even though (9) does not hold. 


It will be seen in the proof of (iii*) that if f(¢) is real-valued, bounded 
and measurable but does not have the property (9), then (8) can have a 
solution 2%*() of class L” for every p < 2 (ef. (33)-(34) below), although, 
according to (iii), (8) cannot have a solution x*(¢) of class L?. 


8. Proof of (iii*). Let F(z) denote the regular function 


(31) F(z) = {(i+2)/(i—z)}4 for |z| <1 (F(0) =1). 
Then 
(32) lim | F (ret?) <oo for every p < 2. 


Furthermore, F(e’%) is (defined and) continuous except at @¢=—42 and 
vanishes only at ¢ =— 4x. On the other hand, the real part of (F(et%))? 
is identically 0, so that u2(¢) ==v?(¢) if P(e?) =u + iv, where u(¢), v(¢) 
are real-valued. Put 

(33) f:(¢) = = 1 according as | ¢| < 4a or < 

Then f,(¢) = sgn v(¢)/u(¢) and f,(¢)(v — iv) = u — iv except at = + Fr. 
Hence, if z,(¢) denotes the interior function — iF (e‘*) = v — iu of class .?, 
P<e, 

(34) =1 + 

where 1 + y,-(¢) =w—iv and y,-(¢) is an exterior function, since wu — iv 
is the complex conjugate of F(e’%) and F(0) =1. 


— 
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(35) fe(p) = 1/|(t— (i + ef) 

Then f2(¢) is of class L* and 1/f.(@) is bounded. Hence, assertion (iv) and 
its proof show that there is a regular function, say H,*(z), on | z| <1 which 
is bounded on |z| <1 and has a boundary function 2.*(¢) = H,*(e‘?) 
satisfying 


(36) fo($)r2*(¢) =1+ yo (¢), 


where y.-(¢) is an exterior function, and 
(37) | |? = const./f2(¢) = const. | (i — (e + ef?) |e, 


Put f(¢) = f:(¢)f2(¢), so that, by (33) and (35), f(¢) is of class L?. 
Let 2o*(¢) = 22*(¢) = — iF (e**) H,*(e*%). Then (35) and (36) show 
that (8) holds with yo (¢) = (¢) + (¢) + (¢)y2"(¢). Since 22*(¢) 
is bounded, the function 2)*(¢) is of class L® for every p < 2; in fact, 


lim F H,*(ret®) |? dp <a if p<2. 
role 


On the other hand, the boundary function 2)*(¢) is of class L?, by (31) and 
(37), and so 


lim | F(re*%) (ret?) |? dp 
r->1 


Hence 2,*(¢) is an interior function of class Z?. It is seen in a similar 
manner, that y.-(¢) is an exterior function of class Z'. In view of (33), 
this proves (iii*). 


9. Proof of (ii*). In order to prove (ii*), it is sufficient to show that 
for the function f(¢) = f1(¢)f2(¢), just constructed, the equation (8) has 
at least two solutions 2)*(¢) of class L*. For, if this is the case, the difference 
of these solutions of (8) is a solution of (7). 

In order to construct another solution of (8), note that since f1(¢) 
= sgn(v/u) and u? = v*, (34) holds if z,+(¢) = (v + iw) /(u? + v?) denotes 
the interior function i/F(e**) =i/(u+ iv) and 1+ y,-(¢) the function 
(wu + iv)/(u? + v?), which is the complex conjugate of 1/F(e‘*). Thus 
if x.*(p) is the function satisfying (36) and (37), it follows that 2*(¢) 
= = (e!%) /F(e*%) is a solution of (8), where 1 + yo (¢) 
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= (1+ 4:-(¢))(1+ y.°(¢)). The arguments used after (37) show that 
is an interior function of class Z? and that yo is an exterior 
function of class L7. 

Consequently, (8) has both functions 2)*(¢) = — iF (e‘%) 
= /F(e*) of class as solutions. Thus (ii*) is proved ; 
in fact, the difference x)*(¢) + 1/F(e**)) 40 is a 
solution of (7) and of class L?. 


Remark. It is of interest to note that the functions x*(¢), 1 + yo (¢) 
satisfying (8) in the above proof are the boundary functions H*(e'”), H-(e*%) 
of the functions (29°), if log f(#) is suitably interpreted. Thus, in the 
formula log f(¢) = log f,(¢) + log fe(¢), let log fs(¢) be real and let 
log f:(¢) be 0 or ix according as f:(¢) =1 or fi1(¢) =—1. The resulting 
function (29*) has the boundary values H+(e*?) —— iF (e‘*) H.*(e**), that 
is, the first solution of (8) constructed above. On the other hand, if log f:(¢) 
is or t according as f,(¢) —1 or fi1(¢) =—1, the resulting function 
(29*) has the boundary values H*(e'*) —iH,*(e'%)/F(e%), that is, the 
second solution of (8) constructed above. 


On the other hand, not all solutions 2*(¢) of class L? of (8) result in 
this manner. For example, if 2)*(¢) #0 is a solution of (7), then for 2 
suitable integer N > 0 and a suitable const., the function const. e‘V¢r5*(¢) 
is a solution of (8); cf. the proof of (ii). But const. e*Y%r,*(¢), where 
|¢ | <-, are the boundary values of a function which is regular for | z| <1 
and which vanishes in the N-th order at z 0, while a function (29*) does 
not vanish for | z| <1 (if log f(¢) is of class L*). 


Part IV. Unbounded T-matrices. 


10. The Toeplitz operator 2. With a given real-valued f(¢) of class 
[? and its Toeplitz matrix ©, associate the following operators on the Hilbert 
space of interior functions x*(¢) of class L*: 

Let D(X) denote the set of those 2*(¢) of class L* having the property 
that y*() in (4) is of class L? and let Y denote the operator, with domain 
D(X), defined by Tz*(d) = y*(¢). (This is equivalent to saying that D(Z) 
is the set of those vectors x* of finite length with the property that y* = Za* 
is of finite length and that & is the operator, with domain D(X), given by 
Yat y*.) 

Let D(&,) denote the set of those z*(¢) which have only a finite number 
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of Fourier constants distinct from 0; equivalently, D(X) is the set of vectors 
x* having only a finite number of components different from 0. Let Z, denote 
the operator which is the contraction of T and has P(X) as its domain. Then 
Lp is a symmetric operator and its adjoint is &, 


(38) 


Let denote the adjoint of and D(Z*) the domain of Then T* 
is the least closed operator containing Tp, and Z* is a closed, symmetric 
operator. Finally, 


(39) (40) — 


While the Laurent operator & associated with a real-valued f(¢) of 
class L? is always self-adjoint, the Toeplitz operator T need not be self-adjoint. 


11. Self-adjoint =. A careful perusal of the proofs of (i), (ii) and 
(iii) leads to the following statement: 


(1) If the assumption that f(¢) ts bounded is replaced by the assump- 
tion that f(¢) has the properties that f() is of class L? and that & is self- 
adjoint, then the assertions of (i), (ii) and (ili) remain valid. 


There exist unbounded functions f(¢) which satisfy the assumptions of 
(I). This is clear from the following assertion: 


(Il) Jf f(¢) ts real-valued, of class L* and half-bounded, then & 1s 
self-adjoint. | 


In order to verify this statement, it can be supposed, without loss of 
generality, that f(¢) 2const. >0. Then (iv), the proof of (iii) and the 
Remark 2 in Section 6 show that Z-* exists, is a bounded operator and has 
a factorization (30), where c > 0 and Y) is a bounded (triangular) Toeplitz 
matrix. Hence &* is self-adjoint. Thus (II) follows. 


Remark. The consideration of Toeplitz’s matrices was initiated by 
Toeplitz [11] in dealing with the problem of characterizing the Fourier 
constants of a non-negative function f(¢). It is curious that the Toeplitz 
operator & associated with such an f(¢) of class L? is always self-adjoint, 
by (II). 


12. Non-self-adjoint $. The function f(¢) = f1(¢)fo(¢) constructed 
in Section 9 shows that a Toeplitz operator & need not be self-adjoint. In 
contrast to (ii), the following is the situation in such a case (since & is the 
adjoint of a symmetric operator) : 
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Whenever & is not self-adjoint, every (non-real) number A in at least 
one of the half-planes ImA> 0 or Imd < 0 is an eigenvalue of &. 

Actually, in the particular case of the function f(¢) =f1(¢)f2(¢) of 
Section 8, every non-real number A(ImA > 0 and ImA < 0) is an eigenvalue 
of t. For, by (33) and (35), both f,(¢) and f.(¢) are even functions of ¢, 
and so f(#) is an even function of ¢. Consequently, T is a real matrix, so 
that = 0 implies —AY)#* 0. Since A= 0 is in the point 
spectrum of &, the Corollary on p. 230, of [1] leads to the following: 


(1*) There exists a real-valued function f(¢) of class L* such that the 
corresponding Toeplitz operator Z is not self-adjoint but possesses a self- 
adjoint contraction having a non-empty point spectrum. 


In contrast to (ii), assertion (i) can be formulated so as to be valid for 
all Hermitian Toeplitz matrices T, whether or not & is self-adjoint. 


(III) Let f(¢) be a real-valued function of class L? and let m (= —o) 
and M (So) be its (essential) lower and upper bounds. If m< p< M, 
then & —pS does not possess a bounded inverse, that is, X=0 ts in the 
spectrum of the self-adjoint operator (X* — pS) zB). 


It is clear from (1) that if either m or M is finite, then p = m or p= M 
is allowed in this theorem. 

The equivalence of the two forms of the assertion of (III) is a 
classical result of Toeplitz. Whether or not }=0 is in the spectrum of 
— (T* — will remain undecided. 


13. Proof of (III). In view of (1), it is sufficient to prove (IIT) 
in the case that T is not self-adjoint (so that m==-—o and W=—x) and 
that p = 0. 

Suppose, if possible, that ZT is not self-adjoint but that A= 0 is not in 
the spectrum of T*T. In particular, X = 0 is not in the point spectrum of &. 

For any closed operator T possessing an adjoint T*, the spectra of T*T 
and IZT* are identical except for the possible occurrence of A =O in the 
point spectrum of one of the operators I*T, TI* but not in the point 
spectrum of the other; [1], p. 234. Thus, for the case at hand, where 
=D T* and A=0 is not in the point spectrum of T*Z, the spectra of T*T 
and TI* are identical. Hence A= 0 is not in the spectrum of TT*. 

Consequently, the closed, symmetric operator T* has an inverse (X*)-}, 
which is closed and bounded on its domain. The domain of (*)-1, that is, 
the range of &*, is dense in the Hilbert space (of interior functions 2*(¢) 
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or vectors 2+), for otherwise A = 0 is an eigenvalue of the adjoint ({*)* = &. 
Since (X*)-1 is closed and bounded on its domain, the latter is a closed set. 
In other words, the range of T* is the entire Hilbert space. Consequently, 
&* is self-adjoint. This leads to the contradiction that Z is self-adjoint. 

The fact that is a Toeplitz operator was not used in the proof of (III). 
Hence, the following theorem has also been proved: 


Let & be the adjoint of a closed, symmetric operator &*. If T ts not 
self-adjoint, then, for every real p, the spectrum of (X* — yp) (T—pzY) 
contains = 0. 


This theorem is known if &* possesses self-adjoint extensions (Theorem 
(iv) in [1], p. 236) and was first proved for certain differential operators 
in [2]. 


Appendix. 


In Section 1, a matrix T was called a Toeplitz matrix (or a U-matrix) 
if it has the form © = (frm), where n,m —0,1,2,---. Originally, Toeplitz 
had only considered the Hermitian case (5), fn—fn. Later on, he also 
considered the case where f,—0 if n <0, so that the matrix is of the 
triangular form 


fo fi fe 


(a1) 


A matrix of this form Toeplitz called a -matrix. 
In what follows, it is assumed that (1) holds but it is not assumed that 
the corresponding T-matrix is Hermitian or of the form (41). 


(*) A Z-matrix T —T(f) belonging to a function f(¢) of class L? is 
bounded if and only if f(¢) is bounded. 


Actually, (*) is a consequence of Toeplitz’s assertion on the Hermitian 
cases. In order to see this, note that: a matrix is bounded if and only if its 
complex conjugate transpose T* is. Put 


(42) T—F,+iF., where T,—— fi(T— 2%). 


Hence, it is seen that © is bounded if and only if ©, and &, are. On the 
other hand, if — X(f), then — T(Re f) and TY, — T(Im f) are T-matrices 
belonging to real functions and, hence, are Hermitian. Thus, according to 
Toeplitz, {, and ZT, are bounded if and only if Ref(¢) and Imf(¢) are 
bounded, respectively. Thus (*) follows. 


| 
ly 
t 
0 
ne 


THE SPECTRA OF TOEPLITZ’S MATRICES. 881 


It was conjectured in [3], p. 361, that the spectrum $(Z) of a bounded 
Hermitian matrix © (belonging to a real-valued f(¢)) is associated with the 
distribution function of the (real-valued) harmonic function 


(48) = 2 £(6) (1— 4) + 


(r<1), 
that is, 


(44) P(r, $) = & farinletna, (r <1). 


This conjecture has now been justified in a certain sense. For, according 
to (i), S(X) is the closure [m, M] of the range of values of F(r,) on the 
unit circle r< 1; cf. (1) and (III) for the case of an unbounded f(¢). 

From this point of view, (i) is the analogue of a result in [12], p. 279, 
stating that the spectrum S($8) of (41) is the closure of the set of values of 
the corresponding function (44) on r< 1. Im the case of the $-matrix (41), 
where f, = 0 if n < 0, (44) is the power series 


(45) P(r, $) = F(z) =3 fre", (2 — ret). 


According to [12], pp. 279-280, if the complex number yw is not in the closure 
of the set of values of (45) for |z| <1, then the $-matrix belonging to the 
sequence of coefficients h* = (ho, hi,- - -) of the power series 


(F(z) —p) = hint", Le} <1, 


is bounded and is the inverse of §$— pj. On the other hand, if F(z) =p 
for some z, | z| < 1, then \ is an eigenvalue of and 2* = (1, z, - - -) 
is a corresponding eigenvector. (This last fact contrasts with (ii), according 
to which a bounded Hermitian Z matrix has no eigenvalue, unless f = const.) 

The problem of determining the spectrum S(Z) of a T-matrix which is 
neither Hermitian nor triangular will remain open. 
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SOLUTION OF SOME PROBLEMS OF DIVISION.* ' 


PART I. Division by a Polynomial of Derivation. 


By Leon EHRENPREIS. 


1. Introduction. The purpose of this paper is to prove that every poly- 
nomial of derivation with constant coefficients P is completely inversible as 
regards distributions of finite order, i.e. for any distribution S of finite order 
there is a distribution 7 of finite order such that 


(1.1) PT = 


This means that division, in the sense of convolution multiplication, by a 
polynomial of derivation is always possible. The particular choice S =6 
(Dirac’s measure) shows that every constant coefficient partial differenial 
equation possesses an elementary solution. We show also that if S is an 
indefinitely differentiable function, T can be chosen to be an indefinitely 
differentiable function in (1.1). 

The above results are extended to systems of cme differential equations 
with constant coefficients. 

The method of proof is the following: We study the space of Fourier 
transforms of the space J) of Schwartz. Call this set D; we give it the 
topology to make the Fourier transform a topological isomorphism. D con- 
sists of all entire functions of exponential type which are rapidly decreasing 
by a theorem of Paley and Weiner (see [4] and [5]). The Fourier transform 
then defines, in a natural manner, a topological isomorphism of JY (the space 
of distributions) onto D’ (the dual of D). In this way, we define the Fourier 
transform of any distribution as an element of D’. The search for an 
elementary solution is thus transformed into the problem of division by a. 
polynomial. The heuristic argument is as follows: We know that we can 
divide by a polynomial where the polynomial is large. Also, a polynomial 
cannot have “too many zeros,” hence cannot be small too often. Now, the 
space D consists of entire functions; we make our division where the poly- 
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nomial is large and then use the maximum modulus theorem to show that 
things don’t get too bad where the polynomial is small. 

We are now preparing several papers in connection with the material 
presented here. In the first we shall study in detail the Fourier transform on 
the space D’, and the structure of the elements of D’. In another paper we 
shall study the structure of the elementary solutions in regard to regularity and 
growth at infinity. For example, we shall show that, for P any polynomial of 
derivation with constant coefficients, and a@ any positive number, we can find an 
elementary solution of P which, in some sense, increases less rapidly than el! 
Moreover, if the highest derivative that occurs in P is of order k, and if P 
is a partial differential operator of n variables, then there exists an elementary 
solution which is no worse than a derivative of order n+k+3 of a 
continuous function. In another paper, we shall solve other problems in 
division. For example, we shall show that, if S is a distribution whose Fourier 
transform is an entire function of finite order, then division by S in the sense 
of convolution is possible in a certain space of distributions. 

The methods of our paper, as well as most of the results of Sections I-V 
were obtained in April 1953 and were announced at that time in reports to 
the National Science Foundation. Since then, Malgrange [3], [3a] has 
obtained the corollary to Theorem 5 by a method which also uses the theory 
of analytic functions. Malgrange has shown how Theorem 5 implies the 
existence of an elementary solution and even the division property in 9,’ 
(space of distributions of finite order). We include in Section IV a brief 
account of this part of Malgrange’s work (Theorem 6). 

Malgrange has also obtained our Theorem 10 by a different method. 
However, he does not obtain Theorem 9 which is apparently more powerful. 

The author wishes to express his thanks to the referee for acquainting 
him with the work of Malgrange, as well as for his other useful comments. 


2. Preliminaries. Let n be a positive integer which will be fixed 
throughout this paper. By C we denote the complex Euclidean n-space, and 


by R we denote the real part of C. For ze (*.): we shall often write 


complex 
eal 
let p; be a point in C, pi= *, Pi,) Where each pj, +1 (For 
convenience we denote by p: = (1,1,:--,1).) Then by Cé we denote the 
subspace of C consisting of those points z= (21, 22, * -,2n) such that, for 
all j, &(z;) =0, or sgn X(z;) sgn p; where, for a a complex number, 
02 (a) denotes the real part of a, and X(a) denotes the imaginary part of a, 
and for b a real number 0, sgn bd is the signum of b. 


== (21, Z2,° **, %n) where the 2; are( ) numbers. -For = 1, 2,---, 2", 


f 


~ ~ ~_ Ay 


at 


SOLUTION OF SOME PROBLEMS OF DIVISION. 885 


For & a positive number, we shall denote by C;, the strip of width & 
around R in C, i.e. the set of all z= (a, %,° with | &(2;)| Sk 
for all j. 


Let F be a function on ‘¢ 


(1,2,--+,), and let a;, = be ( 


let r<n; let jx ja jr be a subset of 


complex 
eal 


complex 
we denote the function on P Eu- 


clidean n—r space whose value at any point (2;,, %,,%%,.,), Where ((t1, t2, 
-+,%-r) are those indices which do not appear in (ji,°--°,jr)), 18 
F(w1, * *,Wn) Where w;, = aj, if k = jp, or we If = ty. 

We shall review, rapidly, those properties of the theory of distributions 
of Schwartz [5] which we shall need in our paper. For convenience, we shall 
depart slightly from the notation of Schwartz. 

For 1 a positive number, denote by R; the closed cube in R, center at the 
origin, and side 2/. By &, we denote the space of indefinitely differentiable 
functions on R whose carriers are contained in R;. (The carrier of a function 
is the closure of the set of points where it is different from zero.) A sequence 
of functions f; converges to zero in J); if for P any partial differential operator 
on FR with constant coefficients, Pf; —>0 uniformly on R. It is known that 
G, is a complete, metrizable topological vector space» We shall, unless 
“vector space 


) numbers. Then by 


“vector space ” to mean 


explicitly stated to the contrary, use 
over the complex numbers.” 

By choosing a sequence of numbers 1; with 1;->00, the spaces Y);, form 
a sequence of definition for a space 9) of type (YF) in the sense of Dieu- 
donné and Schwartz [2]. Q is a complete topological Hausdorff space 
which is not metrizable. A function f on R is in Y) if and only if f lies in 
some Q),, i.e. if and only if f is an indefinitely differentiable function with 
compact carrier. A fundamental system of neighborhoods of zero in ) 
consists of those convex sets N for which NV QM Q) is a neighborhood of zero 
in 9); for all 1. 

By GY we denote the dual of Y), that is, the space of continuous linear 
functions on G) with the topology of uniform convergence on the bounded 
sets of G). Unless explicitly stated otherswise, “linear” will be taken to 
mean “linear over the complex numbers.” The important point to note is: 
Let § be a linear function on D whose restriction to G), is continuous on 
9), for each 1; then Se GY. Thus, to verify that a linear function on Q) is 
continuous, we need show only that, for any J, if {f;} is a sequence of func- 
tions in 9), which converge to zero in G), then S(f;) > 0. 
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If P is a partial differential operator with constant coefficients, (we 
sometimes say “ P is a polynomial of derivation ” if this is the case) then P 
clearly induces a continuous linear map of 4) into ) defined by f— Pf 
for any fe¥). We shall denote by P” the adjoint of P, that is, for any 
Sey, feQD, P’S(f) =S (Pf). It is easily verified that P’ is a continuous 
linear map of GY > Jy. 

By “dz” we shall denote the ordinary Lebesgue measure on F divided 
by (27)"?. Then for any fe YQ), we define the Fourier transform F of f as 


the function on C, F(z) -f where for zeC and 


== (When we omit the region of integration, 
it is to be assumed that this region is R.) F is an entire function of 
exponential type, and if fe YQ), F is of exponential type =/. Moreover, F is 
rapidly decreasing on FR, i.e. for Q any polynomial on C QF is bounded 
on #. On the other hand, if F is an entire function of exponential type <1 
which is rapidly decreasing, then the function f on FR defined by 


f(a) = J, F(y)et-vdy, 


is in G). Moreover, Fourier’s inversion formula holds: for feQ), 


if F(z) then f(z) F(y)e** 


We shall make the following convention of notation: Lower case letters 
will be used to denote functions of Y) and the corresponding upper case 
letters will be used to denote their Fourier transforms. 

Let P be a partial differential operator with constant coefficients on 4). 
Then for fe), there is a polynomial Q on C such that 


In this case, we write Q = F(P). Conversely, given any polynomial Q on C, 
we can find a polynomial of derivation P with Q =F (P). 


3. The space D. For any 1 > 0, we denote by D, the set of Fourier 
transforms of Y). By what we said above, D, consists of all entire functions 
of exponential type <1 which are capidly decreasing on R. We shall define 
in D; a topology to make the Fourier transform ¥ a topological isomorphism 
of g), onto D;. The following extension to n dimensions of a result of 
Phragmén and Lindeléf (see [6]) is needed. 
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Lemma 1. Let F be an entire function of exponential type S1 which 
satisfies | F(x)| SM for all ee R. Then for all zeC/, 


| F(z)exp(ilp;-z)| SM. 


Proof. We shall adjust our notation to the case 7 —1, the other cases 
being handled similarly. We use induction on n. For n=—1 the result is 
the known theorem of Phragmén and Lindeléf. Assume n > 1 and that the 
result is valid for n—1. Let a,eR; consider F’x,-, It is clear from the 
definitions that 

sup | Fx,<0,(z)| SM, 


zeRn-» 
where Rin1) denotes the real Euclidean n—1 space. Thus, for C*(n4), the 
analogue of C* for complex Euclidean n—41 space, by our induction 


assumption, 
sup |[exp il(2e-+ 24) ]Fxi-a(2)| <M. 
zeC)(n-» 
Now, let (22,° - -,2n) be any point of C*(»1), and consider the function 


of one complex variable z—> [exp il(Z2 ++ + Zn) | By 
the above, 


sup | [exp Ul | (4) | = M. 


Thus, by the result for n —1, 

|Lexp (21 + + Xn=en(#) | SM, 
or, what is the same thing, 

sup |[exp + 22+ ++ (a1, 22° +, 2n)| SM. 


Since this holds for all (z2,- - +, 2n) € C*(n-1), we have 
sup |[exp + 2+: +--+ 2,)]F(z)| SM, 
zeC! 


which is the desired result. 
From this lemma it follows that, for any a > 0, any polynomial P on C, 
and any function fe D, PF is bounded in C4. 


THEOREM 1. Let {fi} be a sequence of functions of Gy. A necessary 
and sufficient condition that f;—> 0 in Q) is, for any k > 0 and any polynomial 
Q on OC, QF; —>0 uniformly on C,. It is sufficient to have, for some k > 0 
and every polynomial P on C, PF;—>0 uniformly on C;. 


Proof. a. Sufficiency. Assume that, for some k > 0 and any polynomial 
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Q on C, QF;—> 0 uniformly on C;,. It follows immediately from Cauchy’s 
formula for the derivatives of an analytic function of several complex variables, 
that, for D any operator of partial differentiation with constant coefficients, 
and Q any polynomial on C, D(QF;) ~ 0 uniformly on R. Thus, F;->0 in 
the space 3 of Schwartz. It follows that f; > 0 in d (see [5]). Since the f; 
all have their carriers in R,, this means that f, 0 in Y). 


b. Necessity. Assume f;—>0 in Y,; let Q be a polynomial on R, and 
k>0. We know (see [5]) that Fj; 0 in d, and that the F; are all of 
exponential type <1. Thus, by the definition of the topology of 3, QF; 0 
uniformly on R; by Lemma 1, for any § with 1S 8 S 2", 


| Q(z) F;(z) exp ilps: z|—>0, 
uniformly for z eC, a fortiori for ze CSN Cy. Now, 
min | expilpgs-z| 
zeC3snc, 


Thus, QF;—0 uniformly on C5 for all 8S; this means that 0 
uniformly on C;,. 


For any positive number & and any polynomial Q in n variables, we 
define the function v;,g on D; by 


sup | Q(z) F(z)|; 


for any Fe D,; it is clear that the v;,9 are semi-norms. We use these v;,9 to 
define a topology on D,; we denote the space obtained again by D,, so that 
D, is a locally convex topological vector space. If Fe D,, F ~0, then there 
isa ze with F(z.) #0. Thus, 

= sup | F(z)| 2 F(z) >9, 
so that D, is Hausdorff. If Q is any polynomial, and k a positive number, 
there are integers A > 0, m > 0 so that, for all ze Cy, 


The totality of polynomials of the form A + [ > 2;7]™ is certainly a denumer- 
4=1 


able set J. It is clear that the topology of D; can be defined by means of 
semi-norms v;,3 for & rational and SeT. We need thus only a denumerable 
number of semi-norms to define the topology of D,, which meahs that this 
space is metrizable. 

Theorem 1 can now be restated as 
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THEOREM 2. The Fourier transform F is a topological isomorphism 
of GD, onto D, for each 1 > 0. 


Since each Y); is complete, the same is true of D;,, and these spaces are 
of type (F) (see [2]). It is clear that, if i > j, the topology induced by D; 
on the set of functions of D; is just the topology of the space D;. We may 
thus use the spaces D; for i integral as a sequence of definition for a space D, 
the inductive limit of the spaces D; in the sense of Dieudonné and Schwartz 


[2]. Set theoretically, D—= |) D,, i.e. a function F on C is in D if and 
é=1 


only if F is an entire function of exponential type which has the property 
that QF is a bounded function on R for every polynomial Q in n indeter- 
minates. The topology of D is defined as follows: A fundamental system of 
neighborhoods of zero in D consists of all convex sets N such that, for each 
1>0, NAD, is a neighborhood of zero in D, D is locally convex and 
complete, but not metrizable. From our Theorem 2 and Proposition 5 of [2] 
we obtain 


THEOREM 3. The Fourier transform § is a topological isomorphism of 
Q) onto D. 


By D’ we denote the dual of D with the topology of uniform convergence 
on the bounded sets of D. The important thing to know about D’ is: (see [2] 
Proposition 5) Let S be a linear function on D which is continuous on each 
D,; then S is continuous on D, i.e. Se D’. Since each D, is metrizable, in 
order to verify that S is continuous on D,, we need consider only sequences. 
For sequences, Lemma 1 gives us an explicit description of convergence. It is 
by means of this explicit description that we are able to show the existence 
of an elementary solution. 

Let us denote by F the adjoint of ¥-1, so that F is the map of G’ > D’ 
defined by (FS) (F) = S(f), for any Se It is natural to call F 
the Fourier transform on JY. (This will be discussed in a forthcoming 
article.) Thus, the Fourier transform of any distribution, in particular, of 
any locally summable function, is defined. From Theorem 3 we obtain easily 


THEOREM 4. F is a topological isomorphism of GY onto D’. 


We want to compute F(8) where § is the Dirac measure, that is, the 
measure consisting of a mass of + 1 placed at the origin. We have 


(FS) (F) =8(f) =f(0) = fF(x)dz, 
so that #8 — 1 where 1 is the element of D’, 1(F) = fF («)da, for all Fe D. 


| 
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4. Existence of an elementary solution. 


Lemma 2. Lei there be given m points 2, %2,° + *,%m im the complex 
plane. Then about any complex number a we can draw a circle L of radius 
= 1 so that, for any xeL, min | 2 1/2(m +1). 


Proof. We may certainly restrict ourselves to the case where a is the 
origin. Let - -,2; lie in or on the unit circle and %m lie out- 


side the unit circle. Then we arrange the = r+ 2 points 0, | 2, |,- - -, | a,], 


1 on the unit interval 7. We have thus decomposed J into at most r+ 1 
subintervals; it follows that one of these subintervals is of length at least 
1/r+1. Let b be the midpoint of this interval, and let LZ be the circle, 
center origin, and radius 6. Then for any re L, 1—1,2,---,n. 
a |)| 21/2(r+-1) 21/2(m+1). 


Lemma 3. Let Q be a polynomial in one indeterminate, 


Q(x) = QoX™ + +° + On. 


Then, there is a constant a0 (depending only on m) so that, about any 
complex number a we can draw a circle M of radius = 1 for which 


| Q(x)| Qo | 
forallxeM. (It is allowed that Qo =0.) 
Proof. For Qo = 0, the result is trivial. Let Q) 0 and write 
Q(X) = — a1) (X — az) (X — am), 


where 2,,:--,2» are the zeros of Q. Then, by Lemma 2, we can draw 
about aa circle M of radius <1 so that | e—a;|21/2(m-+1), for all 
weM,1Sism. Thus, for M, | Q(r)| =| Qo) 1/2"(m+1)". The 
result follows by choosing 2 = 1/2"(m-+ 


The main lemma used for the proof of the existence of an elementary 
solution is the following: © 


Lemma 4. Let Q bea non-constant polynomial in the letters X,,---+.Xn. 
Letr>1 and FeD satisfy 


| Q(z) F(z)| Sa. 


Suppose X» occurs in Q, and write Q=Q.Xp™+---+Qm, where the 
Q; are polynomials in Then we can find 
a B>O0 such that 
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sup | Qo(z)F(z)| Saf. 


zeCr+ 

Proof. We may suppose without loss of generality that p—1. Let 
Zi, be complex numbers with | &(z,)| Sr—1,---,| 
<r—1. By Lemma 3, we can find a number @ > 0, and a circle M with 
centers z, and radius = 1 so that, for all ze M, 


Thus, for ze M, 
| Q(z, +, 2n) F(z, %n)| <a, 
because (2, *,2n) On choosing = we find 


for all ze M. 


We have two cases: If Qo(22,° Zn) 0, we certainly have 
| Qo(225° Xn=en(%1) | S 
If Qo(22,* %n) then for all ze M 
| Xn=en(2) | SS @B/| Qo(2e,° |. 


Since F'x,-2.,---,xX,-:, is analytic, we get, by the maximum modulus theorem, 
| Fx, =20, +++, (21) | = aB/Qo (225 Zn). 
Thus, in any case, | Qo(22,° SaB. Since +, 2n) 


was an arbitrary point of C,., the lemma is established. We can now state 
our main result: 


THeorEM 5. Let Q be a polynomial different from zero in n tmdeter- 
minates, and let {F';} be a sequence of functions in D. Suppose that QF,— 0 
in D; then F;—>0 in D. 


Proof. We use induction on the number & of variables which occurs 
in Q. For k=0, Q is a constant ~0 and the result is trivial. Assume 
then that k > 0 and that the result is proven for all polynomials in k —1 
indeterminates. We may assume without loss of generality that X,,- - +, Xx 
actually occur in Q and write 


Q(X, » Xx) Qo(X2, + + Qm(Xo, ° 
where ~ 0. 
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Under our hypotheses, we know (see [2]) that the functions QF; all 
lie in some Q), and that QF;—0 in Y);. Since the exponential type of QF, 
is the same as that of Fj, the F; also lie in D,. Theorem 1 tells us that, for 
any r > 0 and any polynomial P on C, PQF;— 0 uniformly on C,. Lemma 4 
shows us that the same is true for the sequence {Q,F}, that is, for any r > 0 
and any polynomial P in n indeterminates, Q,.PF;— 0 uniformly on C,. This 
means that Q,f;—>0 in D,. By our induction hypothesis, F; > 0 in D, which 
implies F;—> 0 in D. 


Corotuary. Let P be an operator of partial differentiation with constant 
coefficients on G). Suppose {fi} is a sequence of functions of Q) such that 
Pf, >0 in Then also fi >0 ing. 


For the sake of completeness, we shall show, briefly, how Malgrange uses 
this corollary (or rather, a similar result) to obtain elementary solutions, and 
division in a certain space (see [3b]). 

For any 1 > 0, s=0, denote by &,* the space of functions on R which 
are s times continuously differentiable and which have their carriers in K;. The 
topology of 4),* is defined by means of the semi-norms vp(f) = 9 | (Df)()| 


for f e Dj? and D any partial differential operator with constant coefficients of 
order =s. By Y* we denote the inductive limit of the Q,* for 7 integral. 
Gy* denotes the dual of 9)*; this is the space of distributions of order s. We 
write Fr’ = |) G* so 9’ is the space of distributions of finite order. 


Suppose P is a partial differential operator with constant coefficients of 
order k. Then by P),*** we denote the subset of );* consisting of all functions 
of the form Pf for fe Q,**; the topology of PG),*** is that induced by 4’. 


THEOREM 6. For any SeQr’, there is a Te with PT =S8. In 
particular, P has an elementary solution in Q)y’. 


Proof. Suppose that S§ is a distribution of order s. By reasoning 
analogous to that in the proofs of Theorem 5 and its corollary, we can show 
that Pf — f is a continuous linear map of +9. Thus, T: Pf > 
is a continuous linear function on P&)s****, Let K be the set of functions 
g © PGs*4** for which 


max 9(7) | SL 

for all p:,- - -, pn such that +--+ --+ Then, for each integer 
l, Ki= KN X)** is a compact set in PY),**** by the theorem of Ascoli (see 
[5]). For each integer /, let U; be the set of f ¢ K; which satisfy | T-f | <1, 


4 

H 
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and call v the convex closure of U U;. Then, by Proposition 1 of [3b], v is a 
neighborhood of zero in PG)****, Further, for fev, | ie i |<1. Thus, T is 
continuous on P9)s**** and hence can be extended to a continuous linear 
function 7 on 9)*** by the Hahn-Banach theorem. For any fe), P’T:f 
—T:-Pf=T-Pf=—S-f. Thus, PT=S; if we take S=5, T is the 
elementary solution for P. 


5. Division in the space Gr’. We shall give an independent proof of 
Theorem 6 by means of the Fourier transform, without using the notions 
of Malgrange. 

We introduce a topology in Dr = N B® by saying that a set N in Op is 
a neighborhood of zero if it is the intersection with Gr of a neighborhood of 
zero in some )°. (The elements of Gp are the same as those of ), but the 
topology of Y) is stronger than that of Gr.) It follows immediately from the 
Hahn-Banach theorem that Q)r’ is the dual of Gr. By Dr we denote the 
space of Fourier transforms of functions of Gyr with the topology to make 
F : Dr — Dy a topological isomorphism. Dry consists, of course, of all rapidly 
decreasing entire functions of exponential type. 


THEOREM 7%. The topology of Dr can be described as follows: A funda- 
mental system of neighborhoods of zero in Dy consists of all sets N C Dr 
for which we can find a finite sequence of polynomials Ar and 
an increasing sequence of positive numbers a,—>0 so that N consists of all 
Ge Dy which satisfy, for each integer 1, 

(5. 1) max | (exp tlp;- z) Ax (z) G(z)| S1, 


zeCinCa, 
for k==1,2,---,r, and 
Proof. Suppose first that N satisfies the conditions stated in the theorem. 


Then is clearly convex, and for any integer m > 0 and any Ge Dn, the 
conditions 


max |(exp ilp;: z)A;(z)G(z)| 


zeCInCa, 


for any k, j7, and 1 > m are already implied by the conditions 


max 


zeCInCa,, 


for k =1,2,---,r, and j=1,2,---,m because of the Phragmén-Lindelof 
theorem (Lemma 1). Thus, by the definition of the topology of Dn, NN Dn 
is a neighborhood of zero in Dm. It is clear from the proof of Theorem 1, 
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that the fact that we are using a fixed set of polynomials A,,- - -,A, for 
the definition of NV means that we can find an s so that, for all m, N’ N YQ), 
is a neighborhood of zero in Q),,%, where N’ is the set of inverse Fourier 
transforms of the functions of N. Thus, by the definition of the topology of 
Dy, N is a neighborhood of zero in Dr. 


Conversely, let MW be a neighborhood of zero in Dr, and denote by M’ 
the set of inverse Fourier transforms of functions of M. Then, we can find 


a finite set of polynomials of derivation B,, B2,- --,B; and a decreasing 
sequence of numbers },—>0 so that WM’ contains the set of all fe) which 
satisfy 
(5. 2) max |(B,f)(«)| <b, 
agKnipi 
for all 7 and k ~ 1,2,---,t, and also 
((5. 3) | (Bef) S1, 
for k = 1,2,---,¢. (This can be shown exactly as the analogous description 


of the topology of 4) (see [5]) is demonstrated.) 

Let us denote by the polynomials 4n, 4nF(B;), 
for k=1,2,---,¢ and j=—1,2,---,m. We define the 
sequence {a;} by a; max(1,—log b,;). Let N be the set of Ge Dr which 
satisfy, for each 1, 

(5. 4) max | (exp S1, 

zeCINnCa, 
for k =1,2,---,r and j=1,2,---,m; let N’ denote the set of inverse 
Fourier transforms of N. We claim that N’ C M’. 

It is clear that any ge N’ satisfies (5.3). Let ge N’ and 
Let j be chosen so that (see Section 2) sgn p;, = sgn for k = 1,2,---,n. 
It is clear from Caucy’s theorem that we may write g(x) = § G(z) exp (iz: z)dz, 


where a, is the plane &(z,) —ayp;, for k =1,2,---,n. Thus, 


= G(z)exp z)exp(i(e—Ip,) de. 


Now, since 2# K¢ns1)1, for at least one k, x, > (n+1)l. Thus, for ze%, 
(2 —1p;) = (nl)a,— (n—1)la, = la; This means that, for ze %, 
| exp i(a—Ip;)-z| Se". Moreover, it is clear from (5.4) that 


f | G(z) (exp ilp;:2)| dz=1. 


Thus, | g(z)| S 


of 


fo 
th 


| 
as 
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A similar argument shows that, for =1,2,---,¢, |(Bug)(x)| Sb. 
This means that ge M/’ which is the desired result. 


Let Q be a polynomial 0 in n letters. We denote by QDr the space 
of all QG for Ge Dy with the topology induced by Dr. 


THEOREM 8. QG—G is a continuous linear map of QDr > Dr. 


Proof. We use induction on the number & of letters that occur in Q. 
For k = 0 the result is obvious; assume the theorem is true for all polynomials 
with fewer than / letters, and let Q be a polynomial in & letters. We may 
assume that Y, occurs in Q and write Q=Q,4"°+9,X"'+:---+9,, 
where the Q; are polynomials in fewer than & letters. The result will follow 
if we can show that 0G — Q,@ is a continuous linear map of QDr > Q.Dr. 


By linearity, we need prove continuity only at zero. Let then N be a 
neighborhood of zero in Q,.Dr. By Theorem 7 we can find polynomials 
+, A, and an increasing sequence of positive numbers a; so that 
N contains all Ge @Q,Dr which satisfy, for each 1, 

max | (exp ilp;: z)Ax(z)G(z)| S1 
zeCinCa, 
fork =1,2,---,rand j—1,2,---+,n. Let B be a constant which satisfies 
the conclusions of Lemma 4. For each 1, write af —=a,+1 and call 
A,’ =1/(eB)Ax. Let N’ be the set of Ge QDr which satisfy, for each lI, k, j, 
max | (exp S1. 
zeCInCa, 
Then, as in the proof of Lemma 4 we can show that, for QGe N’, QGeN 
which concludes the proof of Theorem 8. 


For P a polynomial of derivation, denote by PQ) the space of Pf, fe Dr 
with the topology induced from Gr. Then we have an immediate 


Corottary. Pf—f is a continuous linear map of > Dr. 


We can now furnish another proof of Theorem 6: Let S e 9)r’; then the 
map 7’: Pf» S-f is a continuous linear function on PQ» being the com- 
position of the continuous maps Pf—f and f—-><S-f. Thus, by the Hahn- 
Banach theorem, 7’ can be extended to a continuous linear function on )p, 
ie. to a TeQr’. We have, for feDr, 
Thus, P’7’ = which is the desired result. 
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6. Division of the space €. € is the space of indefinitely differentiable 
functions. €’, its dual, is the space of distributions of compact carrier (see 
[5]). We shall use the following characterization of the space €’ which will 
be established in our forthcoming paper, “Analytic Functions and the Fourier 


Transform of Distributions ”: 


Lemma 7. €’ consists exactly of those distributions S e GY for which the 
convolution S*feQ) for any fe YG). Moreover, the topology of €’ is the 
same as that obtained by considering every Se €’ as defining the continuous 
linear transformation f >S*f of J—>Q), and then giving this set of trans- 
formations the compact-open topology. That is, a fundamental system of 
neighborhoods of zero in €’ consists of those sets N for which we can find a 
compact set K C 4) and a neighborhood N of zero in Y) so that N consists 
of all Se €’ for which S*feM for all fe kK. 


Let us denote by €;’ the space of all SeQ)r’ such that S*feQ)r for 
all fe Gr; we consider each SeQ)r’ as defining the continuous linear map 
f—>S*f of Dr and we give the space €,’ the compact-open topology 
of this space of transformations. By Lemma 7, the elements of €,’ are the 
same as those of €’. Moreover, if K is a compact set of Q», then K is clearly 
a compact set in each 9)*. Thus, by [2], X is contained in some Y;. Since 
Q)r and Y) induce the same topology on 4), K is compact in J). Thus, the 
topology of €’ is stronger than that of E,’. 

We conclude from the above that, if we denote by €r the dual of Ey’, 
then for any U ¢ €y we can find an f e € such that, for all Se €r’, U- S =8°f. 


Lemna 8. For any fe€,S—S8-f is a continuous linear function on Ef’. 


Proof. The linearity being obvious, we need prove continuity only at 
zero. For each lattice point ye R, let L., be the closed cube, center y, side 4. 
Let {hy} be a sequence of functions of G such that, for each y, carrier 
h, C Ly, and, for any ce R, Sh,(x) =1 (this sum has meaning because 
only a finite number of terms are different from zero). For each y, and 
ge€, denote by the function (7,9) (x) =g(a—y) for any re R. For 
each y, choose a, > 0 such that a, = a_,, and 

ay= (2), 
weR,15jSi 
where {Q;} is an enumeration of the polynomials of derivation on R with 
rational coefficients, and {y,} is an enumeration of the lattice points cf R, 
and where y = yi. 
Let K be the closure in Q) of the set of all functions 1/(ay)ry(hyf) 80 
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that K C Q), and K is clearly bounded in 9),. Thus (see [5]) K is compact 
in J). Let N be the neighborhood of zero in 4r,consisting of all ge Yr such 
that, for each y 

max | g(x)| S by/max(1, a,). 


Where {b,} is a sequence of positive numbers with }b,y—1. Call M the 
set of Se Er’ such that S * pe N for all pe K; we claim that, for any Se M, 

To this end, let S ¢ /, and suppose that the carrier of § is contained in K;. 


Let be chosen so that hy,(v) =1 for any ce Then S: f= hif. 
j=1 g=1 


But, 
| S-hif | = |(S * hyf) (0) | = | * ty, (if) ) (ys) | S 


Thus, |S-f|S > bm <1, which completes the proof of the lemma. 


Let P be a constant-coefficient partial differential operator. By P’E,’ 
we denote the space of all P’S for Se €,’ with the topology induced by Er’. 
It follows immediately from the theorem of Paley and Wiener (see [5]) 
that P’ is one-one. Our main result of this section is: 


THEorEM 9. P’S—S is a continuous linear map of Ep’. 


Proof. Since the map is clearly linear, it suffices to prove continuity at 
zero. Let P’G be a filter base in P’Ey’ which converges to zero in P’E,’ 
We denote by G the filter base obtained from P’G by replacing each distribution 
P’S of P’G by S; the filter bases P’G * f for f e G) are defined similarly. Then, 
if K is any compact set in Gf, P’G * f converge to zero in G)r uniformly for 
fe kK. Thus, G* Pf converge to zero in Gy uniformly for fe K. But, for 
any Se H’, Pf =P(S*f) (see [5]). Thus, P(G*f) converge to zero 
in 9) uniformly for fe K, which is the same as the uniform convergence of 
P(G*f) in PHp. Now, Pg—g is a continuous linear map of P|r > Dr 
by the corollary to Theorem 8. Hence G * f converge to zero in Gr uniformly 
for fe K. This means that @ converges to zero in Ep’ which is the desired 
result.? 


We can now use this result to show that division is always possible in 2: 


* Footnote added in proof: In our forthcoming paper, “Analytic Functions and the 
Fourier Transform of Distributions,” it will be shown that, in fact, the spaces €,’ and 
€’ are the same. 
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THEOREM 10. For any fe€ there isa ge€ with Pg =f. 


Proof. The map g: P’S—S-f is a continuous linear function on P’€,’ 
because it is the composition of the continuous maps SS -f, 
Since P’E,’ is given the topology induced by €r’, g can be extended to a 
continuous linear function g on €’ by the Hahn-Banach theorem. By our 
above remarks, there is a ge € such that g-S =S-g for all Se Er’. Thus, 
for any Se€r, S:-Pg=P'S-g=—=9:-PS=g9:-PS=S-f. Because of the 
Hahn-Banach Theorem, Pg =f. 


7. Systems of equations. We shall extend the results of the previous 
section to systems of constant-coefficient paftial differential equations. Let N 
be a positive number which will be fixed throughout this paper. By 9). we 
denote the space of N-tuples of functions of 4); the topology of 4)" is that 
of coordinate-wise convergence. For any / > 0 we denote by Y);‘ the space 
of N tuples of functions of 4, with topology of coordinatewise convergence. 
It is clear that 4‘ is the inductive limit of the spaces Q);" for 7 integral. 
By G** we denote the set of continuous linear mappings of 4). into complex 
N-space. Thus, every element S of G)** can be represented by an NV by N 
matrix (Si;) where each Sie)’ and where, for f= (f.,- --,fv) e Dy, 


N 
S(f) = (9:1: with Sif for each 1; the topology of is 


that of elementwise convergence of these matrices. (By “matrix” we shall 
mean “ N by N matrix” unless explicitly stated otherwise. ) 

Let % = (Pi;) where the Pj; are polynomials of differentiation. Then % 
defines a continuous map of by Pf= (g:,- where, for 


N 

each i, g => Pi;f;. By 9’ we denote the adjoint of %, this is the map 

of > with (B'S) (f) = for all Se feQG’. By an 


elementary solution for we mean an Se such that — 8, where 
§ 0 


— 


is the generalized Dirac measure. 

For any fe QD’, f = (f1,- - -, fx), we define the Fourier transform of f: 
F = $%(f) +,Fn). Thus, maps onto DN (space of N- 
tuples of functions of D) and for any 1>0, ¥% maps G,% onto DX (N- 
tuples of functions of D,). We give the spaces DN and DW the topology of 
coordinate-wise convergence and ¥¥% is clearly a topological isomorphism. 


| 
i 
4 

i 
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Thus, D¥ is the inductive limit of the D;, for / integral. By D*” we denote 
the set of linear transformations of DY into complex N space. An element T 
of is thus a matrix with each and, for Fe DY, 


N N 
T(F) = (> TyF,- - 
g=2 


By F% we denote the adjoint of (#%)-1, that is, FY is the map of 
— defined by (FYS)(G@) =S(qg), for any Se ge (We 
again use the convention of representing an element of G)* by a lower case 
letter, and using the same upper case letter for its Fourier transform.) For 
fe we shall write f = (f:,- - -, fx) where all with a similar con- 
vention for DY. FY is again a topological isomorphism of G)*" — D*¥, which 
we can consider as defining the Fourier transform on Y)*%, 

Let S = (Sij) eQ**. Then we can write 


{Su for k=i+N(j—1). 


N2 
S= where Ti 0 otherwise. 


k=1 
We compute FY(T®): For Fe DY, FX(T®) (F) = T®(f) = Sij(f;), where 
k=t+N(j—1). Call Uij—F(Si;); for =1,2,---,N*, call N® the 
matrix for which V;;“ is Ui; or 0 according as & is or is not of the form 
i+ N(j—1). Then 


FN(T®) (F) = 8ij(fi) = F (Sy) (Fs) = Uy (Fi) = V (PF). 


Thus, FX(T) = By the linearity of FY we get FY(S) =(F(S V)). 

By a similar reasoning, if $$ — (Pi) where the Pj; are polynomials of 
derivation, then for fe DY, FY(¥f) = OF, where © is the matrix of poly- 
nomials Qj; such that, for ge G), F(Pijg) = QijG, and where, for Ge 


N N 
i= j=1 


For any ze C, we shall denote by Q(z) the matrix (Qi;(z)). We shall write, 
for convenience, Q =F‘ ($) when Q and are so related. Conversely, 
given any matrix © of polynomials, we can find a matrix $ of polynomials 
of derivation such that Q = F(). 

If 8 = (Pi;) where the Pi are constant-coefficient partial differential 
operators on Y), we can form the determinant of $8, which is a constant 
coefficient partial differential operator on G). We say that B is degenerate 
if this operator is 0; otherwise we call 98 non-degenerate. If Q is a matrix 
of polynomials in n-letters, then we can form the determinant of Q which is 
a polynomial. We call Q degenerate if this polynomial is zero; if it is different 


; 
2 
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from zero, we say Q is non-degenerate. From the above it is clear that, for 
9%$ a matrix of polynomials of derivation on 4, and Q = F%($), then Q is 
degenerate if and only if $ is and, even F (determinant 3) = determinant ©. 
For any Fe 9)’, ze C, we define 
(UF I) = (Fil) 


Theorem 1 has the consequence: 


THEOREM 10. Letl > 0 and {F;} be a sequence in GY. A necessary and 
sufficient condition that F; > 0 in ts: For any r > 0 and any polynomial 
Q on C, | Q(z)| || Fi(z)|| uniformly for ze C,. 


It is possible to carry out the procedure of Section 4 and obtain the 
analogue for J)" of Theorem 5. In order to do this, we had originally used 
a method of estimating the size of the characteristic roots of a matrix of 
polynomials in n-variables. However, following the suggestion of the referee, 
we shall adopt an algebraic approach. 

Let 8 = (Pi;) where the Pi; are polynomials of derivation in n vari- 
ables; assume that P is non-degenerate. Then we have, for any fe), if 


N 
BF = (91, 9n), D> = gi, for 1== 1, 2,---,N. Thus, by Cramer’s 
j=1 


rule, if A denotes the determinant of $8, and Aj; the cofactor of Pi, 


N 
(7.1) AiiJi, 
for j==1,2,---,N. 
Let us denote by BQ) the space of Bh for he GV; the topology of BG 


is that induced by GY. The spaces for 1 > 0 are defined similarly. 
Then (7.1) together with the corollary to Theorem 5 give immediately 


THEOREM 11. Let $ be a non-degenerate matrix of polynomials of 
derivation in n variables. Then Bf—f is a continuous linear map of 
PQN for each 1> 0. 


Let us denote by Gr the space of elements of GV with the topology of 
coordinatewise convergence in Gr. By Qr** we denote the set of mappings 
of Gr into the complex N-space. Each Se G)r*N can be represented by a 
matrix (Si;) where each Sije Dr’. 


THEOREM 12. Any non-degencrate matrix % of polynomials of derivation 
is completely inversible in that is, for any Se Q)r*N theréisa Te 
such that ¥’T = 8. In particular, 8 has an elementary solution. 


i 
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Proof. This can be proven in a manner similar to the proof of Theorem 
6, but we shall use a different method because we can use it for further 
results. Let U = (Ui;) © QDr*% where each UyeGr’. Then we have 


N 
(PU) y= > Pij Ui, because, if h = +, hw) DY, 
k=1 


N N N 
PU -h=U-Ph=D> > => hy, 


N 
j k=1 


j= 

which is the stated result. 

Now, if we call U,;—Uj; where b=i-+ N(j—1), then it is easily 
seen that the transformation $8’ on the space of (U,) is of the form 

where $8’ is the matrix (P;;). Since, for D,, D, any polynomials of deriva- 
tion, (D,D.)’ = D,’D.’, the determinant D of P is different from zero. Let 
us define V, for b= 1,2,---,N? by 


N2 
(7. 2) 
a=1 


where Dy is the cofactor of the term in the a row, b column of P. By 
Theorem 6 we can find T,¢9Y such that D’T,—V>». By. Cramer’s rule, 
which is the desired result. 


By Br we denote the space of Bf for feDr’ with the topology 
induced by Gr. Exactly as in the proof of Theorem 11, we can show 


THEOREM 13. With B as above, Bf—>f is a continuous linear map of 


The spaces are defined similarly. By we denote the 
space of all $’S with Se €p*%, the topology being that induced by €*.° 
Then we have, as in the proofs of Theorem 11, 12: 


THEOREM 14. is a continuous linear map of $’Ep*N — Ep*. 


THEOREM 15. For any fe €N there isa ge€N which satisfies Bg =f. 


* Footnote added in proof: Again it can be shown (see footnote 2) that €,** is the 
same as €*" (which is defined in the obvious manner). 


| 
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Suppose now that $$ is degenerate. Since the ring of polynomials of 
derivation has no zero divisors, we can procede along usual lines (see [7]) 
to obtain an f eG). such that $f—0. Moreover, we can find a polynomial 
of derivation Y such that every Yf for f ¢ Q can be written uniquely in the 
form f’ + f” where f’e (null space of and f’eN” (on which acts 
in a non-degenrate manner). Let $Qr’ be the space of Bf, fe DY, with the 
topology induced by 9)y’. Then we get, in the usual manner, 


THEOREM 16. $f —f” is a continuous linear map of > Dr’. 
We can also prove, by conventional methods 


THEOREM 17. Let SeQr*’; a necessary and sufficient condition that 
there exist a Te QGr* satisfying P’T =S is that S=0 on N’. 


CoroLttary. If is degenerate, there 1s no elementary solution for 


Proof. Let fe)’ satisfy Bf—0. Since derivation and translation 
commute, we may assume f(0) 0. Thus, 


f = (f1(0), f2(0),° fv(0)) 
and the result follows from Theorem 17. 


We can prove similar results for €%. 


8. General remarks. In Section 4 we proved that ZL: PQr—>Qr 
defined by L(Pf) =f for all feQr is a continuous linear map. In case 
that n = 1 and P is an ordinary differential operator with constant coefficients, 
then Schwartz [5] has shown that L can be extended to a continuous linear 
map M:Qr—>Qr. This results from the fact that the quotient space 
G)r/PQ)r is finite dimensional. Such is no longer the case for n > 1 and P 
arbitrary. We do not know whether Z can be extended in the general case. 
Such an extension would be a generalized integral. 

The above question can be reformulated as follows: Call (PQ)r)’ the 
dual of PQ) with the topology of uniform convergence on the bounded sets of 
PQ. Then the adjoint L’ of L is a continuous linear map of Q)r’ > (PQ) ?’. 
(L’ is defined by (L’S)(Pg) =S(LPg) for any geQr.) Can we lift L’ 
to a continuous linear map of Q)r’—>Q)r’? Or, what is the same thing: 
We know that every element of (PQ))r’ can be extended to an element of 9)r’; 
does there exist a canonical way of extending the element of (PQ) r’ to Dr’? 
That is, can we find a continuous linear map I: (PQ))r’ > Q such that, 
for T « (PQ))r’, M(T) is an extension of 7’? We hope to discuss this question 
at a future date. 
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Similar remarks apply to the space Er (Theorem 9). 


An interesting question is raised in the statement of Lemma 3. Let us 
denote, more precisely, by «(m) the constant a which arises in that Lemma. 
What is the best (greatest) possible value of a(m) as a function of m? The 
question may be reformulated as follows: Given m points 2,,° + +,%m in the 
unit interval, what is the largest possible value of «(m) for which we can be 


sure of finding an z in the unit interval such that } |c—2,|2a(m). The 
4=1 


methods of our proof of Lemma 3 which, of course, were not designed to find 
a best possible result, show that we may choose a(m) =1/2™(m-+1)™. 
My friend, Dr. Phlotz Newman, has pointed out to me that the best possible 
result is a(m) = 2/4" (see Polyé and Szegé, Aufgaben und Lehrsatze aus 
der Analysis, Berlin, 1925.) 


All the theorems on division that we have proven involve the division 
by a polynomial in a space of analytic functions. An important question is 
the following: If P is a polynomial in n-variables and Sey, can we find 
a Te Gy such that PT = S? It is clear that our methods shed no light on 


this problem. 
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A PERIODIC ALGORITHM FOR CUBIC FORMS II.* 


By Harvey Coun. 


15. Introduction to Part II. In this continuation of our discussion 
of a cubic sign-discrimination algorithm, introduced in an earlier volume of 
this JouRNAL [7], we shall turn our attention to the computation of units. 

First of all we shall show (leaning once more on the Minkowskian 
algorithm) that all units are discoverable by the “neighbor process” of 
Part I [7], pp. 829-833. In so doing we must restrict our lattice to a module 
M of preferably (but not necessarily) all integers in a totally real cubic field 
R. The field & need not be abelian but the module Xt must be non-singular 
(i. e., of dimension three over the rationals). The units would then lie in 9 
the order of the module Mi. 

Secondly we shall consider the manner in which the units should be 
written. As we observed in [7], p. 833 certain 3 X 4 matrices are propor- 
tional only under certain rearrangements of rows and columns. This makes 
it desirable to have a hypercomplex notation, which also brings out normal 
symmetries of the matrices. 

In comparing our modified algorithm with its Minkowskian progenitor 
we start with the realization that what we are doing is refining the algebraic 
properties of the algorithm at the expense of the geometric properties, but 
the algebraic simplifications make numerical work? more feasible while the 
geometric complications make themselves felt only in existence type proofs 
which are disposed of here. This dichotomy is illustrated in the geometric 
and algebraic portions of the present work. 


Existence of Units. 


16. Types of neighbors. We consider a reduced matrix ¢ in the sense 
of 4 (above) consisting of four column vectors which we shall write as 


* Received February 5, 1954. Research sponsored by the U. S. Army Office of 
Ordnance Research under Contract No. DA-20-018-ORD-12332. The sections and biblio- 
graphical refernces are numbered consecutively with those in item [7] which formed 
“Part I” of the present work. Errata for Part I are appended to the present paper. 

+The present method is being programmed for the EDVAC at Aberdeen, Md., by 
Dr. Saul Gorn. 
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We shall classify the 24 neighbor possibilities (explained in 10 above) into 
three types depending on the partition of the symbols serving as second and 
third element in the neighbor operation : 


Type 1 Type 2 Type 3 
P,8 or ow R,S or P,Q 
(QPS), (PQR) (PQS), (QPR) (PRS), (RPQ) 
(16.2) (RPS), (SQR) (RQS), (SPR) (QRS), (SPQ) 
(QSP), (PRQ) (PSQ), (QRP) (PSR), (RQP) 
(RSP),(SRQ) (RSQ), (SRP) (QSR), (SQP) 


Lemma. Every reduced matrix @ has either one or two (reduced) 
neighbors of each type. 


Outline of Proof. We must consider the subdivision into cases as deter- 
mined by the relative magnitudes of each row: (|p |,|q|,|7%|,|s |); 
(compare 11). There are actually twenty-four different cases (to within 
elementary interchanges of rows and columns and changes of sign in rows). 
The details are omitted in the interests of brevity we simply remark that 
this method was used in the (weaker) statement made earlier [7], p. 830 
that there are from three to six neighbors. 


17. Occurrence of independent units. The units deduced from the 
“controllable ” in the sense that if we take a 
succession of neighbors to one Minkowskian triple by “lowering the same 


Minkowskian algorithm were 


face” (say z;) each time we would obtain a unit whose 2; conjugate is less 
than unity and whose other conjugates are greater than unity in absolute 
value. If units are formed corresponding to two different 2; they are inde- 
pendent by the classical argument of Dirichlet. 

The same thing can be done here, except that the “face ” corresponding 
to z; belongs to none of the vectors in the quadruple (16.1), e.g., it can © 
happen that the maximum of | p, |, | ,| 8: | cannot be decreased 
through any neighbor operation. We need to use other quantities namely 


(17.1) hi = uj, 
where u; are the components of U“): 
(17. 2) UM —P+ 8, VUA—Q+sS, 8. 


(The h; are positive if we suppress the “ + ” in the left hand side of (4. 22) 
as we now do for convenience). 


(16. 1) [P, Q, R, 8]. 
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Lemma. A neighbor operation of type 1 decreases hy while it increases 
(or leaves the same) the other two h; (j #1). 


Proof. It suffices by the very abundant symmetry to note that the opera- 
tion (QPS) of type 1 changes h, (= p, + s,) into s; (< h,) while it changes 
hs (=13 + 83) into rs + s3 — ps (> hs) and leaves hz (= gz + 83) unchanged. 
Q. E. D. 

We note that the triple (U,U®,U®) is not Minkowskian since it 
determines a sub-lattice of index 2 but it is a kind of “invariant” for the 
quadruple (16.1) by equation (4. 21). 

Thus by virtue of the finiteness theorem of 8 (above) units are deter- 
mined which are independent. (Compare [3], p. 444). 


18. Occurrence of all units. Consider next how an arbitrary unit in 9, 
namely H = (¢,, é2,e;), (taken with positive norm for convenience), can be 
discovered by our algorithm. If we form the product 


(18. 1) = [PE, QE, RE, SE] 


we obtain a quadruple of vectors lying in Qt (by definition of ©), and 
spanning 9 again (since the sublattice spanned by ¢F has determinant equal 
to that of ¢ times the norm of #). By a rearrangement of columns (vectors) 
in oH we therefore obtain another reduced quadruple. Therefore all we need 
show is that ¢ and ¢£F are connected by a chain of neighbor operations (as 
in 12 above). 

We introduce the term overlapping to describe two sets of vectors such as 
triples or quadruples, etc., with two or more common elements. 


Lemma 18.1. Two overlapping quadruples can be connected by means 
of neighbor operations. 


Outline of proof. Let the two overlapping quadruples be doo and da,» 


(for integral a,b) where 
(18. 2) 
R@») —R+aP + bQ, S@) — § —aP —bQ. 


Now the requirement that ¢,,, be reduced leads to three sign conditions on 

and 8@) (—=—R—(a+1)P—(b+1)Q). Thus 
fi(at+i1,b+1) <0 fi(a,b) <0 

(18. 3) flat1b+1)<0 fr(a,b) <0 

fs(a+1,b6+1) <0 fs(a,b) > 0, 


— 
_ 
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where (under restrictions (4.21) and (4. 22)), 
(18. 4) fi(a, b) + api + bq. 


It is then a tedious but elementary matter to show (by representing the six 
inequalities (18.3) graphically) that if the relations (18.3) hold for (a, 6) 
then a and b are not both of the same sign, while the inequalities (18.3) hold 
with (a,b) replaced by (a,b + sgna) or (a+sgnb,b). (Here sgn0 = 0). 
Thus Lemma 18.1 follows by descent. Q. E. D. 


LemMA 18.2. To any reduced quadruple (or to one connected with tt 
by a chain of neighbors) there corresponds at least one overlapping Minkow- 
skian basis. 


The proof of this lemma will be deferred to the next section; (see 
Lemma 19. 3 below). 


LemMA 18.3. Any two neighboring Minkowskian bases each overlap 
overlapping reduced quadruples. 


For it is the nature of Minkowski’s neighbor concept, [3], that any two 
neighboring triples overlap; but by the theorem in 5 these triples are con- 
tained (discounting sign) in a reduced quadruple if they are bases. 

Now we use the further information that any two Minkowskian triples 
are connected by the Minkowski neighbor process. The proof is based on a 
topological argument suggested by Minkowski (and proved in [2], p. 342). 
Since every degenerate Minkowskian triple has non-degenerate triples (i. e.. 
basis triples) as neighbors, it takes very little effort to deduce from the same 
proof that any two Minkowskian bases are connected by a chain of Minkowskian 
bases as neighbors. Hence the lemmas yield the following result and with it 
the “ discoverability ” of all units: 


THEOREM. Any two reduced quadruples can be connected by a chain of 
neighboring (reduced) quadruples. 


19. Proximity of algorithms. In Section 5 (above) we showed that 
every Minkowskian basis is part of a reduced quadruple, which means, in 
effect, that every Minkowskian triple overlaps a reduced quadruple. The 
“converse” seems to be false, indicating that there are “more” reduced 
quadruples than Minkowskian triples in some sense perhaps analogous to the 
sense that the quadratic algorithm we introduced in 2 (above), possessing 
“mediants ” in addition to regular convergents of the simple continued frac- 
tion contains “more” reduced triples (see Section 25 below). Lemma 18. 2 
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seems to be as close as we can comfortably approach a converse to the 
theorem in 5. 

To get at the difficulty we define a mediant quadruple to be a reduced 
quadruple in which one vector achieves in each of its three components the 
minimum absolute value for all four vectors of the quadruple. Thus if (say) 


(19.1) <min| pil, | qi|,!s|, 1=iS3, 
then [P,Q, R,S] is a mediant quadruple. Here we would call R& the small : 
vector. 
We call a set of vectors © limiting if each (non-zero) vector achieves an 
absolute value in some 2;-component not exceeding that of a vector of G. ( 
Then by the Minkowskian method of admissible bodies if a set of limiting 
vectors contains at least three vectors none of which can be omitted from © 
(without loss of limiting property) then a Minkowskian triple can be chosen 
from the set S. If S consists of two vectors none of which can be omitted ; 
from © (without loss) then a Minkowskian triple can be chosen that over- , 
laps ©. 


Now mediant quadruples are undesirable because they do not have two 
vectors that serve as a limiting set (by virtue of the fact that the small vector 
can be omitted without loss of limiting property). Hence we must use 


Lemma 19.1. No complete chain of reduced neighboring quadruples 
can consist entirely of mediants. : 


Proof. Suppose R is the small vector of a mediant ¢. Then any 
neighbor operation with R “in the middle” is valid. Take in particular 
(PRS): We can apply the neighbor operation only a limited number of 
times, e. g., | 


(19. 2) - 


(since the operation decreases p,; (> 0) by | 7, | each time). Now if 
is still a mediant, the small vector must be Q (because it is no longer “R” 
and yet | p.| and | s,| now exceed ). We therefore apply (PQS) as 
long as possible until we obtain a .¢™”). This process must terminate since 
the operations (PQS) and (PRS) each decrease p, (>0) by | qi | or | | 
each time. Q. E. D. 

- We are now ready to handle the non-mediant quadruples. We are again 
confronted with a multitude of special cases that would be tedious to give in 
detail. We therefore first make use of limiting sets. 


Te 


Lemma 19.2. For any reduced quadruple the twenty-six vectors formed 
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from the following and their negatives are a limiting set: 
P, Q, R, S, P— R, Q—8, R—S8, 
P+8S, Q+8, R+8. 


Proof. Let V=aP+0Q+cR. Since V = (b—a)Q + (c—a)R—aS 
by (4. 21), we can assume a, b,c are not all of the same sign. We therefore 
can select a component j for which | v;| =| ap;| + | bq;|-+ | cr; |. Hence 
the result. Q. E. D. 


(19. 3) 


Lemma 19.3. If a reduced quadruple is not a mediant then it contains 
a limiting set of two vectors (and therefore overlaps a Minkowskian basis). 


Outline of proof. An actual listing of the twenty-four different types 
of @ according to the relative magnitudes of the rows (see 16), would show 
that only two types are mediants. The remaining twenty-two types reveal 
limiting sets on inspection, but the amount of space required to handle the 
inequalities in writing is almost prohibitive. It should be clear however that 
Lemma 19. 2 expresses the problem in finite terms. 


Hypercomplex Notation and Normal Symmetries. 


20. Reduction redefined. The distinguishing algebraic features of our 
present algorithm can best be brought out by considering the hypercomplex 
system % over the module Yt generated by the introduction of elements 
(a@,, @2,a3,1) that form the four-group: 


(20. 1) a;? = 1(— aja; = aja; a;, 1S+AjA~k SB. 
Then each 3 X 4 matrix ¢ can be written in the form 


(20. 2) ¢ = Pa, + Va, + Ra; + S81. 


In the system 9 the “scalars” are elements of Mt for which each conjugate 
multiplies (row by row) the 3 X 4 matrix ¢. Thus if M were an order, 
would be an “ algebra.” 

Now the element ¢ of 9 will be called a reduced basis of Yt (see 4 above) 
if any three of the quantities P,Q, R, S form a basis of Mt and (see (4. 21) ) 
if @ goes to 0 by the mapping of 9 that brings each a; into 1 and if (con- 
tinuing to suppress the “+ ” on the left of (4.22)), each a; conjugate of 
P,Q, R,S agrees in sign with xy; of a, respectively, where the 
(t= 1,2,3) are the three-non principal characters of the four-group, fixed 
so that y;(a;) = 1. 


12 


{ 
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Thus the finiteness theorem of 8 (above) asserts that the system may 
have only a finite number of reduced basis elements to within (scalar) propor- 
tionality constants. The argument in 9 enables us to choose these constants 
as units. 

The hypercomplex notation fits in with the classification into neighbor 
types. If we use the symbol (la,a.) to denote, for example, (SRQ), then 
a neighbor of type i (here type 1), is one in which the product of the second 
and third elements of the neighbor operation (e. g., a3 * a2), equals a; (here a,). 
The new notation enables us to obtain another very curious kind of result 
that we mention in passing: The sum or product of any two reduced elements 
¢:, ¢ belonging to two modules in & is equal to reduced elements belonging 
to a third module (which depends on the operation and elements). 

Our main purpose in using the notation will be for dealing with units. 
(Compare the hypercomplex notation use by Hasse [8] for a somewhat 
different analysis of cubic units). 


21. Hypercomplex units. Now we shall designate the units H =(e,, é2, es) 
of © that are not totally positive as multipliers in 9%. What we do is select 
the character y; for which y,;(a;), (1 SiS 3), agrees in sign with e; (assuming 
the norm of F positive as before). Then the product a; denotes the hyper- 
complex unit corresponding to # and for any reduced basis element ¢ of M, 
the product ¢a,/ yields another reduced basis element. This process would 
reduce to ordinary (scalar) multiplication for FH totally positive or totally 
negative. 

Now let & be the (abelian) group of all hypercomplex units so defined. 
Let &, be the sub-group of totally positive units. Then the nature of the 
quotient group &/@®, is uniquely determined by r the signature rank, i.e., 
the number of different arrays of signs that can be achieved by the scalar 
units (of positive and negative norms). Then for 


8, [a,E, 1], 
(21.1) r=44,6/6,=4 [ak™, 1], 


2, [1]. 
Note that by the eigenvalue discussion in 14 (above), only these hyper- 
complex wnits can emerge from proportionalities of reduced basis elements 
under interchange of columns with rows preserved. 


22. Mixing of rows. Consider two reduced basis eleménts ¢ and ¢* 
of 2. We are now concerned with proportionalities of one row of ¢ with 
another row of ¢*, accompanied by the necessary permutation of columns. 


( 
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For instance, a (deceivingly trivial) illustration is provided by ¢ = $* = do 
of 11 (above), where the first row is proportional to the second (with ratio 1) 
under cyclic permutation of P,Q, R. Such proportionality evidently could 
occur only for & abelian since otherwise p:/q: could equal (say) g2*/r2* only 
when both ratios (belonging to conjugate fields) are rational, in contradiction 
to the basis property. Actually we can say more: 


THEOREM. Two reduced basis elements of Mt can have different rows of 
each proportional (with some permutation of columns) if and only tf the 
module MN is equivalent to a normal module. 


Proof. Suppose such a proportionality of rows exists. Then the second 
row of ¢* is (say) &, times the first row of ¢ (with rearrangement of columns), 
where k, is an element of the (abelian) field . Likewise the third and first 
rows of ¢* (by using conjugates) are seen to be k, and k, times the second 
and third rows of ¢ (with rearrangement of columns). Thus K = (hi, ko, ks) 
is an element of & with norm N(K) —1 by virtue of the fact that ¢ and ¢* 
are basis elements of the same module. But by a well-known theorem of 
Hilbert [9], &,, having norm 1, will be the ratio of conjugate elements, which 
(by multiplication with a rational integer) can be chosen to lie in ©. Thus 
k, = t,/t. where 7’ = (¢,, ¢., ts) is in © and has positive norm. Hence ¢a;T 
and ¢*a;T' are for some 1 (0 =i 83), reduced basis elements of system AT 
corresponding to M¢7'—MN. Now M is easily the normal module required. 
The converse is immediate. Q. E. D. 


CoroLttaRY. A single reduced basis element o of Mt will have one row 
proportional to another row (with permutation of columns) if and only if 
the module Mt is equivalent to a module N with a normal basis that further- 
more consists of the conjugates of an element P of N with positive norm and 


negative trace. 


The preceding proof applies except that we display the basis element by 
letting Q and R be conjugates of P and letting S be the negative of the trace 
of P. 

The theorem and corollary are (strictly) more restrictive, i.e., not all 
abelian modules are normal, nor are all normal modules equivalent to one 
with a normal basis, nor must the normal basis have the desired sign on its 
norm and trace. 

As a final remark note that in both the theorem and corollary the order 
O-of M is the same as that of MN and therefore a normal module in &. Hence 
by a well known Minkowskian technique (see [8]), it follows that funda- 
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mental units of D can be taken as two conjugates of a single unit. Thus the 
signature rank r (of 21 above) may equal only 2 or 8 (but never 4). 


23. Use of Galois group operator. We can now enlarge our (abelian) 
hypercomplex system to a non-abelian one in the following manner: Consider 
Z the Galois group operator for our abelian field K (so that Z>—1), Then 


(23. 1) PZ = (pr, po, ps) Z = Ps, Pr)- 


Now the neecssity of preserving the sign pattern requires that a permutation 
of rows be balanced by a permutation of columns as follows: 


(23. 2) $Z — (RZ)a, + (PZ)a; + (QZ)a; + (SZ)1, 
Formally, then, we derive the commutator rules 
(23. 3) a,Z = Za;, ad = a,Z Za,, 1 Z = Z 1. 


Thus the juxtaposition of symbols Z‘g (where 0 = ¢< 2 and g belongs to 
the four-group), will have meaning as a (right) operator on ¢ and will in 
fact be isomorphic to the alternating group on four elements. 


Now if we consider a module equivalent to a normal module, then for 
every hypercomplex unit (0 Sj = 3), (of 21 above) we have three units 
belonging to the new hypercomplex system, namely (0 StS 2). In 
this manner the group @* of hypercomplex units in the enlarged system is 
determined from the group @, of totally positive units as follows (compare 
(21.1)): 

8 [1, Z,Z°] a,EZ?,1 
(28.4) r= 64/6 — 


Note that for r= 8 the fundamental units are shown as conjugates of a 
single unit while r 4 does not occur, in accordance with the conclusion 
of 22 above. In this way we obtain a group &* whose multiplication rules 
can be given as follows: 


(23. 5) (Z'a;E) (Z"a,E’) = 
where E” = (EZ“)E’ and a,—a,Z-“a;Z". 


Now, the only possible proportionalities that mix rows are those coming 
from the group &*, i.e., based on units, for the cube of any element of * 
is an element of ©, hence based on a scalar unit. (See section 21). 


ne 
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24. Degenerate quaternary forms. On the basis of the last three sec- 
tions we are in a position to discuss the quaternary form corresponding to 
¢@ namely 


(24. 1) = (mpi + mogi + + m45;), 


Several different @ can correspond to a single ¢ under permutation of the 
(m;), according to the signature rank r of the units in the order ©. 


Case I: Mt not equivalent to a normal module. Here r= 2,4, or 8 and 
every ® is one of a set of r/2 (i.e., r/2 different “ points” in the “ chain ” 
diagrams of 12). 


Case II: Mt equivalent to a normal module. Here r=2 or 8 and 
every ® is one of a set of 3r/2 except when the ¢ is proportional (row by row) 
to one in which three of the quantities P,Q, R,S are conjugates. (In the 
exceptional case ® is one of a set of r/2 again). 


In the table [7, p. 832] we find evidence of some of these cases for 
r= 8. Case I corresponds to A? = 148, while Case II (or its exception) 
corresponds to the proportionality classes that occur by twelves (or by fours). 
Note that the module for A* 196 is neither an ideal nor normal, yet the 
correspondence “by fours” means it must be proportional to a normal 
module (with even a normal basis). The numerical work is being carried 
much further than in Part I, but it will be the subject of a separate report. 


25. Concluding remarks. In Part I, Section 2, a modified quadratic 
simple continued fraction algorithm was outlined. It turns out to be more 
conventional than first thought, in fact it is equivalent to the continued 
fraction algorithm with the “ mediant ” fractions thrown in. The geometrical 
construction given in 2 can be seen to be a variation of Smith’s construction 
[10]. 

In Part III, the concluding portion, we shall treat the algorithm as a 
means of discovering “small norms” (analogously with the Pell Equation). 


Errata for Part I (i.e., [7]). 
Page 828 line 7, a should read a, 


line 18, should read 
830 line 16, —1.25 should read 1.25 
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832 line 4, —p?[— should read — p 
833 lines 8-10, by twelves should read by at most twelves 
by fours should read by at most fours. 
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ERRATUM. 


C. T. Rajagopal, “ On Riesz summability and summability by Dirichlet’s 
series: a further addendum and corrigendum,” this JoURNAL, vol. 76, 252-258. 


Page 256, line 9 from above insert, in front of “—>1,” the factor 
“exp — (ro + 1).” 


